Text normalization is a procedure to generate information , such as pronunciation , rhythm and so on , for special symbols correctly. In this paper , a method based on hierarchical , external rules is presented. By matching rules , we can recognize normal special symbols and generate correct information. This paper introduces the concept of analysis tree firstly , then shows the steps of constructing rules and presents the experiment results. The results show that we can achieve easy-maintainability and easy-expandability , and the correct rate of open test is 99.76%.
CHEN Zhi-gang,HU Guo-ping,WANG Xi-fa.
Text Normalization In Chinese Text-To-Speech System. Journal of Chinese Information Processing. 2003, 17(4): 46-52
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Richard Sproat. Multilingual text analysis for text-to-speech synthesis [C] , ICSLP'96. [2] Richard Sproat , Alan Black , Stanley Chen , Shankar Kumar , Mari Ostendorf , Christopher Richards. Normalization of Non-Standard Words [C] : WS'99 Final Report (1999) . [3] Wu Xiaoru. Special Text Processing Based External Descriptor Rule [C] , ICSLP'2000. [4] Andrew Breen ,Barry Eggleton. Refocussing on the text normalization process in Text-to-speech Systems [C]. ICSLP'2002. [5] Mehryar Mohri ,Richard Sproat. A Efficient Compiler for Weighted Rewrite Rules [C]. Meeting of the Association for Computational Linguistics ,1996. [6] 陈意云. 编译原理和技术[M] . 合肥:中国科技大学出版社.