PAN Yi-qian, WEI Si, WANG Ren-hua. Tone Evaluation of Chinese Continuous Speech Based on Prosodic Words[J]. Journal of Chinese Information Processing, 2008, 22(4): 88-93.
Tone Evaluation of Chinese Continuous Speech Based on Prosodic Words
PAN Yi-qian, WEI Si, WANG Ren-hua
Man Machine Voice Communication Laboratory (iFlytek Speech Laboratory), University of Science & Technology of China, Hefei, Anhui 230027, China
Abstract: Tone evaluation of continuous speech is a key part of Mandarin Chinese pronunciation testing. Exploiting the close correlation between prosodic structure and the contextually modified tone contour, this paper presents a Multi-Space Distribution Hidden Markov Model (MSD-HMM) built at the prosodic-word level for tone evaluation. Experimental results show that, on standard Mandarin continuous speech, the proposed Mandarin Chinese Pronunciation Evaluation System improves from 82.0% to 84.6% on the tonal syllable error rate measure. On non-standard Mandarin speech, the correlation between the computer score and the expert score improves by more than 3.0% absolute over a baseline system without tone evaluation.
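For readers unfamiliar with the multi-space distribution idea, the following is a minimal sketch of how a single MSD state could score one F0 observation, assuming the common two-space setup: a one-dimensional Gaussian for voiced frames and a zero-dimensional space for unvoiced frames. It is an illustration only, not the authors' implementation; the function name `msd_f0_log_likelihood` and all parameter values are hypothetical.

```python
import math


def msd_f0_log_likelihood(obs, voiced_weight, mean, var):
    """Log output probability of one F0 observation under a two-space MSD state.

    obs is a log-F0 value (float) for a voiced frame, or None for an
    unvoiced frame. voiced_weight is the weight of the voiced space;
    the unvoiced space gets the remaining weight. All names here are
    illustrative assumptions.
    """
    if not 0.0 < voiced_weight < 1.0:
        raise ValueError("voiced_weight must lie strictly between 0 and 1")
    if var <= 0.0:
        raise ValueError("var must be positive")
    if obs is None:
        # Unvoiced frame: the space is zero-dimensional, so the output
        # probability is just the weight of the unvoiced space.
        return math.log(1.0 - voiced_weight)
    # Voiced frame: space weight times a 1-D Gaussian density.
    log_gauss = -0.5 * (math.log(2.0 * math.pi * var) + (obs - mean) ** 2 / var)
    return math.log(voiced_weight) + log_gauss


# Hypothetical state parameters: 80% voiced, log-F0 mean 5.0, variance 0.04.
voiced_score = msd_f0_log_likelihood(5.1, voiced_weight=0.8, mean=5.0, var=0.04)
unvoiced_score = msd_f0_log_likelihood(None, voiced_weight=0.8, mean=5.0, var=0.04)
```

The point of this formulation is that voiced and unvoiced frames are scored by the same state without interpolating F0 across unvoiced regions, which is one reason such models are used for tone contours.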