朱维彬. 支持重音合成的汉语语音合成系统[J]. 中文信息学报, 2007, 21(3): 122-128.
ZHU Wei-bin. A Chinese Speech Synthesis System with Capability of Accent Realizing. , 2007, 21(3): 122-128.
支持重音合成的汉语语音合成系统
朱维彬
北京交通大学 信息科学研究所, 北京100044
A Chinese Speech Synthesis System with Capability of Accent Realizing
ZHU Wei-bin
Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China
Abstract:To aim to predict and realize Chinese accent in a unit-selection based speech synthesis system, a data-driven method was used to build an accent-supported prosody module. First, with the help of Accent-Index detector which had been optimized with perceptual annotations, a speech corpus had been auto-annotated with Accent-Index. Then, a prosody predictive module supporting accent had been trained with the corpus. Replaced with the new prosody predictive module, the speech synthesis system could synthesize speech with various levels of accent. The results on the experiments had proved the accuracy of auto-detected accents, and the validity of the prosody predictor, and also the capability of accent realizing of the speech synthesis system.
[1] Ma Xijun, Zhang Wei, Zhu Weibin, et al. Probability Prosody Model for Unit Selection [A]. In: Proc. of ICASSP 2004 [C]. Montreal, Canada, 2004. 649-652. [2] Chu Min, Peng Hu, et al. Selecting Non-Uniform Units from a Very Large Corpus for Concatenative Speech Synthesizer [A]. In: Proc. of ICASSP 2001 [C]. Sale Lake City, USA, 2001. [3] 初敏. 自然言语的韵律组织中的不确定性及其在语音合成中的应用 [J]. 中文信息学报,2004,18(4): 66-71. [4] 陶建华,赵晟,蔡莲红. 基于统计韵律模型的汉语语音合成系统的研究 [J]. 中文信息学报,2002,16(4): 1-6. [5] 吴志勇,蔡莲红. 语音合成中的韵律关联模型 [J]. 中文信息学报,18(2): 44-50. [6] 吴晓如,王仁华,刘庆峰. 基于韵律特征和语法信息的韵律边界检测模型 [J]. 中文信息学报,2003,17(5): 48-54. [7] Chen Gaopeng, Hu Yu, Wang Renhua, et al. Quantitative Analysis and Synthesis of Focus in Mandarin [A]. In: Proc. of TAL 2004 [C]. Beijing, China, 2004. 25-28. [8] Greg Kochanski, Chilin Shih, et al. Hierarchical Structure and Word Strength Prediction of Mandarin Prosody [J]. International Journal of Speech Technology, 2003,6(1): 33-43. [9] 王韫佳,初敏,贺琳. 普通话语句重音在双音节韵律词中的分布 [J]. 语言科学,2004,3(5): 38-48. [10] Zhu Weibin, Shi Qin, et al. Corpus Building for Data-Driven TTS Systems [A]. In: Proc. of IEEE TTS Workshop 2002 [C]. Santa Monica, USA, 2002. 199-202. [11] Zhu Weibin, Zhang Wei, Shi Qin, et al. Automatic Detection of Chinese Accent-Index Based on Approximation-Ratio [A]. In: Proc. of ISCSLP 2004 [C]. Hong Kong, China, 2004. 85-88. [12] Zhu Weibin. Perceptual Optimization of the Chinese Accent-Index Detector [A]. In: Proc. of Speech Prosody 2006 [C]. Dresden, German, 2006. [13] Xu Yi. Effects of tone and focus on the formation and alignment of f0 contours [J]. Journal of Phonetics, 1999, 27: 55-105. [14] Shi Qin, Ma Xijun, Zhu Weibin, et al. Statistic Prosody Structure Prediction [A]. In: Proc. of IEEE TTS Workshop 2002 [C]. Santa Monica, USA, 2002.