Abstract
This paper describes the construction and analysis of TH-CoSS, a speech corpus for Mandarin text-to-speech (TTS) synthesis. The text script consists of four parts: sentences for TTS system building, sentences for TTS system evaluation, sentences of special types that convey special intonation, and special syllable groups. The corpus design takes into account the balance of the material and the richness of its segmental and prosodic information. The finished corpus contains about 20K sentences read by one female and one male speaker. In addition to the text and speech data, the corpus provides segment boundary marks; the annotation files are in XML format and include segmental and prosodic tags. Annotation software was also developed to facilitate speech analysis and development. Based on the syllables in TH-CoSS, the paper also presents an analysis of how context features influence speech prosody.
Key words
computer application /
Chinese information processing /
speech synthesis /
Chinese /
corpus
Funding
Supported by the National 863 Program and the National Natural Science Foundation of China (60418012, 60433030)