Abstract
Mandarin Chinese is a tonal language. Its tones can be described by the pitch contour, which is expressed in terms of fundamental frequencies. The classic approaches to smoothing the fundamental frequency, such as linear smoothing, median smoothing, and ordinary linear interpolation, do not work well when the fundamental frequency is detected incorrectly at random points across several consecutive frames. This paper presents a new smoothing approach that uses a search method to obtain a more accurate pitch contour. The approach is simple, reliable, and fast. Experimental results show that it reduces the recognition error rate by about 40% compared with the traditional methods.
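To illustrate the limitation of the classic smoothers mentioned in the abstract, the following is a minimal Python sketch of a sliding-window median smoother applied to a pitch contour. It is a baseline only and is not the search-based approach proposed in the paper, whose details are not given here. The function name `median_smooth`, the window width of 3, and the F0 values (in Hz, with simulated pitch-halving errors) are all assumptions made for illustration.

```python
from statistics import median

def median_smooth(f0, width=3):
    """Sliding-window median smoother (classic baseline, not the paper's method)."""
    half = width // 2
    smoothed = []
    for i in range(len(f0)):
        # Clip the window at the contour boundaries.
        lo = max(0, i - half)
        hi = min(len(f0), i + half + 1)
        smoothed.append(median(f0[lo:hi]))
    return smoothed

# Hypothetical F0 contours in Hz (illustrative values, not data from the paper).
single_error = [200, 202, 101, 206, 208, 210]       # one isolated halving error
burst_error = [200, 202, 101, 103, 104, 210, 212]   # three consecutive bad frames

smoothed_single = median_smooth(single_error)
smoothed_burst = median_smooth(burst_error)

print(smoothed_single)
print(smoothed_burst)
```

With a window of 3, the isolated error is replaced by a neighboring value, but the run of three erroneous frames passes through almost unchanged, because the bad frames form the majority inside the window. A wider window would remove the burst at the cost of flattening the genuine tone contour.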
Key words
pitch detection /
smoothing /
tone recognition /
speech recognition
Funding
National Natural Science Foundation of China (69982005); National Key Basic Research Development Program of China (G19803050703)