王磊,刘加. 基于RFC模型的基频曲线导数域编码方法研究[J]. 中文信息学报, 2009, 23(6): 86-91.
WANG Lei, LIU Jia. The Derivative Domain Codes of Pitch Curse and Applications. , 2009, 23(6): 86-91.
基于RFC模型的基频曲线导数域编码方法研究
王磊,刘加
清华信息科学与技术国家实验室(筹) 清华大学电子工程系,北京100084
The Derivative Domain Codes of Pitch Curse and Applications
WANG Lei, LIU Jia
Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
Abstract:Fundamental frequency (or pitch), usually named as F0, is the vibration frequency of vocal cord during the production of voiced sounds. In a syllable or continuous voice paragraph, F0 changes with time and yields the fundamental frequency (or pitch) curves. It is particularly important to descript and investigate the F0 curve because it usually reflects the rhythm information, such as tone and stress. This paper first proposes a new method to describe F0 curve——the derivative domain codes, and then it discusses the role of the coding method on the rhythm in the evaluation of speech pronunciation. Experimental results show that the method can be used to evaluate the English prosody. The correlation coefficient between the subjective and objective scores of pitch extreme difference improves from 0.38 to 0.49. Keywordartifical intelligence;pattern recognition;pitch;derivative;codes;application
[1] 陈高鹏,胡郁,王仁华.考虑语速和前后环境的基频Target模型及实现[J].中文信息学报,2004,18(3): 81-85. [2] 韩纪庆,张磊,郑铁然.语音信号处理[M].北京: 清华大学出版社,2004. [3] H.Fujisaki, S.Ohno, O.Tom Ita. Automatic Parameter Extraction of Fundamental Frequency Contours of Speech Based on a Generative Model[J].Proceedings of ICSP'96,1996,1: 729-732. [4] Paul A. Taylor. The Rise/Fall/Connection Model of Intonation[J].Speech Communication,1995,15: 169-186. [5] 朱芸.计算机辅助英语学习系统中的韵律分析与建模方法研究[D].北京: 清华大学,2004. [6] 高等数学(第五版)[M].同济大学应用数学系,北京: 高等教育出版社,2005. [7] 王文剑,王长富,戴蓓倩,等.基于藤崎模型的汉语语音基频轮廓的参数提取[J].小型微型计算机系统,1999,20(10): 756-759. [8] 覃福森.英语音高与英语语调关系研究[J].学术问题研究(综合版),2007,(1): 76-82. [9] 李超雷.交互式语言学习系统中的发音质量客观评价方法研究[D].北京: 中国科学院电子学研究所,2007. [10] IEEE. IEEE recommended practice for speech quality measurements [J]. IEEE Trans. on Audio and Electroacoust Sep. 1969: 227-246.