梁奇,郑方,徐明星,吴文虎. 基于trigram语体特征分类的语言模型自适应方法[J]. 中文信息学报, 2006, 20(4): 70-76.
LIANG Qi,ZHENG Fang,XU Ming-xing,WU Wen-hu. Language Model Adaptation Based on the Classification of a Trigram’s Language Style Feature. , 2006, 20(4): 70-76.
基于trigram语体特征分类的语言模型自适应方法
梁奇,郑方,徐明星,吴文虎
清华大学计算机科学与技术系智能技术与系统国家重点实验室语音技术中心
Language Model Adaptation Based on the Classification of a Trigram’s Language Style Feature
LIANG Qi,ZHENG Fang,XU Ming-xing,WU Wen-hu
The State Key laboratory of Intelligence Technology and System , Department of Computer Science and Technology , Tsinghua University
Abstract:In this paper, a language style based adaptive method for language model is proposed based on the differences between oral and written languages. Several interpolation methods based on trigram counts are used for the adaptation. An interpolation method considering Katz smoothing computes weights according to the confidence score of a trigram. An adaptation method based on the classification of a trigram’s style feature computes weights dynamically according to the trigram’s language style tendency with several weight generation functions proposed. Experiments on spoken Chinese corpora show that these methods could reduce the Chinese character error rate for pinyin-to-character conversion to some extent, more or less, and the one considering both a trigram’s confidence and style tendency achieved the best performance with character error rate reduction of 50.2% and 23.7% , respectively, compared with two baselines in this paper.
[1] Gengqing Wu, Fang Zheng. A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices[J]. J. Computer Science & Technology, 2003, 18 (6) : 747 - 755. [2] Fang Zheng, Zhanjiang Song, Pascale Fung, et al. Mandarin Pronunciation Modeling Based on CASS Corpus [J]. J. Computer Science & Technology, May 2002, 17 (3) : 249 - 263. [3] R. Rosenfeld, et al. Error Analysis and Disfluency Modeling in the Switchboard Domain [A]. In: proceedings of the 4th International Conference on Speech and Language Processing (ICSLP) [C]. Philadelphia, PA, USA, 1996. [4] 吴根清,郑方,金凌,等,一种在线递增式语言模型自适应方法[J]. 中文信息学报, 2002, 16 (1) : 60 - 65. [5] Rukmini M. Iyer, Mari Ostendorf, Modeling long distance dependence in language: topic mixtures versus dynamic cache models[J]. IEEE Transactions on Speech and Audio Processing, Volume 7 Issue 1, Jan 1999. Page(s) : 30 - 39. [6] Daniel Gildea, Thomas Hofmann. Topic Based Language Models Using EM[A]. In: proceedings of 6th European Conference on Speech Communication and Technology (Eurospeech′99) [C]. 1999, pages 2167 - 2170. [7] R. Rosenfeld. A Maximum Entropy Approach to Adaptive Statistical LanguageModel[J]. Computer Speech & Language, 1996, 10: 187 - 228. [8] S. M. Katz. Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer[J]. IEEE Transaction on Acoustic, Speech and Signal Processing, 1987, 35 (3) : 400 - 401. [9] Genqing WU, Fang ZHENG, Wenhu WU, et al, Improved Katz smoothing for language modeling in speech recognition[A]. International Conference on Spoken Language Processing 2002 [C]. Colorado, USA, Sep.16 - 20, 2002, pp. 925 - 928.