顾平,朱巧明,李培峰,钱培德. 智能型汉字数码输入技术的研究[J]. 中文信息学报, 2006, 20(4): 102-107.
GU Ping,ZHU Qiao-ming,LI Pei-feng,QIAN Pei-de. Research on the Intelligent Chinese Character Input Technique Using Digital Code. , 2006, 20(4): 102-107.
智能型汉字数码输入技术的研究
顾平,朱巧明,李培峰,钱培德
苏州大学计算机科学与技术学院,江苏省计算机信息技术重点实验室
Research on the Intelligent Chinese Character Input Technique Using Digital Code
GU Ping,ZHU Qiao-ming,LI Pei-feng,QIAN Pei-de
School of Computer Science and Technology , Key Lab of Computer Information Processing Technology of Jiangsu Province
Abstract:An intelligent digital code-based input technique for Chinese characters, which features in improving the input rules without modifying the original coding scheme and combining the language model, is proposed. The paper disusses how to design the Chinese character and word code to meet the various input modes at first. then designs a dynamic self-study language model, and analyses the data smoothing algorithm in the language model. The experimental results regarding the input performance are given at last, by comparing the intelligent input method with the orginal method, showing that the proposed input technique can not only reduce the average input code length, but also improve the hit rate of the first candidate character.
[1] 马少平,夏莹,等. 基于词词同现概率的拼音汉字自动转换方法[J]. 电子计算机与外部设备, 1997, 21 (3) : 16 - 19. [2] 吴军,王作英,等. 一种基于语言理解的输入方法[J]. 中文信息学报, 1996, 10 (2) : 56 - 61. [3] 徐志明,王晓龙,姜守旭. 一种语句级汉字输入技术的研究[J]. 高技术通讯, 2000, (1) : 51 - 56. [4] 马少平,夏莹,张金岭. 智能型数字码汉字输入技术[J]. 电子计算机与外部设备, 1999, (2) : 27 - 29. [5] 王华,王晋豪,杨妙玲. 智能笔划输入法的研制和应用[J]. 艺术科技, 2003, (1) : 50 - 52. [6] 陈一凡,朱亮. 汉字键盘输入智能处理软件综述[J]. 中文信息学报, 2003, 17 (2) : 60 - 65. [7] 人民日报数据. http://library.suda.edu.cn /wlsjk/jinbao.htm. [8] Stanley F Chen, Joshua Goodman. An Empirical Study of Smoothing Techniques for Language Modeling[J]. In proceedings of the 34th Annual Meeting of the ACL, 1996: 310 - 318. [9] Stanley F Chen, Joshua Goodman. An Empirical Study of Smoothing Techniques for Language Modeling[R]. Technical Report TR-10-98, Center for Research in Computing Technology, Harvard University, 1998. [10] Kenneth W Church, William A Gale. A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams [J]. Computer Speech and Language, 1991, (5) : 19 - 54. [11] SlavaM Katz. Estimation of probabilities from sparse data for the language model component of a speech recognizer[R]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1987, 35 (3) : 400 - 401. [12] 张华平,刘群. 基于N2最短路径的中文词语粗分模型[J]. 中文信息学报, 2002, 16 (5) : 1 - 7. [13] ICTCLAS. http://www.nlp.org.cn. [14] GB/T19246 - 2003,信息技术通用键盘汉字输入通用要求[S].