As native character encoding schemes in computers converge on a single universal standard, ISO has published the international standard ISO/IEC 10646 to support that trend. In this paper, we first analyze the limitations of existing Chinese character input methods. Almost all of them are based on ANSI codes such as GB2312, GBK, and BIG-5, and they share several shortcomings, including poor portability and a lack of support for cross-lingual platforms. We then propose a Chinese character input method model based on ISO/IEC 10646 that consists of six parts: input method management, a code mapping table based on ISO/IEC 10646, code searching and filtering, an interface with the OS, an input method kernel, and a localization interface. Finally, we discuss the design and processing techniques for the mapping table from input codes to Chinese characters, a key component of the proposed model.
LI Pei-feng, ZHU Qiao-ming, QIAN Pei-de.
A Chinese Character Input Model Based on ISO/IEC 10646. Journal of Chinese Information Processing. 2006, 20(5): 93-98
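As a rough illustration of two parts named in the abstract, the code mapping table and the code searching and filtering step, the following minimal sketch stores input codes against ISO/IEC 10646 code points and filters candidates by a target encoding. It is not the authors' implementation: the class `MappingTable`, the function `filter_by_charset`, and the pinyin-style sample entries are all hypothetical, chosen only to make the idea concrete.

```python
from bisect import bisect_left

# Hypothetical sample entries: (input code, ISO/IEC 10646 code point).
# The pinyin-style input codes are illustrative, not taken from the paper.
ENTRIES = [
    ("han", 0x6C49),    # 汉 (simplified form)
    ("han", 0x6F22),    # 漢 (traditional form)
    ("zi", 0x5B57),     # 字
    ("zhong", 0x4E2D),  # 中
]


class MappingTable:
    """Sorted table from input codes to UCS code points (assumed layout)."""

    def __init__(self, entries):
        # Sort by input code, then by code point, so prefixes are contiguous.
        self._rows = sorted(entries)
        self._codes = [code for code, _ in self._rows]

    def search(self, prefix):
        """Return every character whose input code starts with `prefix`."""
        start = bisect_left(self._codes, prefix)
        result = []
        for code, code_point in self._rows[start:]:
            if not code.startswith(prefix):
                break
            result.append(chr(code_point))
        return result


def filter_by_charset(chars, charset):
    """Keep only the characters that the given legacy charset can encode."""
    kept = []
    for ch in chars:
        try:
            ch.encode(charset)
        except UnicodeEncodeError:
            continue
        kept.append(ch)
    return kept


table = MappingTable(ENTRIES)
candidates = table.search("han")
simplified_only = filter_by_charset(candidates, "gb2312")
traditional_only = filter_by_charset(candidates, "big5")
```

Because the table is keyed on code points rather than on one regional encoding, a single table can serve several target environments, and the filter decides which candidates to offer in each.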