一个基于ISO/IEC10646的汉字输入模型

李培峰,朱巧明,钱培德

PDF(140 KB)
PDF(140 KB)
中文信息学报 ›› 2006, Vol. 20 ›› Issue (5) : 93-98.

一个基于ISO/IEC10646的汉字输入模型

  • 李培峰,朱巧明,钱培德
作者信息 +

A Chinese Character Input Model Based on ISO/IEC 10646

  • LI Pei-feng,ZHU Qiao-ming,QIAN Pei-de
Author information +
History +

摘要

计算机中各国文字编码的统一是必然趋势,而ISO/IEC10646正是顺应这种趋势而诞生的一个国际标准。现有的输入法绝大多数是基于本地代码页(ANSI CODE),存在着移植困难、不能跨语言平台以及向国际化标准过渡困难等缺点。本文首先分析了现有本地化输入法存在的问题,并在此基础上阐述了基于ISO10646的汉字输入法的实现方法,并给出了一个以ISO10646为核心的通用汉字输入法模型和原理,该模型由输入法管理/服务器、ISO10646输入码对照表、码本检索/过滤模块、输入法与OS接口模块、输入法内核和本地化接口六部分构成。最后,本文重点论述了输入法的核心—输入码对照表的设计和检索技术。

Abstract

With the trend of unifying all the native character encoding schemes in computers, ISO has published an international standard named ISO/IEC 10646 to meet that developing tide. In this paper, we firstly analyze the limitation of the existing Chinese character input methods. It has been observed that almost all the existing Chinese character input methods are based on ANSI Code, such as GB2312, GBK, BIG-5, and these input methods have many shortcomings including the inconvenience of transference and the lack of supporting cross-lingual platforms. Then we propose a model of Chinese character input method based on ISO10646/IEC which consists of six parts: input method management, code mapping table based on ISO10646/IEC, code searching and filtering, interface with OS, input method kernel as well as localization interface. At last, we discuss the design and processing technology for the input codes-Chinese characters mapping table, a key factor in the proposed model.

关键词

计算机应用 / 中文信息处理 / 输入法模型 / ISO/IEC10646 / Unicode / 输入码对照表

Key words

computer application / Chinese information processing / Chinese character input model / ISO/IEC 10646 / Unicode / input codes-Chinese characters mapping table

引用本文

导出引用
李培峰,朱巧明,钱培德. 一个基于ISO/IEC10646的汉字输入模型. 中文信息学报. 2006, 20(5): 93-98
LI Pei-feng,ZHU Qiao-ming,QIAN Pei-de. A Chinese Character Input Model Based on ISO/IEC 10646. Journal of Chinese Information Processing. 2006, 20(5): 93-98

参考文献

[1] 朱巧明,李培峰. 基于Windows9x/2000/NT平台汉字输入法的设计[J]. 小型微型计算机系统. 2000, 21 (11) : 1217 - 1220.
[2] Wu Xian, Zhu Qiaoming, Li Peifeng, et al. The Pretreatment of Chinese Character Database based on ISO 10646 [A]. Wuhan: Proceedings of the Fourth International Conference on Computer and Information Technology (CIT 2004) [C] , 2004: 1134 - 1140.
[3] ISO/IEC 10646 - 1: 1993 (E) /10646 - 1: 2000 (E)/10646 - 2: 2001 (E). Universal Multiple-Octet Coded Character Set (UCS) [S].
[4] 李培峰,朱巧明,钱培德. 多文种环境下汉字内码识别算法的研究[J]. 中文信息学报. 2004, 18 (2) : 73 - 79.
[5] 李培峰,朱巧明,钱培德. 一个基于多内码的中文屏幕实时解释引擎的设计[J]. 中文信息学报, 2005, 19 (5) : 90 - 96.
[6] Information Technology Services Department & Official Languages Agency of Hong Kong, Hong Kong Supplementary Character Set 2004 [Z] , 2004.

基金

江苏省高技术研究资助项目(BG2005020);江苏省教育厅自然基金资助项目(04KKB320134)
PDF(140 KB)

765

Accesses

0

Citation

Detail

段落导航
相关文章

/