As we process minority scripts in computers based on ISO/IEC 10646 and Unicode standards , there is a bottle - neck problem that variations of presentation characters have no definite code points. It is why many software systems processing minority scripts are produced in repetition and are incompatible with each other. Based on scripts processing architecture in ICU , this paper illustrates methods of implementation of minority scripts processing complying with Unicode standard. Firstly , we analyze the characteristics of minority scripts , and point out the difficulties of processing them. Then the OpenType font technology , which can satisfy the requirements of minority languages processing , is introduced. Lastly , we illuminate the principle of Layout Engine , as well as present how to embed minority scripts processing in ICU.
DONG Zhi-jiang,WU Jian,ZHONG Yi-xin.
Implementation of Minority Languages Processing in ICU. Journal of Chinese Information Processing. 2004, 18(2): 67-73
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] International Standard ISO/IEC 10646 - 1 Second Edition. Information technology - Universal Multiple-Octet Coded Character Set (UCS) [S] ,2000. [2] The Unicode Consortium. http://www.Unicode.org[EB]. [3] Introduction to ICU. http://oss.software.ibm.com/icu/userguide/icu.pdf [EB]. [4] OpenType specification. http://www.microsoft.com/typography/otspec/ [EB]. [5] 确精扎布. 蒙古文编码[M] . 呼和浩特:内蒙古大学出版社,2000. [6] Owen Taylor. Pango : internationalized text handling . http://lwn.net/2001/features/OLS/pdf/pdf/pango.pdf [EB] ,2001.