该文介绍了哈萨克文专用字母

的特殊书写习惯,以及哈萨克文编码字符处理现状。指出当前广泛使用的字母替换法不符合国际和国家相关标准,并且会导致哈萨克文排序错误,增加文字转换、语音合成等功能的实现难度。为解决上述不足,对字母替换法进行了三个改进,包括用专用字母与符号“

”结合表示它们自己;专用字母各种书写形式带符号

的字形中,仅将独立字符形式带符号“

”的字形包含在OpenType字体中;用字形替换规则<calt>识别专用字母与哈萨克文字母不相邻的上下文环境。为便于改进方法的应用,该文介绍了与改进方法一致的OpenType字体字形替换规则设置方法。
Abstract
This paper describes the special writing rules of the Kazakh letters

and

, pointing out the current substitution method does not comply with international or national standards and obstructs Kazakh processing in text sorting, script conversion and speech synthesis. This paper proposed three improvements, i.e. 1) representing the four special letters with the combination of themselves and character

; 2) include only isolated forms

with

in OpenType font; and 3) identifying the contexts that are not adjacent to the Kazakh letter based on the glyph substitute rule <calt> in OpenType font. To facilitate the application of the above suggestions, this paper describes the set of the glyph substitution rules in OpenType font which is consistent with the improved method.
关键词
哈萨克文 /
编码字符 /
Unicode /
OpenType
{{custom_keyword}} /
Key words
Kazakh /
coded character /
Unicode /
OpenType
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] 中华人民共和国国家统计局.第六次人口普查数据.[EB/OL]. http://www.stats.gov.cn/tjsj/.html,2015-12-16.
[2] Unicode 8.0.0 Character Code Charts. Arabic [EB/OL].http://www.unicode.org/charts/PDF/U0600.pdf,2015-12-16.
[3] 陈壮. 中国在ISO/ IEC JTC1/ SC2 的活动与中文编码的国际标准化[J]. 中文信息学报, 2007, 21(4): 122-128.
[4] Unicode Bidirectional Algorithm.[EB/OL]. http://www.unicode.org/reports/tr9/tr9-33.html,2015-12-16.
[5] The Unicode Standard Version 8.0.0-Core Specification, Middle East-I Modern and Liturgical Scripts Eastern Script [EB/OL].http://www.unicode.org/versions/Unicode8.0.0/ch09.pdf,2015-12-20.
[6] Unicode 8.0.0 Character Code Charts. Arabic Presentation Forms-A[EB/OL].http://www.unicode.org/charts/PDF/UFB50.pdf,2015-12-16.
[7] Unicode 8.0.0 Character Code Charts. Arabic Presentation Forms-B[EB/OL].http://www.unicode.org/charts/PDF/UFE70.pdf,2015-12-20.
[8] 全国信息技术标准化技术委员会.GB 21669-2008,信息技术 维吾尔文、哈萨克文、柯尔克孜文编码字符集[S].北京: 中国标准出版社,2008: 4.
[9] 肖明,胡金柱,赵慧. 字形技术及OpenType字体文件格式研究[J]. 中文信息学报, 1999, 13(6): 54-61.
[10] 木合亚提·尼亚孜别克, 古力沙吾利. 哈萨克文信息处理的现状和发展方向[J]. 中文信息学报, 2010, 24(4): 111-114.
[11] Microsoft Typography Home. OpenType Registered features[EB/OL].http://www.microsoft.com/typography/otspec/features_ae.html,2015-12-20.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
中科院西部之光项目(No. YBXM-2014-04);中科院仪器设备功能开发技术创新项目(No. YG2012114)
{{custom_fund}}