Abstract
Almost all large database systems in current use, such as Oracle, Sybase, and DB2, lack support for China's minority languages such as Tibetan, Mongolian, and Uyghur. How to store, query, and retrieve minority-language information in a database, and how to support database applications built on these languages, is an important problem. This paper proposes a multilingual support framework for database management systems that covers minority languages, database client tools, and application programming interfaces. Within this framework, it also proposes a Tibetan sorting method that conforms to the semantics of ISO/IEC 14651, which gives PostgreSQL comprehensive support for Tibetan information processing. The framework has been implemented in the PostgreSQL database system on the Red Flag Linux platform.
Key words
Computer application /
Chinese information processing /
DBMS /
Multi-lingual support for minority languages /
Tibetan /
Dictionary order
Funding
National "863" Program of China (2003AA1Z2110); Knowledge Innovation Program of the Chinese Academy of Sciences (KGCX2-SW-504)