1. Qinghai Normal University Tibetan Information Research Center, Xining, Qinghai 810008, China; 2. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 3. Computer Science School of Shaanxi Normal University, Xian, Shanxi 710062, China
Abstract:According dependency syntactic theory this paper gave Tibetan typed dependencies and its hierarchy, and then we analyzed some problems in building Tibetan dependency Treebank. We proposed a mode to construct dependency tree semi-automatically, it includes word-pairs dependency classification model and dependency edges annotation model with rich features template based on Tibetan language grammar. And we implemented visualized tool which used to build and proofreading 11 thousand sentences Treebank. On the baseline system the experimental results show that, the dependency recognition accuracy obtains an improvement of 3%. Key wordsTibetan dependency syntax; word-pair dependency classification; Tibetan Treebank; Tibetan dependency annotation tool
[1] 华却才让,赵海兴.基于判别式的藏语依存句法分析[J].计算机工程. 2013, 39 (4): 300-304. [2] 胡书津. 简明藏文文法[M]. 昆明: 云南民族出版社, 1988. [3] Peter Hellwig. Dependency Unification Grammar[C]//Proceeding of Coling86. 1986. [4] Marie-Catherine de Marne de, Christopher D. Manning[M]. Stanford typed dependencies manual. 2008. [5] 周明, 黄昌宁. 面向语料库标注的汉语依存体系的探讨[J]. 中文信息学报, 1994, 8(3): 35-51. [6] 格桑居冕. 实用藏文文法[M]. 成都: 四川民族出版社, 1987. [7] 华却才让,赵海兴.现代藏语依存句法标注初探[C].第十二届全国少数民族语言文字信息处理学术研讨会,2011.7. [8] 才让加. 藏语语料库词语分类体系及标记集研究[J]. 中文信息学报, 2009, 23(4): 146-148. [9] Jason M. Eisner. Three new probabilistic models for dependency parsing: An exploration[C]//Proceedings of COLING, 1996: 340-345. [10] Jiang Wenbin, Liu Qun. Dependency Parsing and Projection Based on Word Pair Classification[C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala, Sweden: [s. n.], 2010: 12-20. [11] McDonald R, Crammer K, Pereira F. Online Large-margin Training of Dependency Parsers[C]//Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2005: 91-98. [12] Collins M. A New Statistical Parser Based on Bigram Lexical Dependencies[C]//Proceedings of the 34th Annual Meeting on Association for Computational Linguistics. Stroudsburg, USA: Association for Computational Linguistics, 1996: 184-191.