Rule Based Identification of Compound Sentences Relation Words
JIA Suimin1, LEI Lili2, HU Mingsheng1
1. College of Information Science & Technology, Zhengzhou Normal University, Zhengzhou, Henan 450044, China; 2. Comprehensive Experimental & Training Center, HeNan College of Finace & Taxation, Zhengzhou, Henan 451464, China
Abstract:Automatic identifying the relation words of compound sentences is a fundamental issue in the field of Chinese information processing. This paper describe a rule based method for automatic identification of compound sentence relation words. To construct the rule, 12 featuresare summarized from the corpus. Then a match algorithm is described to obtaind the candidate relation word sequence. Finally the context of the relation words is employed to match with the rules. Experiment results show that this method achieves an accuracy of 70.9%.
[1] 胡金柱,沈威,杜超华.基于规则的复句中的关系词标注探讨[J].福建电脑,2009,4:398-401. [2] 胡金柱,舒江波,姚双,等.面向中文信息处理的复句关系词提取算法研究[J].计算机工程与科学,2009,31(10):90-93. [3] 舒江波.面向中文信息处理的复句关系词自动标识研究[D].武汉:华中师范大学博士学位论文,2011. [4] 陈江曼.复句关系词自动标识系统中规则库及其维护方法研究[D].武汉:华中师范大学硕士学位论文,2012. [5] 胡金柱,雷利利,杨进才,等.多重复句关系标记搭配的求解模型研究[J].计算机工程与科学,2011,33(11):177-182. [6] 胡金柱,陈江曼,杨进才,等.基于规则的连用关系标记的自动标识研究[J].计算机科学,2012,39(7):190-194. [7] 雷利利.复句关系词自动标识系统中规则解析器的研究[D].武汉:华中师范大学硕士论文,2012. [8] Peter Linz著,孙家骕等译.形式语言与自动机导论[M].北京:机械工业出版社,2004. [9] 胡金柱,俞小娟,李琼,等.基于规则库和聚类分析的复句短语字段的自动识别研究[J].华中师范大学学报(自然科学版),2008,42(2):190-194. [10] 张金,王军海,耿标.基于规则解析的柔性编码系统[J].计算机系统应用,2006,3:17-20. [11] Schubert Foo, Hui Li. Chinese word segmentation and its effect on information retrieval [J]. Information Processing and Management, 2004, 40(1):161-191. [12] George A Miller. WordNet: A Lexical Database for English[C]//Proceedings of Communications of the ACM. 1995, 38:39-41. [13] Lafferty J, McCallum A, Pereira F. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data[C]//Proceedings of the 18th ICML-01, 2001:282-289. [14] Zhang Kunli, Zhang Wencong, Zan Hongying, et al. Studies on automatic recognition of several common Chinese adverbs usages based on BP neural networks[C]//Proceedings of the 10th Chinese Lexical Semantics Workshop. 烟台:鲁东大学出版社,2009: 31-37. [15] Lovasz L, Plummer M D. Matching theory [M]. Amsterdam: Elsevier Science, 2009. [16] 刘盈盈,罗森林,冯扬,等. BFS-CTC汉语句义结构标注语料库[J].中文信息学报,2013,27(1):72-80. [17] 张坤丽,赵丹,昝红英,等. 常用现代汉语副词用法自动识别研究[J].中文信息学报,2012,26(6):65-71.