Rule Conflict Resolution for Relation Word in Chinese Compound Sentences

YANG Jincai, XIE Fang, WANG Zhonghua, HU Jinzhu

Journal of Chinese Information Processing, 2015, Vol. 29, Issue (4): 8-15.
Section: Syntactic and Semantic Analysis

Abstract

Relation words are important for analyzing the semantic relationships among the clauses of a compound sentence. Rule-based automatic identification of relation words requires a large number of rules, and the rule base changes dynamically and is continually refined, so rule conflicts and entry errors can arise when new rules are added. This paper examines how to identify conflicting rules at the time they are entered into the rule base and how to handle them. Four classes of compound-sentence rules (common rules, consecutively used word rules, common sentence-pattern rules, and consecutively used sentence-pattern rules) are formally represented and stored. On this basis, a detection flow is designed that consists of relation word detection, constraint type detection, constraint condition detection, and conclusion detection. Two conflict-handling approaches are proposed and compared: a priority-based approach and a directed acyclic graph approach. Using this detection method together with the directed acyclic graph approach, over a thousand rules were entered into the rule base (the figure reported is 1067). Experiments show that the method detects and handles conflicting rules with 100% accuracy.
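The abstract names a four-stage detection flow and a directed acyclic graph approach but does not give the rule format, so the following Python sketch is only an illustration under stated assumptions. The `Rule` fields, the `check` and `evaluation_order` functions, the `"left_pos"` constraint label, and the simplified notion of conflict (same relation word, same constraint type, same constraint condition, opposite conclusion) are all hypothetical and are not taken from the paper. The precedence graph is one plausible reading of the directed acyclic graph idea: an edge records that one rule overrides another, and a cycle signals an inconsistent set of overrides.

```python
from dataclasses import dataclass
from graphlib import CycleError, TopologicalSorter  # Python 3.9+


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule record; the field names are assumptions."""
    rule_id: str
    relation_word: str    # candidate relation word, e.g. "因为"
    constraint_type: str  # kind of constraint (assumed label)
    constraint: str       # the constraint condition itself
    conclusion: bool      # True: the word acts as a relation word


def check(new: Rule, existing: Rule) -> str:
    """Compare two rules in the order the abstract lists the stages."""
    if new.relation_word != existing.relation_word:      # stage 1
        return "independent"
    if new.constraint_type != existing.constraint_type:  # stage 2
        return "independent"
    if new.constraint != existing.constraint:            # stage 3
        return "independent"
    if new.conclusion == existing.conclusion:            # stage 4
        return "duplicate"
    return "conflict"


def evaluation_order(overrides: dict[str, set[str]]) -> list[str]:
    """Order rule ids so that every overriding rule comes first.

    overrides[a] is the set of rule ids that rule a takes precedence over.
    Raises CycleError if the override relation is not acyclic.
    """
    # TopologicalSorter expects a mapping node -> predecessors.
    predecessors: dict[str, set[str]] = {}
    for winner, losers in overrides.items():
        predecessors.setdefault(winner, set())
        for loser in losers:
            predecessors.setdefault(loser, set()).add(winner)
    return list(TopologicalSorter(predecessors).static_order())


if __name__ == "__main__":
    r1 = Rule("r1", "因为", "left_pos", "v", True)
    r2 = Rule("r2", "因为", "left_pos", "v", False)
    print(check(r2, r1))
    print(evaluation_order({"r2": {"r1"}}))
```

In this sketch a detected conflict would be resolved by recording which rule overrides the other; asking for the evaluation order then fails with `CycleError` if the recorded overrides contradict each other, which is the property a priority number alone does not check.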

Key words

relation words in compound sentences / rule conflicts / directed acyclic graph

Cite this article

YANG Jincai, XIE Fang, WANG Zhonghua, HU Jinzhu. Rule Conflict Resolution for Relation Word in Chinese Compound Sentences. Journal of Chinese Information Processing. 2015, 29(4): 8-15

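Funding

Humanities and Social Sciences Fund of the Ministry of Education of China (13YJAZH117); National Social Science Fund of China (11BYY052)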