LIU Wei, LIU Feijing, WANG Dong, LIU Zongtian. A Text Event Elements Extraction Method Based on Event Ontology[J]. Journal of Chinese Information Processing, 2016, 30(4): 167-175.
A Text Event Elements Extraction Method Based on Event Ontology
LIU Wei, LIU Feijing, WANG Dong, LIU Zongtian
School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Abstract: Extracting event elements is a challenging task in event-based information extraction. Current solutions rely mainly on machine learning methods, which suffer from corpus sparsity. This paper proposes an event element extraction method based on event ontology. Event element reasoning proceeds in two steps. First, element values are initially filled in according to the positional relations between event element words and event indicator words, and the event with the highest confidence is selected as the seed event. Second, the event ontology is searched for the event class constraints and non-taxonomic relations of the seed event, which are then used to complement and revise the event elements. Experimental results show that this method can improve the accuracy of event element extraction.
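The two-step reasoning process described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the confidence measure, the window size, or the ontology format, so the `Event` class, the fill-ratio confidence, the `window` parameter, and the toy `ONTOLOGY` dictionary below are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field

ELEMENT_TYPES = ("time", "location", "participant")

@dataclass
class Event:
    event_class: str      # ontology class of the event (hypothetical labels)
    indicator_pos: int    # token position of the event indicator word
    elements: dict = field(default_factory=dict)  # element type -> value

def fill_by_position(event, element_words, window):
    """Step 1 (assumed form): give each missing element the closest
    candidate word of that type within `window` tokens of the indicator."""
    for etype in ELEMENT_TYPES:
        if etype in event.elements:
            continue
        candidates = [
            (abs(pos - event.indicator_pos), word)
            for pos, word, wtype in element_words
            if wtype == etype and abs(pos - event.indicator_pos) <= window
        ]
        if candidates:
            event.elements[etype] = min(candidates)[1]

def confidence(event):
    """Assumed confidence: fraction of element types that are filled."""
    return sum(t in event.elements for t in ELEMENT_TYPES) / len(ELEMENT_TYPES)

def refine_with_ontology(seed, events, ontology):
    """Step 2 (assumed form): use non-taxonomic relations to complement
    elements from the seed event, then class constraints to revise them."""
    for ev in events:
        if ev is seed:
            continue
        shared = ontology["relations"].get((seed.event_class, ev.event_class), ())
        for etype in shared:
            if etype in seed.elements:
                ev.elements.setdefault(etype, seed.elements[etype])
        allowed_by_type = ontology["constraints"].get(ev.event_class, {})
        for etype, allowed in allowed_by_type.items():
            if etype in ev.elements and ev.elements[etype] not in allowed:
                del ev.elements[etype]  # value violates the class constraint

def extract(events, element_words, ontology, window=2):
    for ev in events:
        fill_by_position(ev, element_words, window)
    seed = max(events, key=confidence)  # highest-confidence event is the seed
    refine_with_ontology(seed, events, ontology)
    return seed

# Toy ontology: a "follow"-style relation sharing time and location,
# and a constraint on the participants of a rescue event (both hypothetical).
ONTOLOGY = {
    "relations": {("earthquake", "rescue"): ("time", "location")},
    "constraints": {"rescue": {"participant": {"rescuers", "firefighters"}}},
}

# (position, word, element type) triples for a toy sentence.
element_words = [(0, "yesterday", "time"), (4, "Sichuan", "location"),
                 (6, "rescuers", "participant")]
quake = Event("earthquake", indicator_pos=2)
rescue = Event("rescue", indicator_pos=7)
seed = extract([quake, rescue], element_words, ONTOLOGY)
```

In this toy run the earthquake event fills two of its three elements from nearby words and becomes the seed, and the rescue event, which found only its participant by position, inherits time and location through the assumed relation.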