Research on Chinese Event Extraction
ZHAO Yan-yan, QIN Bing, CHE Wan-xiang, LIU Ting
Information Retrieval Laboratory, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China
Abstract: Event extraction is an important research topic in information extraction. This paper studies in depth the two stages of Chinese event extraction: event type recognition and event argument recognition. For event type recognition, we present a novel method that combines event trigger expansion with a binary classifier. For argument recognition, we introduce a multi-class classification method based on maximum entropy. Together, these methods effectively address the class imbalance in the training data and the data sparseness caused by the small training set, and the resulting event extraction system achieves improved performance.
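As a rough illustration of the argument recognition step, the sketch below trains a small maximum entropy (multinomial logistic regression) classifier that assigns a role to a candidate argument. It is only a minimal sketch under stated assumptions: the role labels, the feature strings, the toy training data, and the function names `train_maxent`, `predict_proba`, and `classify` are all hypothetical and are not taken from the paper, whose actual features and training procedure are not given in the abstract.

```python
import math
from collections import defaultdict

# Hypothetical role labels and features, chosen only for illustration.
LABELS = ["Attacker", "Target", "None"]

TRAIN = [
    (["etype=PER", "pos=before"], "Attacker"),
    (["etype=PER", "pos=after"], "Target"),
    (["etype=TIME", "pos=before"], "None"),
    (["etype=TIME", "pos=after"], "None"),
]


def predict_proba(weights, feats, labels):
    """Return P(label | feats) under a log-linear (maximum entropy) model."""
    scores = {
        lab: sum(weights.get((f, lab), 0.0) for f in feats) for lab in labels
    }
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {lab: math.exp(s - top) for lab, s in scores.items()}
    total = sum(exps.values())
    return {lab: e / total for lab, e in exps.items()}


def train_maxent(samples, labels, epochs=200, lr=0.5):
    """Fit feature weights by stochastic gradient ascent on the log-likelihood."""
    weights = defaultdict(float)  # (feature, label) -> weight
    for _ in range(epochs):
        for feats, gold in samples:
            probs = predict_proba(weights, feats, labels)
            for lab in labels:
                # Gradient: observed indicator minus expected probability.
                grad = (1.0 if lab == gold else 0.0) - probs[lab]
                for f in feats:
                    weights[(f, lab)] += lr * grad
    return weights


def classify(weights, feats, labels):
    """Pick the most probable role for one candidate argument."""
    probs = predict_proba(weights, feats, labels)
    return max(labels, key=lambda lab: probs[lab])


weights = train_maxent(TRAIN, LABELS)
print(classify(weights, ["etype=PER", "pos=before"], LABELS))
```

Including an explicit `None` label lets a single multi-class model both reject non-arguments and pick a role for real ones, which is one common way to frame this step.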
Received: 31 May 2007