邱盈盈,洪宇,周文瑄,姚建民,朱巧明. 面向事件抽取的深度与主动联合学习方法[J]. 中文信息学报, 2018, 32(6): 98-106.
QIU Yingying, HONG Yu, ZHOU Wenxuan, YAO Jianmin, ZHU Qiaoming. Combining Deep Learning and Active Learning for Event Extraction. , 2018, 32(6): 98-106.
面向事件抽取的深度与主动联合学习方法
邱盈盈,洪宇,周文瑄,姚建民,朱巧明
苏州大学 江苏省计算机信息处理重点实验室,江苏 苏州 215006
Combining Deep Learning and Active Learning for Event Extraction
QIU Yingying, HONG Yu, ZHOU Wenxuan, YAO Jianmin, ZHU Qiaoming
Provincial Key Laboratory of Computer Information Processing Technology, Soochow University, Suzhou, Jiangsu 215006, China
Abstract:Event extraction aims at extracting event information from raw texts and representing them as a structured text. As a basic event extraction method,supervised learning often suffers from small scale,imbalanced distribution and uneven quality of training corpus. Moreover,traditional event extraction methods based on feature engineering are complicated and will always cause error propagation. To address these issues,this paper presents a method to combine deep learning and active learning by the confidence of the query function based on RNN's trigger classification,in order to improve the quality and efficiency of corpus annotation as well as the ultimate performance. The experimental results show that this joint learning method can improve the event extraction,with substantial room for further exploration.
[1] Shasha Liao,Ralph Grishman.Using prediction from sentential scope to build a Pseudo co-testing learner for event extraction[C]//Proceedings of the 5th International Joint Conference on Natural Language Processing(IJCNLP),Chiang Mai,Thailand,2011:714-722. [2] Yu Hong,Jianfeng Zhang,Bin Ma,et al.Using cross-entity inference to improve event extraction[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics(ACL),Portland,USA,2010:1127-1136. [3] Heng Ji,Ralph Grishman.Refining event extraction through cross-document inference[C]//Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics(ACL),Colunbus,USA,2008:254-262. [4] Qi Li,Heng Ji,Liang Huang.Joint event extraction via structured prediction with global features[C]//Proceedings of the 51th Annual Meeting of the Association for Computational Linguistics(ACL).Sofia,Bulgaria,2013:73-82. [5] 肖升,何炎祥.事件超图模型及类型识别[J].中文信息学报,2013,27(01):30-38. [6] Ofer Bronstein,Ido Dagan,Qi Li,et al.Seed-based event trigger labeling:How far can event descriptions get us? [C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing(ACL-IJCNLP),Beijing,China,2015:372-376. [7] Thien Huu Nguyen,Ralph Grishman.Event detection and domain adaptation with convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing(ACL-IJCNLP),Beijing,China,2015:365-371. [8] Hu B,Lu Z,et al.Convolutional neural networkarchitectures for matching natural language sentences[C]//Proceedings of the Advances in Neural Information Processing Systems(NIPS),Quebec,Canada,2014,2042-2050. [9] Yubo Chen,Liheng Xu,Kang Liu,et al.Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing(ACL-IJCNLP),Beijing,China,2015:167-176. [10] David Yarowsky.Unsupervised word sense disambiguation rivaling supervised methods[C]//Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics(ACL).Cambridge,US,1995:189-196. [11] Prashant Gupta,Heng Ji.Predicting unknown time arguments based on cross-event propagation[C]//Proceedings of the ACL-IJCNLP 2009 Conference Short Papers,Suntec,Singapore,2009:369-372. [12] Shasha Liao,Ralph Grishman.Using document level cross-event inference to improve event extraction[C]//Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics(ACL),Uppsala,Sweden,2010:789-797. [13] Shasha Liao,Ralph Grishman.Can document selection help semi-supervised learning? A case study on event extraction[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics(ACL),Oregon,Portland,2011:260-265. [14] Ralph Grishman,David Westbrook,Adam Meyers.NYU’s English ACE 2005 system description[C]//Proceedings of ACE 2005 Evaluation Workshop,Gaithersburg,USA,2005:5-19. [15] Peifeng Li,Qiaoming Zhu,Guodong Zhou.Employing event inference to improve semi-supervised chinese event extraction[C]//Proceedings of COLING 2014 and the 25th International Conference on Computational Linguistics(Coling),Dublin,Ireland,2014:2161-2171. [16] Shasha Liao,Ralph Grishman.Filtered ranking for bootstrapping in event extraction[C]//Proceedings of the 23rd International Conference on Computational Linguistics(Coling),Beijing,China,2010:680-688. [17] 徐霞,李培峰,朱巧明.一个半监督的中文事件抽取方法[J].中文信息学报,2016,30(02):168-174. [18] Mesnil G,He X,Deng L,et al.Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding[C]//Proceedings of Interspeech(Interspeech),Lyon,France,2013:3771-3775. [19] George A M.WordNet:A lexical database for English [J].Communications of the ACM,1995,38(11):39-41. [20] Lewis,J Catlett.Heterogeneous uncertainty sampling for supervised learning[C]//Proceedings of the International Conference on Machine Learning(ICML).Morgan Kaufmann,1994,148-156. [21] Minling Zhang,Zhihua Zhou.ML-KNN:A lazy learning approach to multi-label learning[J].Pattern Recognition,2007,40(7):2038-2048. [22] Schein A I,Ungar L H.Active learning for logistic regression:an evaluation[J].Machine Learning,2007,68(3):235-265.