李斌,闻媛,宋丽,卜丽君,曲维光,薛念文. 融合概念对齐信息的中文AMR语料库的构建[J]. 中文信息学报, 2017, 31(6): 93-102.
LI Bin, WEN Yuan, SONG Li, BU Lijun, QU Weiguang, XUE Nianwen. Construction of Chinese Abstract Meaning Representation Corpus with Concept-to-word Alignment. , 2017, 31(6): 93-102.
Construction of Chinese Abstract Meaning Representation Corpus with Concept-to-word Alignment
LI Bin1, WEN Yuan1, SONG Li1, BU Lijun1, QU Weiguang2,3, XUE Nianwen4
1.School of Chinese Language and Literature, Nanjing Normal University, Nanjing, Jiangsu 210023, China; 2.School of Computer Science and Technology, Nanjing Normal University, Nanjing, Jiangsu 210023, China;3.Fujian Provincial Key Laboratory of Information Processing and Intelligent Control, Minjiang University, Fuzhou, Fujian 350121, China;4.Michtom School of Computer Science, Brandeis University, Waltham, MA 02453, USA
Abstract:As a new sentence-level meaning representation, abstract meaning representation (AMR) uses a rooted acyclic directed graph to represent the meaning of a sentence. A large AMR bank has been constructed for English, but the concepts of an AMR graph are not aligned to the words in a sentence, which increases the difficulty in manual annotation as well as automatic parsing. This paper describes the construction of a Chinese AMR corpus, based on guidelines adapted from English for Chinese-specific properties. We also designs an efficient annotation framework that incorporates concept-to-word alignment, taking advantage of the morphology-poor nature of Chinese. We have annotated the AMRs of 6 923 sentences selected from the Chinese TreeBank, among which 48% of the sentences are graphs, 1% of the sentences are cycles, and 32% have non-projective subtrees. We plan to publicly release this data for linguistic and NLP research.
[1]Katz J J,Fodor J A.The structure of a semantic theory [J].Language,1963,39(2), 170-210. [2]Montague.Universal Grammar[J].Theoria,1970,36:373-398. [3]Jackendoff R.Towards an explanatory semantic representation[J].Linguistic Inquiry,1976,7(1):89-150. [4]Banarescu L,Bonial C,Cai S,et al.Abstract meaning representation for sembanking [C]//Proceedings of the 7th Linguistic Annotation Workshop,Sophia,Bulgaria,2013:178-86. [5]Bos J.Expressive power of abstract meaning representations[J].Computational Linguistics,2016,42(3):527-535. [6]May J.SemEval-2016 Task 8:Meaning representation parsing[C]//Proceedings of SemEval-2016,San Diego,California,2016:1063-1073. [7]Pourdamghani N,Gao Y,Hermjakob U,et al.Aligning English strings with abstract meaning representation graphs[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP).2014:425-429. [8]Xue N,Bojar O,Hajicˇj, et al.Not an interlingua,but close:Comparison of English AMRs to Chinese and Czech [C]//Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC’14),Reykjavik,Iceland,May 26-31,2014:1765-1772. [9]李斌,闻媛,卜丽君,等.英汉《小王子》AMR语义图结构的对比分析[J].中文信息学报,2017,31(1):50-57. [10]Palmer M,Daniel G,Paul K.The Proposition Bank:An Annotated Corpus of Semantic Roles [J].Computational Linguistics,2005,Vol.31(1):71-106. [11]Wang Y,Guo J,Che W,et al.Transition-based Chinese semantic dependency graph parsing[C]//Proceedings of China National Conference on Chinese Computational Linguistics.Yantai,China.2016:12-24. [12]Chen B,Ji D.Chinese semantic parsing based on dependency graph and feature structure[C]//Proceedings of the International Conference on Electronic14 and Mechanical Engineering and Information Technology,2011,4:1731-1734. [13]袁毓林,詹卫东,施春宏.汉语“词库—构式”互动的语法描写体系及其教学应用[J].语言教学与研究,2014(2):17-25. [14]Banarescu L,Bonial C,Cai S,et al.Abstract meaning representation (AMR) 1.2.2 specification[DB/OL].[2015].https://github.com/amrisi/amr-guidelines/blob/master/amr.md. [15]Cai S,Knight K.Smatch:an evaluation metric for semantic feature structures[C]//Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics.Sofia,Bulgaria,August 4-9,2013:748-752. [16]Xue N,Xia F,Chiou F,et al.The penn Chinese TreeBank:Phrase structure annotation of a large corpus[J].Natural Language Engineering,2005,11(2):207-238. [17]Hays D.Dependency theory:A formalism and some observations[J].Language,1964,40(4):511-525. [18]Percival W K.Refelections on the history of dependency notions in linguistics[J].Historiographia Linguistica,1990,17(1-2):29-47. [19]Holan TomDsˇ,Vladislav Kuboň,Karel Oliva,et al.Two useful measures of word order complexity[C]//Alain Polguère and Sylvain Kahane,eds.Proceedings of Dependency-Based Grammars Workshop,COLING/ACL,1998:21-28. [20]Havelka Jirí. Beyond projectivity:multilingual evaluation of constraints and measures on non-projective structures[C]//Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL),Prague,Czech Republic,2007:608-615. [21]郑丽娟,邵艳秋,杨尔弘.中文非投射语义依存现象分析研究[J].中文信息学报,2014,28(6):41-47.