Language Analysis and Calculation
LIU Yahui, YANG Haoping, LI Zhenghua, ZHANG Min
2020, 34(4): 10-20.
As the main formalism of shallow semantic parsing, semantic role labeling is one of the hot research topics in natural language processing (NLP). There are three main problems in current existing annotation guidelines (i.e., the PropBank annotation guideline and the Peking University guideline). First, the span-based argument representation complicates the annotation process. Second, it is difficult to define the frames of the predicates in the PropBank annotation guideline. Third, the Peking University guideline does not annotate omitted arguments. Through thorough investigation of existing Chinese and English annotation guidelines, we develop a lightweight annotation guideline for Chinese semantic role labeling suitable for ordinary annotators by combining the advantages of existing guidelines and considering the real problems during our annotation process. First, we choose the word-based argument representation to avoid determination of span boundary and thus reduce annotation difficulty. Second, annotators can directly annotate the arguments of a predicate word according to the sentential context information, without pre-defining all semantic frames of the predicate word. Third, we explicitly annotate the omitted core arguments to more precisely describe the semantic information of sentences. Additionally, in order to ensure the annotation consistency and improve the quality of annotation, the proposed guideline gives clear priority and difficulty analysis for various complex linguistic phenomena.