%0 Journal Article %A SONG Heng %A CAO Cungen %A WANG Ya %A WANG Shi %T Construction of a Finely-Grained Training Dataset for Chinese Semantic-Role Labeling %D 2023 %R %J Journal of Chinese Information Processing %P 52-66,73 %V 36 %N 12 %X Semantic roles play an important role in the natural language understanding, but most of the existing semantic-role training datasets are relatively rough or even misleading in labeling semantic roles. In order to facilitate the fine-grained semantic analysis, an improved taxonomy of Chinese semantic roles is proposed by investigating a real-world corpus. Focusing on a corpus formed with sentences with only one pivotal semantic role, we propose a semi-automatic method for fine-grained Chinese semantic role dataset construction. A corpus of 9,550 sentences has been labeled with 9,423 pivot semantic roles, 29,142 principal peripheral semantic roles and 3,745 auxiliary peripheral semantic roles. Among them, 172 sentences are double-labeled with semantic roles and 104 sentences are labeled with semantic roles of uncertain semantic events. With a Bi-LSTM+CRF model, we compare the dataset against the Chinese Proposition Bank and reveal differences in the recognition of principal peripheral semantic roles, which provide clues for further improvement. %U http://jcip.cipsc.org.cn/EN/abstract/article_3441.shtml