王 倩,罗森林,韩 磊,潘丽敏. 基于谓词及句义类型块的汉语句义类型识别[J]. 中文信息学报, 2014, 28(2): 8-16.
WANG Qian, LUO Senlin, HAN Lei, PAN Limin. Chinese Sentential Semantic Type Recognition Based on Predicate and Sentential Semantic Type Chunk. , 2014, 28(2): 8-16.
王 倩,罗森林,韩 磊,潘丽敏
北京理工大学 信息与电子学院信息系统安全对抗实验中心, 北京 100081
Chinese Sentential Semantic Type Recognition Based on Predicate and Sentential Semantic Type Chunk
WANG Qian, LUO Senlin, HAN Lei, PAN Limin
Lab of Information Security & Countermeasures Technology, School of Information & Electronics, Beijing Institute of Technology, Beijing 100081, China
Abstract：According to modern Chinese semantics, there are 4 semantic types (single, complex, compound and multiple). Attempted to capture the overall sentential semantic structures, sentential semantic type recognition is an important step to the whole sentential semantic structure parsing. This paper proposes a 4-semantic-types recognition method based on predicate and sentential semantic type chunk. This method firstly identifies some single semantic type sentences by the predicate number in each sentence. For the rest sentences, C4.5 algorithm is applied to get the maximum number of sentential-semantic-type chunk of predicates in sentential semantic structure, and then the sentential semantic type of each sentence is identified by combining the top sentence node in syntax structure. The experimental data contains 10221 sentences chosen from Beijing Forest Studio-Chinese Tag Corpus. The accuracy rate of sentential semantic type is up to 97.6% in open test.