PARK Minjun, LI Qiang, YUAN Yulin. A Study on the Sequential Patterns of Semantic Constituents of the Bi-Comparative Structure[J]. Journal of Chinese Information Processing, 2016, 30(4): 12-20.
A Study on the Sequential Patterns of Semantic Constituents of the Bi-Comparative Structure
PARK Minjun¹, LI Qiang², YUAN Yulin¹
1. Dept. of Chinese Language and Literature, Peking University, Beijing 100871, China; 2. Dept. of Chinese Language and Literature, Shanghai University, Shanghai 200444, China
Abstract: The Bi-structure, which highlights a contrasting characteristic between two elements, is the key comparative sentence structure in Chinese. This structure consists of seven types of semantic items (SUB, BI, OBJ, ITM, DIM, RES, EXT), which can occur in various sequential patterns. To provide meaningful information for the keyword extraction task for this comparative structure, this study first tags the seven semantic items in about 460 sentences. Second, association rules and sequential patterns are extracted using the Apriori and PrefixSpan algorithms, from which six rules of item distribution are established. Finally, the paper explains the rationale behind these six rules, providing a better understanding of the linguistic characteristics relevant to feature selection for the Bi-comparative structure in Chinese.
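To make the sequential-pattern step concrete, the following is a minimal sketch of PrefixSpan-style mining over tag sequences. It is not the authors' implementation: the function name `prefixspan`, the support threshold, and the three toy sequences are assumptions made up for illustration and are not taken from the paper's corpus. Only the seven tag labels come from the abstract.

```python
from collections import defaultdict


def prefixspan(sequences, min_support):
    """Return {pattern: support} for every frequent ordered subsequence.

    Each sequence is a list of single tags (no itemsets), which is a
    simplification of the general PrefixSpan setting.
    """
    results = {}

    def mine(prefix, projected):
        # projected holds (sequence index, start position) pairs: the
        # suffix of each sequence that follows the current prefix.
        first_pos = defaultdict(dict)  # tag -> {sequence index: earliest position}
        for seq_idx, start in projected:
            seq = sequences[seq_idx]
            for pos in range(start, len(seq)):
                tag = seq[pos]
                if seq_idx not in first_pos[tag]:
                    first_pos[tag][seq_idx] = pos
        for tag, occurrences in first_pos.items():
            support = len(occurrences)  # number of sequences containing the tag
            if support >= min_support:
                pattern = prefix + (tag,)
                results[pattern] = support
                # Project each supporting sequence past the matched tag.
                mine(pattern, [(i, p + 1) for i, p in occurrences.items()])

    mine((), [(i, 0) for i in range(len(sequences))])
    return results


# Hypothetical tagged sentences, invented for this sketch only.
toy_sequences = [
    ["SUB", "BI", "OBJ", "RES"],
    ["SUB", "BI", "OBJ", "DIM", "RES", "EXT"],
    ["SUB", "ITM", "BI", "OBJ", "RES"],
]

patterns = prefixspan(toy_sequences, min_support=3)
for pattern, support in sorted(patterns.items(), key=lambda kv: (-len(kv[0]), kv[0])):
    print(" > ".join(pattern), support)
```

With these toy inputs, only orderings shared by all three sequences survive the threshold, so a pattern such as SUB > BI > OBJ > RES is kept while tags that appear in a single sequence (ITM, DIM, EXT) are dropped. Lowering `min_support` would surface the rarer orderings.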