李文杰1,2,穗志方1,2. 基于并列结构的概念实例和属性的同步提取方法[J]. 中文信息学报, 2012, 26(2): 82-88.
LI Wenjie1,2, SUI Zhifang1,2. To Extract Concept Instances and Concept Attributes Based on Coordinate Structure. , 2012, 26(2): 82-88.
To Extract Concept Instances and Concept Attributes Based on Coordinate Structure
LI Wenjie1,2, SUI Zhifang1,2
1. Institute of Computational Linguistics, Peking University, Beijing 100871, China;2. Key Laboratory of Computational Linguistics (Ministry of Education), Peking University, Beijing 100871, China
Abstract:Most researches on concept instances and concept attributes extraction focuses on pattern-based approaches, which usually suffer from a low recall rate. In this paper, we present a method of extracting concept instances and concept attributes based on the coordinate structure. Since a part of candidate instances and attributes extracted by the coordination patterns can be putted into the similar-concept-phrases sets in advance, we can use these similar-concept-phrases sets to expand the extraction results in the procedure of co-occurrence pattern-based extraction. Compared with the baseline without using the coordination patterns, experimental results show that the coverage of this method is significantly improved without reducing the precision. Key wordscoordinate structure;Search Engine;instances extraction;attributes extraction;contextual pattern