Language Analysis and Calculation
LIU Tong, HUANG Degen, ZHANG Cong
2017, 31(6): 25-32.
A method of prepositional phrase recognition based on fusion of multiple models is proposed to deal with coordinate prepositional phrases and improve the performance of nested prepositional phrase recognition. First, a simple noun phrase recognition model is used to identify and merge the phrases in the corpus in order to simplify corpus and reduce internal complexity of prepositional phrases, Then, the CRF model is used to identify the inner layer of the nested prepositions phrases, i.e. if the preposition phrases is nested, recognize the inner layer, otherwise, recognize the whole preposition phrase, Finally, merge the recognized inner prepositional phrases in the corpus and modify the feature information in order to train a new model for outer prepositional phrase recognition. In addition, after the recognition of both inner and outer prepositional phrases, a double error correction system is used to correct the recognized phrases. Five-fold cross validation is conducted on the corpus of People’s Daily of 2000 including 7 028 prepositional phrases, and the results achieve 94.11% in precision, 94.02% in recall, and 94.06% in F-measure, outperformaing the baseline by 1.09%, 1.07%, 1.08%.