加羊吉,李亚超,宗成庆,于洪志. 最大熵和条件随机场模型相融合的藏文人名识别[J]. 中文信息学报, 2014, 28(1): 107-112.
JIA Yangji,LI Yachao,ZONG Chengqing,YU Hongzhi. A Hybrid Approach to Tibetan Person Name Identification by Maximum Entropy Model and Conditional Random Fields. , 2014, 28(1): 107-112.
1. National Language Information Technology Laboratory, Nothwest University for Natonalities, Lanzhou, Gansu 730030, China; 2. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Abstract:Tibetan person name recognition is one of the most difficult tasks in the area of Tibetan information processing, with a direct impact on the precision of Tibetan word segmentation. Based on the analysis of wording rules and features of Tibetan names, this paper proposes a method combining maximum entropy and conditional random fields to identify Tibetan person names. The experiment shows that this approach works significant well reaching 93.08% in F1-measure.