[1] Rabiner L, Juang B. An introduction to hidden Markov models[J]. IEEE ASSP Magazine, 1986, 3(1): 4-16.
[2] Berger A L, Della Pietra V J, Della Pietra S A. A maximum entropy approach to natural language processing[J]. Computational Linguistics, 1996, 22(1): 39-71.
[3] Lafferty J, McCallum A, Pereira F. Conditional random fields: Probabilistic models for segmenting and labeling sequence data[C]//Proceedings of the ICML, 2001: 282-289.
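[4] Zhang Meishan, Deng Zhilong, Che Wanxiang, et al. Domain-adaptive Chinese word segmentation combining statistics and dictionaries[J]. Journal of Chinese Information Processing, 2012, 26(2): 8-12. (in Chinese)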
[5] Guo Z, Zhang Y, Su C, et al. Exploration of n-gram Features for the Domain Adaptation of Chinese Word Segmentation[C]//Natural Language Processing and Chinese Computing. Springer Berlin Heidelberg, 2012: 121-131.
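[6] Su Chen, Zhang Yujie, Guo Zhen, et al. A Chinese word segmentation method for domain-specific machine translation[J]. Journal of Chinese Information Processing, 2013, 27(5): 184-190. (in Chinese)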
[7] Angluin D. Queries and concept learning[J]. Machine Learning, 1988, 2(4): 319-342.
[8] Settles B. Active Learning Literature Survey[R]. Computer Sciences Technical Report 1648, University of Wisconsin-Madison, 2009.
[9] Zong Chengqing. Statistical Natural Language Processing[M]. Beijing: Tsinghua University Press, 2008. (in Chinese)
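[10] GB/T 13715-1992. Contemporary Chinese language word segmentation specification for information processing[S]. Beijing: Standards Press of China, 1992.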
[11] Xia F. The Segmentation Guidelines for the Penn Chinese Treebank (3.0)[R]. University of Pennsylvania, 2000.
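[12] Duan Huiming, Matsui Kunio, Xu Guowei, et al. Construction and use of a large-scale annotated Chinese corpus[J]. Applied Linguistics, 2000, (2): 72-77. (in Chinese)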

Xu Huating (1991-), assistant experimentalist. Main research interest: natural language processing.
E-mail: xuhuating91@163.com

Zhang Yujie (1961-), corresponding author, professor. Main research interest: natural language processing.
E-mail: yjzhang@bjtu.edu.cn

Yang Xiaohui (1962-), associate professor. Main research interest: computer applications.
E-mail: xhyang@bjtu.edu.cn
Funding
International Science and Technology Cooperation Program of China (2014DFA11350); National Natural Science Foundation of China (61370130)