基于神经元网络的汉语短语边界识别

奚晨海,孙茂松

PDF(275 KB)
PDF(275 KB)
中文信息学报 ›› 2002, Vol. 16 ›› Issue (2) : 20-26.

基于神经元网络的汉语短语边界识别

  • 奚晨海,孙茂松
作者信息 +

Automatic Prediction of Chinese Phrase Boundary Location with Neural Networks

  • XI Chen-hai,SUN Mao-song
Author information +
History +

摘要

短语边界的识别是浅层句法分析或组块分析的基础,对真实文本的处理具有重要意义。在一个含有64426词的汉语树库的支持下,本文设计并实现了基于神经元网络的汉语短语边界自动识别模型。初步实验结果显示,该模型的界定准确率为93.24%(封闭测试)和92.56%(开放测试)。

Abstract

Prediction of Chinese phrase boundary location is the base of shallow parsing or chunk parsing. It is also very important for processing real texts. With the support of our Chinese treebank including 64426 words , this paper designs and implements a method for automatic prediction of Chinese phrase boundary location based on neural network. The preliminary results show that the precision is 93.24%(close testing) and 92.56%(open testing) respectively.

关键词

汉语短语边界自动识别 / 神经元网络 / 中文信息处理

Key words

automatic prediction of Chinese phrase boundary location / neural network / Chinese information processing

引用本文

导出引用
奚晨海,孙茂松. 基于神经元网络的汉语短语边界识别. 中文信息学报. 2002, 16(2): 20-26
XI Chen-hai,SUN Mao-song. Automatic Prediction of Chinese Phrase Boundary Location with Neural Networks. Journal of Chinese Information Processing. 2002, 16(2): 20-26

参考文献

[1] Abney ,Steven. Parsing by chunks. In Robert Berwick ,Steven Abney and Caro Tenny ,eds. , Principle-Based Parsing. Kluwer Academic Publishers , Dordercht . 1991
[2] Abney ,Steven. Partial Parsing via finite-state cascades. In Proceedings of the ESSLLI’96 Robust Parsing Worksho. Prague ,Czech Repulic. 1996
[3] Chen , Hsin-His and Lee , Yue-Shi. Development of a partially bracketedcorpus with part-of-speech information only. In Proceedings of the 3rd Workshopon Very Large Corpora ,1994. 162 - 172 , Boston ,Massachusetts , USA
[4] Church ,K. A stochastic parts program and noun phrase parser forunrestricted text . In Proceedings of the Second Conference on Applied Natural Language Processing ,136 - 143 ,Austin ,Texas ,USA. 1998
[5] Rob Koeling. 2000. Chunking with maximum entropy models. In Proceedings of CoNLL - 2000 and LLL - 2000 ,Lisbon ,Portugal
[6] Ramshaw L. and M. Marcus. 1995. Text Chunking using transformation-based learning. In Proceedings of the 3rd Workshop on Very Large Corpora ,Boston ,Massachusetts ,USA
[7] Erik F. Tjong Kim Sang ,Sabine Buchholz. 2000. Introduction to the CoNLL-2000 Shared Task :Chunking. In Proceedings of CoNLL-2000 and LLL-2000 ,pp 127 - 132 ,Lisbon ,Portugal
[8] Helmut Schmid. 1994. Part-of-speech Tagging with Neural Networks. In Proceedings of the 15th International Conference on Computational Linguistics ( COLING - 94) , Kyoto ,Japan
[9] 周强. 一个短语自动界定模型,软件学报,1996 ,7 (增) :315 - 322
[10] 周强,孙茂松,黄昌宁. 汉语句子的组块分析体系,计算机学报,1999 ,22 (2) :1158 - 1165
[11] 孙宏林,俞士汶. 浅层句法分析方法概述,当代语言学,2000 ,2 (2) ,74 - 83
[12] 王伟. 人工神经网络原理—入门应用,北京:北京航空航天大学出版社,1995

基金

国家重点基础研究发展规划项目(编号:G1998030507)
PDF(275 KB)

744

Accesses

0

Citation

Detail

段落导航
相关文章

/