基于模式分类的汉语时态确定方法研究

林达真,李绍滋

PDF(308 KB)
PDF(308 KB)
中文信息学报 ›› 2006, Vol. 20 ›› Issue (1) : 69-77.

基于模式分类的汉语时态确定方法研究

  • 林达真,李绍滋
作者信息 +

A Pattern-classification Based Solution for the Recognition of Tense of the Chinese Language

  • LIN Da-zhen,LI Shao-zi
Author information +
History +

摘要

汉语时态是中文信息处理领域的一个难点。基于规则的处理方法在无时态特征词的句子,多时态特征词的句子处理等方面存在很大问题。本文从统计的角度,提出一种基于模式分类的时态确定方法,该方法综合评价句子中每个词对时态确定所作的贡献,能够处理无时态特征词的句子和多时态特征词的句子,并且该方法使用线性判别函数,具有对多维数据分析,训练与判别速度快的特性。在开放测试环境下,对单句的汉语时态确定正确率与召回率分别为79.8%和95.3%。

Abstract

As far as NLP is concerned , the tense of the Chinese language is especially hard to tackle. One of the outstanding characteristics of the Chinese language is that its tense is usually implied rather than obvious. Hence , the Rule-based solution is far from suitable for the recognition of tense in situations where tense-informing words are missing or more than one of such words are present. In this paper , we introduce a pattern-classification based solution , which evaluates each single word in terms of its contribution to the recognition of tense for the concerned sentence. This solution proves effective when processing sentences containing none or more than one tense-informing words. Furthermore , the implementation of linear discriminating function in this solution leads to its abilities of multi-dimensional data processing and training , and helps to achieve decent performance. Evaluated under open conditions , the Precision and the Recall of this solution for single sentences are 79.8% and 95.3% , respectively.

关键词

计算机应用 / 中文信息处理 / 汉语 / 时态 / 特征词 / 线性判别函数 / 感知器准则函数

Key words

computer application / Chinese information processing / Chinese / tense / characteristic words / linear discriminant function / perceptron criterion function

引用本文

导出引用
林达真,李绍滋. 基于模式分类的汉语时态确定方法研究. 中文信息学报. 2006, 20(1): 69-77
LIN Da-zhen,LI Shao-zi. A Pattern-classification Based Solution for the Recognition of Tense of the Chinese Language. Journal of Chinese Information Processing. 2006, 20(1): 69-77

参考文献

[1] 马红妹,王挺,陈火旺. 汉语篇章时间短语的分析与时制验算[J] . 计算机研究与发展,2002 ,39 (10) :1211 - 1220.
[2] 孙广范. 汉英机器翻译系统中动词时态的处理[A] . HNC(第二届)学术研讨会论文论文集[C] . 2003年9月20~22日,北京.
[3] 程节华,戴新宇,陈家骏,王启祥. 汉英机器翻译中时体态处理[J] . 计算机应用研究,2004 , (3) :79 - 80.
[4] 王凌飞. 汉英机译系统中上下文处理的研究[D] . 厦门大学硕士学位论文,2000.
[5] 马红妹. 汉英机器翻译中汉语上下文语境的表示与应用研究[D] . 国防科学技术大学博士学位论文, 2002.
[6] 龚千炎. 汉语的时相时制时态[M] . 北京:商务印书馆,1995.
[7] 龚千炎. 谈现代汉语的时制表示和时态表达系统[J] . 中国语文,1991 , (4) :251 - 261.
[8] 程立民. 汉语的时态和时态成分[J] . 语言研究,2002 , (3) :14 - 31.
[9] 戴耀晶. 现代汉语时体系统研究[M] . 浙江:浙江教育出版社,1997.
[10] 郭锐. 现代汉语词类研究[M] . 北京:商务印书馆,2002.
[11] 李宏东,姚天翔等译. 模式分类[M] . 北京:机械工业出版社,2003.

基金

国家863高科技项目(2001AA114110);福建省自然科学基金资助项目(A0310009);福建省科技重点项目(2001J005)
PDF(308 KB)

Accesses

Citation

Detail

段落导航
相关文章

/