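This paper gives a comprehensive examination of boundary ambiguity in Chinese phrase structures. It examines the different types of phrase-structure boundary ambiguity from three angles: the components of the ambiguous pattern, the external influence of the ambiguity, and the correspondence between pattern (type) ambiguity and instance (token) ambiguity. It also presents preliminary statistics on these types. The aim is to further clarify the phrase-boundary ambiguity problems that computers encounter when processing Chinese, for reference by theoretical researchers and developers of applied systems.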
Abstract
This paper analyses boundary ambiguity in Chinese phrase structures as encountered in automatic parsing by computer. The ambiguities can be classified from three perspectives. In terms of the components of the ambiguous structure, ambiguous phrases fall into two kinds: those that include terminal symbols, and those that include only non-terminal symbols. In terms of the influence of the ambiguity, ambiguous phrases also fall into two kinds: self-confined and non-self-confined. The influence of the former lies mainly inside the ambiguous phrase; the influence of the latter extends outside it. In terms of the relation between type and token, ambiguous phrases fall into three kinds: true ambiguity, quasi-ambiguity, and pseudo-ambiguity. Building on this analysis and on a set of rules used in a Chinese-English machine translation system, the paper also surveys the distribution of these types of ambiguous phrases in Modern Chinese. The authors hope that this analysis will help to solve the problem of phrase structure ambiguity in Chinese.
Key words
phrase /
phrase boundary ambiguity /
natural language processing