面向隐喻识别的词语抽象性度量

贾玉祥; 昝红英; 范 明; 俞士汶; 王治敏

PDF(2163 KB)
PDF(2163 KB)
中文信息学报 ›› 2017, Vol. 31 ›› Issue (3) : 41-47.
语言分析与计算

面向隐喻识别的词语抽象性度量

  • 贾玉祥1; 昝红英1; 范 明1; 俞士汶2; 王治敏3
作者信息 +

Measuring Word Abstractness for Metaphor Recognition

  • JIA Yuxiang 1; ZAN Hongying 1; FAN Ming1; YU Shiwen2; WANG Zhimin3
Author information +
History +

摘要

隐喻通常借助具体的概念来表达抽象的概念。如果能判断出文本中词语所指的概念是具体还是抽象的,即度量出词语的抽象程度,那么这将为隐喻的机器识别提供重要的依据。该文提出基于跨语言知识迁移的汉语词语抽象性度量方法,把英语中的词语抽象性知识迁移到汉语中来。提出基于词语抽象性知识的隐喻识别方法,并详细分析了词语抽象性与隐喻之间的关系。实验表明,知识迁移是可行的,基于抽象性知识的隐喻识别有较高的准确率,可以有效提高从真实文本中抽取隐喻的效率。

Abstract

In metaphors, abstract things are usually described in terms of concrete things. If we can decide whether a word is concrete or abstract, we will provide useful clues for automatic metaphor recognition. This paper proposed a cross-lingual knowledge transfer method to adapt English word abstractness knowledge to Chinese. Then we propose a metaphor recognition method based on word abstractness and analyze in detail the relation between word abstractness and metaphor. Experimental results show that, the cross-lingual knowledge transfer method is feasible to measure Chinese word abstractness, the abstractness-based metaphor recognition method achieves a high precision score, and it can improve the efficiency of metaphor extraction from real texts.

关键词

隐喻识别 / 词语抽象性 / 跨语言知识迁移

Key words

metaphor recognition / word abstractness / cross-lingual knowledge transfer

引用本文

导出引用
贾玉祥; 昝红英; 范 明; 俞士汶; 王治敏. 面向隐喻识别的词语抽象性度量. 中文信息学报. 2017, 31(3): 41-47
JIA Yuxiang ; ZAN Hongying ; FAN Ming; YU Shiwen; WANG Zhimin. Measuring Word Abstractness for Metaphor Recognition. Journal of Chinese Information Processing. 2017, 31(3): 41-47

参考文献

[1] Brysbaert M, Warriner A B, Kuperman V. Concreteness ratings for 40 thousand generally known English word lemmas[J]. Behavior research methods, 2014, 46(3): 904-911.
[2] Hill F, Korhonen A, Bentz C. A quantitative empirical analysis of the abstract/concrete distinction[J]. Cognitive science, 2014, 38(1): 162-177.
[3] Kwong O Y. Measuring concept concreteness from the lexicographic perspective[C]//Proceedings of PACLIC, 2011: 60-69.
[4] Kwong O Y. A preliminary study on the impact of lexical concreteness on Word Sense Disambiguation[C]//Proceedings of PACLIC, 2008: 235-244.
[5] Hill F, Reichart R, Korhonen A. Simlex-999: Evaluating semantic models with (genuine) similarity estimation[J]. arXiv preprint arXiv: 1408.3456, 2014.
[6] Tanaka S, Jatowt A, Kato M P, et al. Estimating content concreteness for finding comprehensible documents[C]//Proceedings of the sixth ACM international conference on Web search and data mining, 2013: 475-484.
[7] Turney P, Neuman Y, Assaf D, et al. Literal and metaphorical sense identification through concrete and abstract context[C]//Proceedings of the 2011 Conference on the Empirical Methods in Natural Language Processing, 2011: 680-690.
[8] Dunn J. What metaphor identification systems can tell us about metaphor-in-language[C]//Proceedings of the First Workshop on Metaphor in NLP, 2013: 1-10.
[9] Dunn J. Multi-dimensional abstractness in cross-domain mappings[C]//Proceedings of ACL, 2014: 27-32.
[10] Tsvetkov Y, Mukomel E, Gershman A. Cross-lingual metaphor detection using common semantic features[C]//Proceedings of the First Workshop on Metaphor in NLP, 2013: 45-51.
[11] Tsvetkov Y, Boytsov L, Gershman A, et al. Metaphor detection with cross-lingual model transfer[C]//Proceedings of ACL, 2014: 248-258.
[12] Coltheart M. The MRC psycholinguistic database[J]. The Quarterly Journal of Experimental Psychology, 1981,(33): 497-505.
[13] Hill F, Reichart R, Korhonen A. Multi-modal models for concrete and abstract concept meaning[J]. Transactions of the Association for Computational Linguistics, 2014, (2): 285-296.
[14] 贾玉祥, 俞士汶. 基于词典的名词性隐喻识别[J]. 中文信息学报, 2011, 25(2): 99-104.
[15] Jia Y X, Zan H Y, Fan M, et al. Word Relevance Computation for Noun-Noun Metaphor Recognition[C]//Proceedings of Chinese Lexical Semantics Workshop, Springer International Publishing, 2014: 251-259.
[16] 董振东, 董强. 知网[OL]. http://www.keenage.com.
[17] HIT-SCIR. 同义词词林(扩展版)[OL]. http://ir.hit.edu.cn.
[18] 王治敏. 汉语名词短语隐喻识别研究[D]. 北京大学博士学位论文, 2006.

基金

国家自然科学基金(61402419, 61170163);国家社会科学基金(14BYY096);国家重点基础研究发展计划 973 课题(2014CB340504);计算语言学教育部重点实验室(北京大学)开放课题(201301, 201401)
PDF(2163 KB)

Accesses

Citation

Detail

段落导航
相关文章

/