基于《知网》的汉语未登录词语义相似度计算

张瑞霞1,杨国增2,吴慧欣1

PDF(826 KB)
PDF(826 KB)
中文信息学报 ›› 2012, Vol. 26 ›› Issue (1) : 16-22.
综述

基于《知网》的汉语未登录词语义相似度计算

  • 张瑞霞1,杨国增2,吴慧欣1
作者信息 +

A New Measure of Semantic Similarity between Unknown Chinese Words Based on HowNet

  • ZHANG Ruixia1, YANG Guozeng2, WU Huixin1
Author information +
History +

摘要

提出了一种基于《知网》的汉语未登录词语义相似度计算方法。该方法首先参照意合网络理论构造了语义关系匹配函数;接着在用概念图表示未登录词语义信息的基础上,根据节点在语义表示中的作用不同对其分类;然后应用匹配函数对弧、节点对及节点对集进行分类;最后设计了未登录词的整体相似度、不同类型节点对及节点对集相似度的计算方法。该方法能够合理分类未登录词的语义信息并能将其充分利用到计算过程中,实验结果证明此方法是有效的。

Abstract

A new measure based on HowNet is put forward to compute the semantic similarity between unknown Chinese words. Firstly, the semantic matching function is constructed according the YiHeNet; secondly, nodes in the concept graphs of unknown Chinese words are classified according to their different effects in expressing the semantic information; then, the three notions of arcs, node pairs and node pair sets are classified according to matching functions; finally, similarity measures are designed to compute the similarities of unknown Chinese words, similarities of different node pairs and similarities of different node pair sets. This new measure helps to classify the semantic information of those unknown words and to apply it into the computing course, and experiments prove its effectiveness.
Key wordsHowNet; semantic similarity; unknown words; concept graphs

关键词

《知网》 / 语义相似度 / 未登录词 / 概念图

Key words

HowNet / semantic similarity / unknown words / concept graphs

引用本文

导出引用
张瑞霞1,杨国增2,吴慧欣1. 基于《知网》的汉语未登录词语义相似度计算. 中文信息学报. 2012, 26(1): 16-22
ZHANG Ruixia1, YANG Guozeng2, WU Huixin1. A New Measure of Semantic Similarity between Unknown Chinese Words Based on HowNet. Journal of Chinese Information Processing. 2012, 26(1): 16-22

基金

基金项目河南省科技厅基础研究项目(082300410140)
PDF(826 KB)

Accesses

Citation

Detail

段落导航
相关文章

/