基于潜在语义索引的文本浏览机制

林鸿飞,姚天顺

PDF(193 KB)
PDF(193 KB)
中文信息学报 ›› 2000, Vol. 14 ›› Issue (5) : 49-56.
综述

基于潜在语义索引的文本浏览机制

  • 林鸿飞1,姚天顺2
作者信息 +

Text Browsing Based on Latent Semantic Indexing

  • LIN Hong-fei1,YAO Tian-shun2
Author information +
History +

摘要

文本浏览是伴随着因特网上日益增多的在线文本而出现的辅助阅读机制,本文给出了基于潜在语义索引的文本浏览机制。它吸取了潜在语义索引和概念标注的优点,利用潜在语义索引,减少词汇间的“斜交”现象,在语义空间上进行项与项、文本与文本、项与文本之间的相似度计算。利用概念词典将文本特征项按语义分类,给予层次分类以确定的含义。最后,实现以分层概念为基础的信息导航。

Abstract

Text browsing is the assistant reading mechanism to help users browse the online texts. Text browsing based on Latent Semantic Indexing (LSI) is presented in this paper ,and it combines LSI with concept tagging to improve the efficiency of users reading. It applies LSI to reduce the skew intersections and calculates the similarity between terms and texts based on the semantic space ,it also divides the terms into several semantic classes and determines the meanings of classes. In additional ,it implements the information navigation based on conceptual tree.

关键词

文本浏览 / 潜在语义索引 / 概念标注 / 特征抽取

Key words

text browsing / latent semantic indexing / concept tagging / text feature extraction

引用本文

导出引用
林鸿飞,姚天顺. 基于潜在语义索引的文本浏览机制. 中文信息学报. 2000, 14(5): 49-56
LIN Hong-fei,YAO Tian-shun. Text Browsing Based on Latent Semantic Indexing. Journal of Chinese Information Processing. 2000, 14(5): 49-56

参考文献

[1] 姚天顺等. 自然语言理解. 北京:清华大学出版社,1995
[2] 林鸿飞,战学刚,姚天顺. 文本层次分析与文本浏览. 中文信息学报,1999 ,14 (4) :8 - 12
[3] 林鸿飞,战学刚,姚天顺. 基于概念的文本分析方法. 计算机研究与发展,2000 ,37 (3) :324 - 328
[4] 吴立德. 大规模中文文本处理. 上海:复旦大学出版社,1997
[5] Salton G,Amit Singhal. Automatic Text Decomposition Using Text Segments and Text Themes , In Proceedings 7th ACM Conference on Hypertext ,Washington ,D. C. ,1996
[6] Koller D ,Sahami M. Hierarchically Classifying Documents Using very Few Words , In : Proceedings of the 14th International Conference on Machine Learning ,1997
[7] Yang Y,Pedersen J . A Comparative Study on Feature Selection in Text Categorization. In : Proceedings of the 14th International Conference on Machine Learning ,1997.
[8] Berry M W,Dumais S T ,O’brien G W. Using linear algebra for intelligent information retrieval ,SIAM Review ,1995 ,37 (4) : 573 - 595
[9] Deerwester S ,Dumais S T , Furnas G W et al . Indexing by Latent Semantic Analysis ,Journal of the American Society for Information. Science ,1990 ,41 (6) :391 - 407

基金

国家自然科学基金资助项目(编号:69675019),国家教委博士点基金资助项目
PDF(193 KB)

788

Accesses

0

Citation

Detail

段落导航
相关文章

/