一种基于奇异值分解的双语信息过滤算法

路海明1,徐晋晖2,卢增祥1,李衍达1

PDF(347 KB)
PDF(347 KB)
中文信息学报 ›› 1999, Vol. 13 ›› Issue (3) : 19-26.
综述

一种基于奇异值分解的双语信息过滤算法

  • 路海明,徐晋晖,卢增祥,李衍达
作者信息 +

A SVD Method in Bilingual Information Filtering

  • Haiming Lu , Jinhui Xu, Zengxiang Lu , Yanda Li
Author information +
History +

摘要

本文提出了一种基于SVD(奇异值分解)的双语信息过滤算法,将双语文档进行了统一的表示,使得适应于单语过滤的算法可以方便地用于双语过滤,同时对文档向量进行了压缩,滤去了噪声。在应用方面,将双语过滤算法用于互联网上的个性化主动信息过滤。

Abstract

This paper introduces a SVD method in bilingual information filtering. It gives an uniform presentation to bilingual documents. Then any arithmetic used in monolingual information filtering can be easily used in bilingual information filtering. Using this method , we can compress the document vector and filter the noise. This method is used in personal information filtering on the Internet . We provide the WWW Bookmark Service. Through user's Bookmark , we can get user's preference and recommend interesting bilingual documents. According to user's feedback , we can improve the quality of information filtering.

关键词

双语信息过滤 / SVD / 互联网 / Bookmark服务

Key words

Bilingual information filtering / SVD / Internet / Bookmark Service

引用本文

导出引用
路海明1,徐晋晖2,卢增祥1,李衍达1. 一种基于奇异值分解的双语信息过滤算法. 中文信息学报. 1999, 13(3): 19-26
Haiming Lu1 , Jinhui Xu2, Zengxiang Lu1 , Yanda Li1. A SVD Method in Bilingual Information Filtering. Journal of Chinese Information Processing. 1999, 13(3): 19-26

参考文献

[1] 张贤达. 现代信号处理. 北京:清华大学出版社,1995 ,68 - 69
[2] Carbonell Jaime G, Yang Yiming , Frederking Robert E et al . Translingual Information Retrival : A Comparative Evalution. IJCAI97. 1997 ,708 - 714
[3] Belkin N J ,Croft WB. Information filtering and information retrieval : Two sides of the same coin. Communication of ACM 35 ,1992 ,12 (Dec.) : 29 - 38
[4] Vapnik V. The Nature of Statistical Learning Theory. New York : Springer , 1995
[5] Salton G. Automatic Text Processing : The Transformation , Analysis , and Retrieval of Information by Computer. Pennsylvania :Addison-Wesley , 1989
[6] Buckley C , Salton G, Allan J et al . Automatic Query Expandsion Using SAMRT : TREC 3. In : Overview of the Third Text Retrieval Conference (TREC - 3) , 1995 ,69 - 80
[7] Wong S K M ,Ziarko W,Wong P C N. Generalized Vector Space Model In Information Retrieval. In : ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR'85) , 1985 ,18 - 25
[8] Deerwester S ,Dumais S T ,Furnas GW et al . Indexing by Latent Semantic Analysis. In : J Amer SocInf Sci 1 , 1990 ,6 :391 - 407
[9] Dumais S , Landauer T , Littman M. Automatic Cross - Linguistic Information Retrieval using Latent Semantic Indexing. In : Proceedings of SIGIR - 96 , Zurich , Auguest 1996
[10] Joachims T. Text categorization with support vector machine. Technical Report . LS VIII Number 23 , University of Dortmund ,1997
[11] Cortes C ,Vapnik V. Support - Vector Networks. Machine Learning ,1995 ,20 : 273 - 297
PDF(347 KB)

2906

Accesses

0

Citation

Detail

段落导航
相关文章

/