本文提出了一种基于SVD(奇异值分解)的双语信息过滤算法,将双语文档进行了统一的表示,使得适应于单语过滤的算法可以方便地用于双语过滤,同时对文档向量进行了压缩,滤去了噪声。在应用方面,将双语过滤算法用于互联网上的个性化主动信息过滤。
Abstract
This paper introduces a SVD method in bilingual information filtering. It gives an uniform presentation to bilingual documents. Then any arithmetic used in monolingual information filtering can be easily used in bilingual information filtering. Using this method , we can compress the document vector and filter the noise. This method is used in personal information filtering on the Internet . We provide the WWW Bookmark Service. Through user's Bookmark , we can get user's preference and recommend interesting bilingual documents. According to user's feedback , we can improve the quality of information filtering.
关键词
双语信息过滤 /
SVD /
互联网 /
Bookmark服务
{{custom_keyword}} /
Key words
Bilingual information filtering /
SVD /
Internet /
Bookmark Service
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] 张贤达. 现代信号处理. 北京:清华大学出版社,1995 ,68 - 69
[2] Carbonell Jaime G, Yang Yiming , Frederking Robert E et al . Translingual Information Retrival : A Comparative Evalution. IJCAI97. 1997 ,708 - 714
[3] Belkin N J ,Croft WB. Information filtering and information retrieval : Two sides of the same coin. Communication of ACM 35 ,1992 ,12 (Dec.) : 29 - 38
[4] Vapnik V. The Nature of Statistical Learning Theory. New York : Springer , 1995
[5] Salton G. Automatic Text Processing : The Transformation , Analysis , and Retrieval of Information by Computer. Pennsylvania :Addison-Wesley , 1989
[6] Buckley C , Salton G, Allan J et al . Automatic Query Expandsion Using SAMRT : TREC 3. In : Overview of the Third Text Retrieval Conference (TREC - 3) , 1995 ,69 - 80
[7] Wong S K M ,Ziarko W,Wong P C N. Generalized Vector Space Model In Information Retrieval. In : ACM SIGIR Conference on Research and Development in Information Retrieval. (SIGIR'85) , 1985 ,18 - 25
[8] Deerwester S ,Dumais S T ,Furnas GW et al . Indexing by Latent Semantic Analysis. In : J Amer SocInf Sci 1 , 1990 ,6 :391 - 407
[9] Dumais S , Landauer T , Littman M. Automatic Cross - Linguistic Information Retrieval using Latent Semantic Indexing. In : Proceedings of SIGIR - 96 , Zurich , Auguest 1996
[10] Joachims T. Text categorization with support vector machine. Technical Report . LS VIII Number 23 , University of Dortmund ,1997
[11] Cortes C ,Vapnik V. Support - Vector Networks. Machine Learning ,1995 ,20 : 273 - 297
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}