基于相关文档池建模的查询扩展

吕碧波,赵军

PDF(411 KB)
PDF(411 KB)
中文信息学报 ›› 2006, Vol. 20 ›› Issue (3) : 80-85.

基于相关文档池建模的查询扩展

  • 吕碧波,赵军
作者信息 +

Query Expansion Based on Modeling of Relevant Documents Pool

  • LV Bi-bo,ZHAO Jun
Author information +
History +

摘要

在信息检索领域,相关反馈是提高检索性能的有效方法之一。所谓相关反馈,指用户按照一定策略从查找到的相关文档中选择一些和主题相关的词进行查询扩展的技术。本文介绍了概率模型和向量空间模型下的常用查询扩展方法,并提出了一种基于语言模型的相关反馈方法,该方法同时考虑了扩展词应该具备的两个特征,即相关性和覆盖性。在TREC测试集上对这些算法进行了比较,结果表明这种新算法在平均准确率上比传统方法有所提高。

Abstract

In information retrieval, relevance feedback is an effective way to improve retrieval performance. The goal is to input user's judgement on previous retrieved documents, and to select some terms for query expansion using certain strategy. This paper introduces some common query expansion approaches in relevance feedback based on probability model and vector space model, then a new term selection method is introduced based on language model,which takes into account two features of expanded terms - "relevance" and "coverage". The evaluation is conducted on the TREC Collection, which shows that our method is better than traditional ones on average precision.

关键词

计算机应用 / 中文信息处理 / 信息检索 / 相关反馈 / 查询扩展

Key words

computer application / Chinese information processing / information retrieval / relevance feedback / query expansion

引用本文

导出引用
吕碧波,赵军. 基于相关文档池建模的查询扩展. 中文信息学报. 2006, 20(3): 80-85
LV Bi-bo,ZHAO Jun. Query Expansion Based on Modeling of Relevant Documents Pool. Journal of Chinese Information Processing. 2006, 20(3): 80-85

参考文献

[1] Ming Zhang , Ruihua Song , Chuan Lin , et al . Expansion-Based Techologies in Finding Relevant and New information[A]. TREC 2002 [C].
[2] 贺宏朝,何丕廉,等. 一种基于上下文的中文信息检索查询扩展[J]. 中文信息学报, 2002, 16 (6) : 32 - 37.
[3] Rocchio, J. J. Relevant Feedback in Information Retrieval [M] , Chapter 14, pages 313 - 323. Prentice - Hall Inc. 1971.
[4] Maron M. E. , Kuhns J. L. On Relevance, Probabilistic Indexing and Information Retrieva1 [J]. Journal of the Association for Computer Machinery. 1960, 7: 216 - 244.
[5] Rocchio J. J. Relevance Feedback in Information Retrieva1. In Salton G. (Ed.) , The SMART Retrieval System [M]. 1971. Engle-wood CIifs, Prentice-Hall, Inc. 3l3 - 323.
[6] S E Robertson , S Walker , M Beaulieu . Okap i at TREC27: automatic ad hoc, filtering ,VLC and interactive [A]. TREC - 7 [C].
[7] S E Robertson and SWalker. Okap i/Keenbow at TREC28. TREC28 [C].
[8] S E Robertson and K. Sparck Jones. Relevance Weighting of Searching Terms[J]. Journal of the American Society for Information Sciences. 1976, 27 (3) : 129 - 146.
[9] S E Robertson. On term selection for query expansion. Journal of Documentation [J]. 1990, 46: 359 - 364.
[10] J. M. Ponte and W. B. Croft. A Language modeling approach to IR [A]. In: proceedings of the ACM SIGIR Conference [C]. 1998, 275 - 281.
[11] C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval[A]. In: proceedings of the ACM-SIGIR 2001 [C]. 334 - 342.

基金

国家自然科学基金资助项目(60372016);北京市自然科学基金资助项目(4052027)
PDF(411 KB)

535

Accesses

0

Citation

Detail

段落导航
相关文章

/