查询扩展是提高检索性能的有效方法。为了弥补在数据集中由于词对没有直接出现而导致无法统计出词间关系进行查询扩展的缺陷,该文通过提取Markov网络中的词团信息来量化词间的混合相关性,将强化后的词间混合相关性应用于信息检索扩展模型中。实验表明 基于混合相关的Markov网络信息检索扩展模型的检索效果优于基于直接相关的查询扩展模型;此外,该文提出的模型在总体检索性能上略优于基于团的Markov网络信息检索模型,但在词团提取上大大减少了计算开销。
Abstract
Query expansion is effective to improve retrieval efficiency. In this paper, the mixed correlation between terms is quantized by term cliques which are obtained from Markov network, so as to solve the computation of the term relationship lack of cooccurence in corpus. The enhanced mixed correlation is then applied to query expansion. The experimental results show that the proposed method outperforms that based on direct correlation. In addition, the method is slightly better than a Markov network model based on cliques significantly reduces the computational overhead of term cliques.
Key wordsmixed correlation; Markov network; query expansion
关键词
混合相关 /
Markov网络 /
查询扩展
{{custom_keyword}} /
Key words
mixed correlation /
Markov network /
query expansion
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] 陈燕红,黄名选. 基于APriori改进算法的局部反馈查询扩展[J].现代图书情报技术,2007,09:84-87.
[2] C Lioma, B Larsen, W Lu. Rhetorical relations for information retrieval[C]//Proceedings of the 35th annual international ACM SIGIR conference on research and development in information retrieval, 2012: 931-940.
[3] D Metzler, W B Croft. Latent Concept Expansion Using Markov Random Fields[C]//Proceedings of the 30th annual international ACM SIGIR conference on research and development in information retrieval, 2007: 311-318.
[4] Yuan Lin, Hongfei Lin,Song Jin. Social Annotation in Query Expansion: a Machine Learning Approach[C]//Proceedings of the 34th annual international ACM SIGIR conference on research and development in information retrieval, 2011: 405-414.
[5] Fonseca B M, Golgher P B, Moura E S de. Discovering Search Engine Related Query Using Association Rules [J]. Journal of Web Engineering, 2004, 2(4): 215-227.
[6] 刘文飞,林鸿飞.基于网页查询结果的广告查询扩展研究[J].中文信息学报,2012,26(5):88-94.
[7] Dai Jiahong. Fuzzy cluster-based query expansion [D]. Master Thesis, Department of Information Management, National Sun Yat-sen University, Taiwan, 2004.
[8] 姚小同.查询扩展技术研究[D].北京:北京邮电大学,2009.
[9] 左家莉.基于Markov网络的信息检索模型[D].南昌:江西师范大学, 2005.
[10] 钟茂生,刘慧,刘磊.词汇间语义相关关系量化计算方法[J].中文信息学报,2009,23(2):115-122.
[11] 曹瑛,王明文,陶红亮.基于Markov 网络的检索模型[J].山东大学学报(理学版),2006,41(3):126-130.
[12] 石松.基于Markov团的信息检索扩展模型[D].南昌:江西师范大学, 2011.
[13] 刘宏哲,须德.基于本体的语义相似度和相关度计算研究综述[J].计算机科学,2012,39(2):8-13.
[14] 甘丽新.基于Markov概念的信息检索模型[D].南昌:江西师范大学, 2007.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
江西省教育厅科技资助项目(GJJ11224);江西省自然科学基金资助项目(20122BAB211032);2011年江西省高校省级教改资助项目(JXJG-11-13-19)
{{custom_fund}}