Temporal Sensitive Learning to Rank Method for Microblog Search
WANG Shuxin1, WEI Bingjie2, LU Xiao2, WANG Bin3
1. University of Chinese Academy of Sciences, Institute of Computing Technology, CAS, Beijing 100190, China; 2. National Computer Network Emergency Response Technical Team/Coordination Center, Beijing 100029, China; 3. Institute of Information Engineering, CAS, Beijing 100093, China
Abstract:Microblog search has become a hot research problem in information retrieval area in recent years. Related work shows that most queries in microblog search are time-sensitive. To address this problem, many existing methods were proposed based on different time-sensitive assumptions, such as, “the newer of a document, the more important it is” or “the closer to the peak point a document is, the more important it is”. All these methods have improved retrieval effectiveness somehow. However, it is hard to summarize the temporal role in ranking of microblog search to one straight forward assumption as above. In this paper, our study on temporal distributions of relevant documents of different queries shows the complexity of temporal role in ranking; therefore, simple straight forward assumptions are not accurate. We proposed to use the temporal and entity evidences of query-document pairs to train a time-sensitive learning to rank model to tackle this problem. As for temporal features, both global features of query and local features of query-documents pair are extracted. Experimental results show that TLTR significantly improves the retrieval effectiveness over existing time aware ranking models on TREC Microblog Track 2011—2012 data set.
[1]Teevan J, Ramage D, Morris M R. TwitterSearch: a comparison of microblog search and web search[C]//Proceedings of the 4th ACM international conference on Web search and data mining. ACM, 2011: 35-44. [2] Kanhabua N, Nrvg K. Learning to rank search results for time-sensitive queries[C]//Proceedings of the 21st ACM international conference on information and knowledge management. ACM, 2012: 2463-2466. [3] Li X, Croft W B. Time-based language models[C]//Proceedings of the 12th international conference on Information and knowledge management. ACM, 2003: 469-475. [4] Efron M, Golovchinsky G. Estimation methods for ranking recent information[C]//Proceedings of the 34th international ACM SIGIR conference on research and development in Information Retrieval. ACM, 2011: 495-504. [5] Wei B, Zhang S, Li R, et al. A time-aware language model for microblog retrieval[R]//Report of TREC Microblog Adhoc Track, 2012. [6] 卫冰洁, 王斌. 面向微博搜索的时间感知的混合语言模型[J]. 计算机学报, 2014, 37(1):229-237. [7] Miyanishi T, Seki K, Uehara K. Combining recency and topic-dependent temporal variation for microblog search[M]//Advances in Information Retrieval. Springer Berlin Heidelberg, 2013: 331-343. [8] Efron M, Lin J, He J, et al. Temporal feedback for tweet search with non-parametric density estimation[C]//Proceedings of the 37th international ACM SIGIR conference on research & development in information retrieval. ACM, 2014: 33-42. [9] Miyanishi T, Seki K, Uehara K. Time-aware latent concept expansion for microblog search[C]//Proceedings of the 8th International AAAI Conference on Weblogs and Social Media. 2014. [10] Herbrich R, Graepel T, Obermayer K. Large margin rank boundaries for ordinal regression[J]. Advances in neural information processing systems, 1999: 115-132. [11] Cao Z, Qin T, Liu T Y, et al. Learning to rank: from pairwise approach to listwise approach[C]//Proceedings of the 24th international conference on machine learning. ACM, 2007: 129-136. [12] Ounis I, Macdonald C, Lin J, et al. Overview of the trec-2011 microblog track[C]//Proceedings of the 20th Text REtrieval Conference. 2011. [13] Metzler D, Cai C. USC/ISI at TREC 2011: Microblog Track[C]//Proceedings of the TREC. 2011. [14] Miyanishi T, Okamura N, Liu X, et al. TREC 2011 Microblog Track Experiments at Kobe University[R]. [15] Zhang X, He B, Luo T, et al. Query-biased learning to rank for real-time twitter search[C]//Proceedings of the 21st ACM international conference on Information and knowledge management. ACM, 2012: 1915-1919. [16] Damak F, Pinel-Sauvagnat K, Boughanem M, et al. Effectiveness of State-of-the-art Features for Microblog Search[C]//Proceedings of the 28th Annual ACM Symposium on Applied Computing. ACM, 2013: 914-919.