张森,王斌. Web检索查询意图分类技术综述[J]. 中文信息学报, 2008, 22(4): 75-82.
ZHANG Sen, WANG Bin. A Survey of Web Search Query Intention Classification. , 2008, 22(4): 75-82.
Web检索查询意图分类技术综述
张森,王斌
中国科学院 计算技术研究所,北京100190
A Survey of Web Search Query Intention Classification
ZHANG Sen, WANG Bin
Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
Abstract:An increasing number of researches have been focused on the classification of web queries in recent years. This article centers around the researches on automatic query classification according to query intention. It presents a survey of the background of query classification, its key techniques, the classification algorithms and the evaluation methods. And it outlines the problems and challenges in query intention classification, i.e. lack of authoritative evaluation method, the inadequate performance comparisons on large scale dataset, the acquisition of accurate query features, and the issues in the completeness and objectivity of a category system.
[1] Dou Shen, Jian-Tao Sun, Qiang Yang, and Zheng Chen. Building bridges for web query classification[C]//SIGIR ’06: Proceedings of the 29th annual international ACMSIGIR conference on Research and development in information retrieval. New York, NY, USA: ACM Press, 2006,131-138. [2] Daniel E. Rose and Danny Levinson. Understanding user goals in web search[C]//WWW ’04: Proceedings of the 13th international conference on World Wide Web. New York, NY, USA: ACM Press, 2004, 13-19. [3] Andrei Broder. A taxonomy of web search[C]//SIGIR Forum. New York, NY, USA: ACM Press , 2002, 3-10. [4] Uichin Lee, Zhenyu Liu, and Junghoo Cho. Automatic identification of user goals in web search[C]//WWW ’05: Proceedings of the 14th international conference on World Wide Web. New York, NY, USA : ACM Press, 2005, 391-400. [5] Luis Gravano, Vasileios Hatzivassiloglou, and Richard Lichtenstein. Categorizing web queries according to geographical locality[C]//CIKM ’03: Proceedings of the twelfth international conference on Information and knowledge management. New York, NY, USA: ACM Press, 2003, 325-333. [6] Bang Viet Nguyen and Min-Yen Kan. Functional faceted web query analysis[C]//WWW ’07: Workshop of the 16th international conference onWorld Wide Web. New York, NY, USA: ACM Press, 2007. [7] Ricardo A. Baeza-Yates, Liliana Calderon-Benavides, and Cristina N.Gonzalez-Caro. The intention behind web queries[C]//F. Crestani, P. Ferragina and M. Sanderson. SPIRE. Berlin Heidelberg: Spring-Verlag, 2006, 9-109. [8] Amanda Spink, Dietmar Wolfram, Major B. J. Jansen, and Tefko Saracevic. Searching the web: the public and their queries[J]. J. Am. Soc. Inf. Sci. Technol., 2001, 52(3): 226-234. [9] In-Ho Kang and GilChang Kim. Query type classification for web document retrieval[C]//SIGIR ’03: Proceedings of the 26th annual international ACMSIGIR conference on Research and development in information retrieval. New York, NY, USA: ACM Press, 2003, 64-71. [10] Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, David D. Lewis, Abdur Chowdhury, and Aleksander Kolcz. Improving automatic query classification via semi-supervised learning[C]//ICDM ’05: Proceedings of the Fifth IEEE International Conference on Data Mining. Washington, DC, USA: IEEE Computer Society, 2005, 42-49. [11] Bernard J. Jansen, Danielle L. Booth, and Amanda Spink. Determining the user intent of web search engine queries[C]//WWW ’07: Proceedings of the 16th international conference on World Wide Web. New York, NY, USA: ACM Press, 2007, 1149-1150. [12] Yiqun Liu, Min Zhang, Liyun Ru, and Shaoping Ma. Automatic query type identification based on click throughinformation[C]//AIRS. Berlin Heidelberg: Springer-Verlag, 2006, 593-600. [13] Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, David Grossman, David D.Lewis, Abdur Chowdhury, and Aleksandr Kolcz. Automatic web query classification using labeled and unlabeled training data[C]//SIGIR ’05: Proceedings of the 28th annual international ACMSIGIR conference on Research and development in information retrieval. New York, NY, USA: ACM Press, 2005, 58-582. [14] W Krauth and M Mezard. Learning algorithms with optimal stability in neural networks[J]. Journal of Physics A, 1987, 20: 745-752.