Abstract
Queries are the primary means by which Web users express their information needs when searching, and the related terms suggested by a search system are an effective tool for refining those queries. This paper takes both as its objects of study and characterizes them from the perspective of user behavior. Log mining is first used to give an overall quantitative description of queries. The high-frequency query words are then qualitatively classified into primary and auxiliary keywords, and the results are compared with those of a questionnaire survey. The main findings are that Web users make heavy use of auxiliary keywords, the content of primary keywords is relatively concentrated, queries are short, and query structure is relatively simple. For related terms, the combined results of the questionnaire survey and a controlled experiment show that participants approve of the related terms suggested by search engines to a high degree but apply them to a low degree. The study provides empirical results for understanding the language Web users employ when searching and offers a reference for improving search engine indexing.
Key words: computer application; Chinese information processing; Chinese search engines; information behavior; language utilization; log mining; questionnaire survey; controlled experiment