本文提出了一种基于链接分析的对Blog信息源进行量化评估的方法,在此基础之上发现重要Blog信息源,既体现了Blog信息的特点,又在一定程度上减小了作弊链接对链接分析结果的影响,能为用户阅读信息提供方便,并可望为Blog信息检索提供一种新的思路。为了证明该评估方法的有效性,本文还提出了Blog信息源重要性的评价指标,对比了重要Blog信息源量化评估方法和评价指标的评分结果,通过相关性分析,表明此方法和评价指标存在高度的一致性。
Abstract
This paper proposes a method of ranking bloggers based on link analysis, which can exemplify the characteristics of blogs and reduce the influence of link spamming. This method can also bring convenience to users to read blogs and supply a new methodology for information retrieval in the blogosphere. To ensure the reliability of the ranking results, some evaluation indicators for the importance of bloggers are given, and the grading result of bloggers using the proposed method is compared with those using these indicators. At last, correlation analysis shows consistency between the proposed method and the evaluation indicators.
关键词
计算机应用 /
中文信息处理 /
重要Blog信息源 /
链接分析 /
评价指标 /
相关性分析
{{custom_keyword}} /
Key words
computer application /
Chinese information processing /
important blogger /
link analysis /
evaluation indicator /
correlation analysis
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Technorati. http://www.technorati.com [EB/OL]. February 2007.
[2] ComScore Networks, Inc. Behaviors of the blogosphere: understanding the scale, composition and activities of weblog audiences [EB/OL]. http://www.comscore.com/blogreport /comScoreBlogReport.pdf. November 2005.
[3] B. L. Tseng, Junichi Tatemura, Yi Wu. Tomographic clustering to visualize blog communities as mountain views [A]. The 14th International World Wide Web Conference [C]. Chiba: Japan, May 2005.
[4] E. Adar, L. Zhang, L. Adamic, et al. Implicit structure and the dynamics of blogspace [A]. The 13th International World Wide Web Conference [C]. New York, USA: May 2004.
[5] Ko Fujimura, Takafumi Inoue, Massayuki Sugisaki. The eigenrumor algorithm for ranking blogs [A]. The 14th International World Wide Web Conference [C]. Chiba, Japan: May 2005.
[6] Shinsuke Nakajima, Junichi Tatemura, Yoichiro Hino. Discovering important bloggers based on analyzing blog threads [A]. The 14th International World Wide Web Conference [C]. Chibal, Japan: May 2005.
[7] 王晓宇, 周傲英. 万维网的链接结构分析及其应用综述. 软件学报, 2003, 14(10): 1768-1780.
[8] S. Brin, L. Page. The anatomy of a large-scale hypertextual web search engine [A]. The 7th International World Wide Web Conference [C]. Brisbane, Australia: April 1998. 107-117.
[9] J. Kleinberg. Authoritative sources in a hyperlinked environment [A]. In: Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms [C]. New Orleans, America: January 1997. 668-677.
[10] S. Chakrabarti, B. Dom, D. Gibson, et al. Automatic resource compilation by analyzing hyperlink structure and associated text [A]. The 7th International World Wide Web Conference [C]. Brisbane, Australia: April 1998. 65-74.
[11] Chakrabarti S, Dom B, Gibson D, Kleinberg J, Raghavan P, Rajagopalan S. Automatic resource compilation by analyzing hyperlink structure and associated text [A]. The 7th Int’l WWW Conference [C]. Brisbane, Australia: 1998. 65-74.
[12] S. Chakrabarti, B. Dom, D. Gibson, et al. Experiments in topic distillation [A]. In: Proceedings of the ACM SIGIR workshop on Hypertext Information Retrieval on the Web [C]. Melbourne, Australia: ACM Press, August 1998. 13-21.
[13] G. Mishne, D. Carmel, R. Lempel. Blocking blog spam with language model disagreement [A]. In: Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web (AIR Web) [C]. Chiba, Japan: May 2005.
[14] W. G. Hopkins. A new view of statistics [EB /OL]. http://www.sportsci.org/resource/stats, July 2006.
{{custom_fnGroup.title_cn}}
脚注
{{custom_fn.content}}
基金
国家自然科学基金资助项目(60302021, 60373101)
{{custom_fund}}