基于网页内容的广告推介研究

施水才,程涛,王霞,吕学强

PDF(844 KB)
PDF(844 KB)
中文信息学报 ›› 2007, Vol. 21 ›› Issue (4) : 42-47.
综述

基于网页内容的广告推介研究

  • 施水才1,程涛1,王霞2,吕学强1
作者信息 +

Advertisement-Promotion Research Based on the Content of Webpage

  • SHI Shui-cai1, CHENG Tao1, WANG Xia2, LV Xue-qiang1
Author information +
History +

摘要

网页与广告关联是基于网页内容的网络广告的核心技术,本文提出了一种基于语义的、以实现网页和广告精确匹配为目标的广告推介方法。首先对一个Web网页进行主题信息提取,获得网页的主题词;然后再对这些主题词语作同义词扩展、上位词扩展、下位词扩展和相关词扩展,最后从待匹配的广告中选择匹配度最高的广告。对该方法进行了模型系统实现并进行了测试运行, 结果表明该方法是行之有效的。

Abstract

Webpage-advertisement matching is the core technology of online advertisement based on the content, and the paper presents a semantic approach, with a goal of achieving webpage-advertisement matching accurately. Firstly, thematic information must be extracted from a webpage, and then thematic words are calculated. Extend the thematic words by looking up their similar words, upper words, lower words, related words, and finally choose advertisements which have highest matching rate. The method is implemented and tested, and the result shows that the proposed arithmetic is promising.

关键词

计算机应用 / 中文信息处理 / 同义词词林 / 主题词 / 网页数据抽取 / 关联度

Key words

computer application / chinese information processing / tongyici cilin / thematic words / web data extraction / matching rate

引用本文

导出引用
施水才,程涛,王霞,吕学强. 基于网页内容的广告推介研究. 中文信息学报. 2007, 21(4): 42-47
SHI Shui-cai, CHENG Tao, WANG Xia, LV Xue-qiang. Advertisement-Promotion Research Based on the Content of Webpage. Journal of Chinese Information Processing. 2007, 21(4): 42-47

参考文献

[1] 董晓常,王亚雪.追捧Google的理由[J]. 互联网周刊, 2005,(37): 25-27.
[2] Sahuguet A, Azavan F. Building Intelligent Web Applications Using Lightweight Wappers [J]. Data and Knowledge Engineering, 2001, 36(3): 283-316.
[3] Crescenzi V, Mecca G, Merialdo P. RoadRunner: Towards Automatic Data Extraction from Large Web Sites [A]. In: Proceeding of the 26th International Conference on Very Large Database Systems[C]. Rome, Italy: 2001. 109-118.
[4] 胡国平,张巍,王仁华.基于双层决策的新闻网页正文精确抽取[J].中文信息学报.2006, 20(6):1-9.
[5] 孙承杰,关毅.基于统计的网页正文信息抽取方法的研究[J].中文信息学报,2004, 18(5):17-55.
[6] 吴鹏飞,孟祥增,刘俊晓,等.基于结构与内容的网页主题信息提取研究[J].山东大学学报(理学版), 2006,41(3):131-134.
[7] 索红光,刘玉树,曹淑英.一种基于词汇链的关键词抽取方法 [J].中文信息学报,2006, 20(6):25-30.
[8] 唐培丽,王树明,胡明.基于语义的汉语文献主题词提取算法研究[J].吉林大学学报(信息科学版), 2005,23(5): 90-95.

基金

国家自然科学基金资助项目(60272084);北京市教育委员会科技发展计划重点项目(KZ200310772013)
PDF(844 KB)

Accesses

Citation

Detail

段落导航
相关文章

/