李浥尘,胡珀,王丽君. 基于神经网络的体育新闻自动生成研究[J]. 中文信息学报, 2018, 32(3): 77-83.
LI Yichen, HU Po, WANG Lijun. Sports News Generation Based on Neural Networks. , 2018, 32(3): 77-83.
基于神经网络的体育新闻自动生成研究
李浥尘,胡珀,王丽君
华中师范大学 计算机学院,湖北 武汉 430079
Sports News Generation Based on Neural Networks
LI Yichen, HU Po, WANG Lijun
School of Computer, Central China Normal University, Wuhan, Hubei 430079, China
Abstract:It is often time-consuming and laborious for a journalist to write sports news. In this paper, we propose a neural network model to automatically generate the sports news on the basis of the sports live scripts. The model avoids the manual feature extraction. Besides, it can also consider sentence-level information and global information within the script as well as the semantic relevance between sentences and corresponding news content in the scripts. The experimental results on the open data set verify the feasibility and effectiveness of the proposed method. In addition, we also try to generate the title of sports news based on rules and templates to extract the key content of it.
[1] Luhn H P. The automatic creation of literature abstracts[J]. IBM Journal of Research and Development, 1969,2(2):159-165. [2] Lin C Y, Eduard H. The automated acquisition of topic signatures for text summarization[C]//Proceedings of the 17th Conference on Computational Linguistics (COLING 2000), 2000:495-501, Association for Computational Linguistics, Stroudsburg, PA. [3] Nomoto T Matsumoto Y. A new approach to unsupervised text summarization[C]//Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001), 2001:26-34, ACM, New York, NY. [4] Erkan G, Radev D R. LexPageRank:prestige in multi-document text summarization[C]//Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004),2004. [5] Mihalcea R Tarau P. TextRank:bringing order into texts[C]//Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004),2004. [6] Conroy J M, Oleary D P. Text summarization via hidden markov models[C]//Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001), ACM, New York, NY,2001:406-407. [7] You O Y, Li W J, Li S J, et al. Applying regression models to query-focused multi-document summarization[J]. Information Processing and Management, 2011, 47(2):227-237. [8] Yang Z,Cai K K, Tang J, et al. Social context summarization[C]//Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011), ACM, New York, NY, 2011:255-264. [9] Jianmin Zhang, Jin-ge Yao, Xiaojun Wan. Towards constructing sports news from live text commentary[C]//Proceedings of ACL 2016, 2016. [10] Jeffrey Nichols, Jalal Mahmud, Clemens Drews. Summarizing sporting events using twitter[C]//Proceedings of the 2012 ACM International Conference on Intelligent User Interfaces, 2012:189-198. [11] Nadjet Bouayad-Agha, Gerard Casamayor, Leo Wanner. Content selection from an ontology based knowledge base for the generation of football summaries[C]//Proceedings of the 13th European Workshop on Natural Language Generation, 2011:72-81. [12] Nadjet Bouayad-Agha, Gerard Casamayor, Simon Mille, et al. Perspective-oriented generation of football match summaries:Old tasks, new challenges[C]//Proceedings of the ACM Transactions on Speech and Language Processing (TSLP), 2012,9(2):3. [13] D Tjondronegoro,Yi-Ping Phoebe Chen, Binh Pham. Highlights for more complete sports video summarization[C]//Proceedings of IEEE Computer Society Press, 2004,11(4):22-37. [14] Ziqiang Cao, Wenjie Li, Sujian Li. AttSum:Joint learning of focusing and summarization with neural attention[C]//Proceedings of Coling 2016. [15] Sumit Chopra,Michael Auli, Alexander M Rush. Abstractive sentence summarization with attentive recurrent neural networks[C]//Proceedings of NAACL 2016. [16] Chin-Yew Lin, Eduard Hovy. Automatic evaluation of summaries using n-gram cooccurrence statistics[C]//Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology 2003,(1):71-78. [17] Dragomir R Radev, Hongyan Jing, Malgorzata Budzikowska. Centroid-based summarization of multiple documents:Sentence extraction,utility-based evaluation, and user studies[C]//Proceedings of the 2000 NAACL-ANLP Workshop on Automatic summarization, 2000:21-30. Association for Computational Linguistics. [18] DanGillick, Benoit Favre,and Dilek Hakkani-Tur. The icsi summarization system at tac 2008[C]//Proceedings of the Text Understanding Conference,2008.