|
|
Few-shot COVID-19 Rumor Detection for Online Social Media |
LU Hengyang1,2, FAN Chenyou3, WU Xiaojun1 |
1.Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence, Jiangnan University, Wuxi, Jiangsu 214122, China; 2.National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210023, China; 3.Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, Guangdong 518129, China |
|
|
Abstract The COVID-19 rumors published and spread on the online social media have a serious impact on people's livelihood, economy, and social stability. Most existing researches for rumor detection usually assumed that the happened events for modeling and predictions already have enough labeled data. These studies have severe limitations on detecting emergent events such as the COVID-19 which has very few training instances. This article focuses on the problem of few-shot rumor detection, aiming to detect rumors of emergent events with only very few labeled instances. Taking the COVID-19 rumors from Sina Weibo as the target, we construct a Sina Weibo COVID-19 rumor dataset for few-shot rumor detection, and propose a deep neural network based few-shot rumor detection model with meta learning. In the few-shot machine learning scenarios, the experimental results of the proposed model on the COVID-19 rumor dataset and the PHEME public dataset have been significantly improved.
|
Received: 29 June 2021
|
|
|
|
|
[1] Qazvinian V, Rosengren E, Radev D, et al. Rumor has it: Identifying misinformation in microblogs [C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Edinburgh, UK: ACL, 2011: 1589-1599. [2] Popat K. Assessing the credibility of claims on the web [C]//Proceedings of the 26th International Conference on World Wide Web Companion. Perth, Australia: ACM, 2017: 735-739. [3] Ma J, Gao W, Mitra P, et al. Detecting rumors from microblogs with recurrent neural networks [C]//Proceedings of the 25th International Joint Conference on Artificial Intelligence. New York, NY, USA: IJCAI, 2016: 3818-3824. [4] Yu F, Liu Q, Wu S, et al. A convolutional approach for misinformation identification [C]//Proceedings of the 26th International Joint Conference on Artificial Intelligence. Melbourne, VIC, Australia: IJCAI, 2017: 3901-3907. [5] Shu K, Wang S, Liu H. Understanding user profiles on social media for fake news detection [C]//Proceedings of the IEEE Conference on Multimedia Information Processing and Retrieval. Miami, FL, USA: IEEE, 2018: 430-435. [6] Finn C,Abbeel P, Levine S. Model-agnostic meta-learning for fast adaptation of deep networks [C]//Proceedings of the 34th International Conference on Machine Learning. Sydney, NSW, Australia, PMLR, 2017: 1126-1135. [7] Gidaris S, Komodakis N. Dynamic few-shot visual learning without forgetting [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA: IEEE, 2018: 4367-4375. [8] Vinyals O, Blundell C, Lillicrap T, et al. Matching networks for one shot learning [C]//Proceedings of the 30th Conference on Neural Information Processing Systems. Barcelona, Spain: NIPS, 2016: 3630-3638. [9] Snell J,Swersky K, Zemel R. Prototypical networks for few-shot learning [C]//Proceedings of the 31st Conference on Neural Information Processing Systems. Long Beach, CA, USA: NIPS, 2017: 4077-4087. [10] Derczynski L, Bontcheva K, Liakata M, et al. SemEval-2017 task 8: rumoureval: determining rumour veracity and support for rumours [C]//Proceedings of the 11th International Workshop on Semantic Evaluation. Vancouver, Canada: ACL, 2017: 69-76. [11] Zubiaga A, Liakata M, Procter R, et al. Analysing how people orient to and spread rumours in social media by looking at conversational threads [J]. PLoS ONE, 2016, 11(3): 1-29. [12] Gorrell G, Kochkina E, Liakata M, et al. SemEval-2019 task 7: RumourEval, determining rumour veracity and support for rumours [C]//Proceedings of the 13th International Workshop on Semantic Evaluation. Minneapolis, Minnesota, USA: ACL, 2019: 845-854. [13] Castillo C, Mendoza M, Poblete B. Information credibility on twitter [C]//Proceedings of the 20th International Conference on World Wide Web. Hyderabad, India, ACM, 2011: 675-684. [14] Kwon S, Cha M, Jung K. Rumor detection over varying time windows [J]. PLoS ONE, 2017, 12(1): e0168344. [15] Zhao Z, Resnick P, Mei Q. Enquiring minds:early detection of rumors in social media from enquiry posts [C]//Proceedings of the 24th International Conference on World Wide Web. Florence, Italy: ACM, 2015: 1395-1405. [16] Mohammad S M,Sobhani P, Kiritchenko S. Stance and sentiment in tweets [J]. ACM Transactions on Internet Technology, 2017, 17(3): 1-23. [17] Chen T, Li X, Yin H, et al. Call attention to rumors:deep attention based recurrent neural networks for early rumor detection [C]//Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining. Melbourne, VIC, Australia: Springer, 2018: 40-52. [18] Ma J, Gao W, Wong K F. Rumor detection on twitter with tree-structured recursive neural networks [C]//Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia: ACL, 2018: 1980-1989. [19] Bian T, Xiao X, Xu T, et al. Rumor detection on social media with bi-directional graph convolutional networks [C]//Proceedings of the AAAI Conference on Artificial Intelligence. New York City, NY, USA: AAAI, 2020: 549-556. [20] Jin Z, Cao J, Jiang Y G, et al. News credibility evaluation on microblog with a hierarchical propagation model [C]//Proceedings of the IEEE International Conference on Data Mining. Shenzhen, China: IEEE, 2014: 230-239. [21] Liu Y, Xu S. Detecting rumors through modeling information propagation networks in a social media environment [J]. IEEE Transactions on Computational Social Systems, 2016, 3(2): 46-62. [22] 刘金硕, 冯阔, 邓娟, 等. MSRD: 多模态网络谣言检测方法[J]. 计算机研究与发展, 2020, 57(11): 2328-2336. [23] Sung F, Yang Y, Zhang L, et al. Learning to compare:relation network for few-shot learning [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA: IEEE, 2018: 1199-1208. [24] Sun Q, Liu Y, Chua T S, et al. Meta-transfer learning for few-shot learning [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Long Beach, CA, USA: IEEE, 2019: 403-412. [25] Li Z, Zhou F, Chen F, et al. Meta-sgd: Learning to learn quickly for few-shot learning[J/OL]. arXiv preprint arXiv:1707.09835, 2017. [26] Oreshkin B, López P R, Lacoste A. Tadam: Task dependent adaptive metric for improved few-shot learning [C]//Proceedings of the International Conference on Neural Information Processing Systems. Montréal, Canada: NeurIPS, 2018: 721-731. [27] Zhang C, Cai Y, Lin G, et al.DeepEMD: Few-shot image classification with differentiable earth movers distance and structured classifiers [C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, WA, USA: IEEE/CVF, 2020: 12203-12213. [28] Mikolov T, Sutskever I, Chen K, et al. Distributed representations of words and phrases and their compositionality [C]//Proceedings of the 26th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada, USA: NIPS, 2013: 3111-3119. [29] Pennington J,Socher R, Manning C D. Glove: global vectors for word representation [C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Doha, Qatar: ACL, 2014: 1532-1543. [30] Peters M, Neumann M,Iyyer M, et al. Deep contextualized word representations [C]//Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Melbourne, Australia: ACL, 2018: 2227-2237. [31] Devlin J, Chang M W, Lee K, et al. BERT: pre-training of deep bidirectional transformers for language understanding [C]//Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Florence, Italy: ACL, 2019: 4171-4186. [32] Yuan C, Ma Q, Zhou W, et al. Jointly embedding the local and global relations of heterogeneous graph for rumor detection [C]//Proceedings of IEEE International Conference on Data Mining. Beijing, China: IEEE, 2019: 796-805. [33] 沈瑞琳, 潘伟民, 彭成, 等. 基于多任务学习的微博谣言检测方法[J/OL].计算机工程与应用:1-8. http://kns.cnki.net/kcms/detail/11.2127.TP.20201111.1158.004.html[2021-08-10]. [34] 周奇安, 李舟军. 基于 BERT 的任务导向对话系统自然语言理解的改进模型与调优方法[J]. 中文信息学报, 2020, 34(5): 82-90. [35] Cho K, vanMerrienboer B, Bahdanau D, et al. On the properties of neural machine translation: encoder-decoder approaches [C]//Proceedings of 8th Workshop on Syntax, Semantics and Structure in Statistical Translation. Doha, Qatar: ACL, 2014: 103-111. [36] Zubiaga A, Liakata M, Procter R. Learning reporting dynamics during breaking news for rumour detection in social media[J/OL]. arXiv preprint arXiv:1610.07363, 2016. [37] Ma J, Gao W, Wong K F. Detect rumors on twitter by promoting information campaigns with generative adversarial learning [C]//Proceedings of the World Wide Web Conference. San Francisco, CA, USA: ACM, 2019: 3049-3055. |
|
|
|