Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3487553.3524662acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
short-paper

Multi-Context Based Neural Approach for COVID-19 Fake-News Detection

Published: 16 August 2022 Publication History

Abstract

When the world is facing the COVID-19 pandemic, society is also fighting another battle to tackle misinformation. Due to the widespread effect of COVID 19 and increased usage of social media, fake news and rumors about COVID-19 are being spread rapidly. Identifying such misinformation is a challenging and active research problem. The lack of suitable datasets and external world knowledge contribute to the challenges associated with this task. In this paper, we propose MiCNA, a multi-context neural architecture to mitigate the problem of COVID-19 fake news detection. In the proposed model, we leverage the rich information of the three different pre-trained transformer-based models, i.e., BERT, BERTweet and COVID-Twitter-BERT to three different aspects of information (viz. general English language semantics, Tweet semantics, and information related to tweets on COVID 19) which together gives us a single multi-context representation. Our experiments provide evidence that the proposed model outperforms the existing baseline and the candidate models (i.e., three transformer architectures) and becomes a state-of-the-art model on the task of COVID-19 fake-news detection. We achieve new state-of-the-art performance on a benchmark COVID-19 fake-news dataset with 98.78% accuracy on the validation dataset and 98.69% accuracy on the test dataset.

References

[1]
Daron Acemoglu, Asuman Ozdaglar, and Ali ParandehGheibi. 2010. Spread of (mis)information in social networks. Games and Economic Behavior 70, 2 (2010), 194 – 227. https://doi.org/10.1016/j.geb.2010.01.005
[2]
Maaz Amjad, Grigori Sidorov, Alisa Zhila, Helena Gómez-Adorno, Ilia Voronkov, and Alexander Gelbukh. 2020. “Bend the truth”: Benchmark dataset for fake news detection in urdu language and its evaluation. Journal of Intelligent & Fuzzy SystemsPreprint (2020), 1–13.
[3]
Mohammed Azhan and Mohammad Kemal Ahmad. 2021. LaDiff ULMFiT: A Layer Differentiated training approach for ULMFiT. In CONSTRAINT@AAAI.
[4]
Michael Barthel, Amy Mitchell, and Jesse Holcomb. 2016. Many Americans believe fake news is sowing confusion. Pew Research Center 15(2016), 12.
[5]
Ceren Budak, Divyakant Agrawal, and Amr El Abbadi. 2011. Limiting the Spread of Misinformation in Social Networks. In Proceedings of the 20th International Conference on World Wide Web (Hyderabad, India) (WWW ’11). Association for Computing Machinery, New York, NY, USA, 665–674. https://doi.org/10.1145/1963405.1963499
[6]
Sourya Dipta Das, Ayan Basak, and Saikat Dutta. 2021. A Heuristic-driven Ensemble Framework for COVID-19 Fake News Detection. In CONSTRAINT@AAAI.
[7]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186. https://doi.org/10.18653/v1/N19-1423
[8]
J. Gaglani, Y. Gandhi, S. Gogate, and A. Halbe. 2020. Unsupervised WhatsApp Fake News Detection using Semantic Search. In 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS). 285–289. https://doi.org/10.1109/ICICCS48265.2020.9120902
[9]
Anna Glazkova, Maksim Glazkov, and T. Trifonov. 2021. g2tmn at Constraint@AAAI2021: Exploiting CT-BERT and Ensembling Learning for COVID-19 Fake News Detection. In CONSTRAINT@AAAI.
[10]
Michael Goodyear. 2020. Fake News in the Time of COVID-19: Inherent Powers over Public Health. Available at SSRN 3740639(2020).
[11]
Mutian He, Yangqiu Song, Kun Xu, and Yu Dong. 2020. On the Role of Conceptualization in Commonsense Knowledge Graph Construction. (03 2020).
[12]
Dan Hendrycks and Kevin Gimpel. 2016. Gaussian Error Linear Units (GELUs). arXiv: Learning (2016).
[13]
Md Zobaer Hossain, Md Ashraful Rahman, Md Saiful Islam, and Sudipta Kar. 2020. BanFakeNews: A Dataset for Detecting Fake News in Bangla. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 2862–2871. https://www.aclweb.org/anthology/2020.lrec-1.349
[14]
Debanjana Kar, Mohit Bhardwaj, Suranjana Samanta, and A. Azad. 2021. No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection. 2021 Grace Hopper Celebration India (GHCI)(2021), 1–5.
[15]
Hamid Karimi, Proteek Roy, Sari Saba-Sadiya, and Jiliang Tang. 2018. Multi-source multi-class fake news detection. In Proceedings of the 27th International Conference on Computational Linguistics. 1546–1557.
[16]
Sejeong Kwon, Meeyoung Cha, K. Jung, Wei Chen, and Y. Wang. 2013. Prominent Features of Rumor Propagation in Online Social Media. 2013 IEEE 13th International Conference on Data Mining (2013), 1103–1108.
[17]
Xiangyang Li, Yu Xia, Xiang Long, Zheng Li, and Sujian Li. 2021. Exploring Text-transformers in AAAI 2021 Shared Task: COVID-19 Fake News Detection in English. In CONSTRAINT@AAAI.
[18]
Yichuan Li, Bo han Jiang, Kai Shu, and Huan Liu. 2020. MM-COVID: A Multilingual and Multimodal Data Repository for Combating COVID-19 Disinformation. ArXiv abs/2011.04088(2020).
[19]
Jing Ma, Wei Gao, and Kam-Fai Wong. 2017. Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Vancouver, Canada, 708–717. https://doi.org/10.18653/v1/P17-1066
[20]
M. Müller, M. Salathé, and P. Kummervold. 2020. COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter. ArXiv abs/2005.07503(2020).
[21]
Dat Quoc Nguyen, Thanh Vu, and A. Nguyen. 2020. BERTweet: A pre-trained language model for English Tweets. In EMNLP.
[22]
Van-Hoang Nguyen, Kazunari Sugiyama, Preslav Nakov, and Min-Yen Kan. 2020. FANG: Leveraging Social Context for Fake News Detection Using Graph Representation(CIKM ’20). Association for Computing Machinery, New York, NY, USA, 1165–1174. https://doi.org/10.1145/3340531.3412046
[23]
Ofcom. 2020. Covid-19 news and information: consumption and attitudes.Results from week one of Ofcom’s online survey. (2020). https://www.ofcom.org.uk/__data/assets/pdf_file/0031/193747/covid-19-news-consumption-week-one-findings.pdf
[24]
World Health Organization 2020. Coronavirus disease (COVID-19) advice for the public: myth busters. (2020).
[25]
Parth Patwa, Shivam Sharma, Srinivas Pykl, Vineeth Guptha, Gitanjali Kumari, Md. Shad Akhtar, Asif Ekbal, Amitava Das, and Tanmoy Chakraborty. 2021. Fighting an Infodemic: COVID-19 Fake News Dataset. In CONSTRAINT@AAAI.
[26]
Verónica Pérez-Rosas, Bennett Kleinberg, Alexandra Lefevre, and Rada Mihalcea. 2018. Automatic Detection of Fake News. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, 3391–3401. https://www.aclweb.org/anthology/C18-1287
[27]
Natali Ruchansky, Sungyong Seo, and Yan Liu. 2017. CSI: A Hybrid Deep Model for Fake News Detection. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (Singapore, Singapore) (CIKM ’17). Association for Computing Machinery, New York, NY, USA, 797–806. https://doi.org/10.1145/3132847.3132877
[28]
Tanik Saikh, Arkadipta De, Asif Ekbal, and Pushpak Bhattacharyya. 2019. A Deep Learning Approach for Automatic Detection of Fake News. In Proceedings of the 16th International Conference on Natural Language Processing. 230–238.
[29]
I. Baris Schlicht and Zeyd Boukhers. 2021. ECOL: Early Detection of COVID Lies Using Content, Prior Knowledge and Source Information. In CONSTRAINT@AAAI.
[30]
Gautam Kishore Shahi and Durgesh Nandini. 2020. FakeCovid-A Multilingual Cross-domain Fact Check News Dataset for COVID-19. In Workshop Proceedings of the 14th International AAAI Conference on Web and Social Media (ICWSM) 2020. https://doi.org/10.36190/2020.14
[31]
Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. 2017. Fake news detection on social media: A data mining perspective. ACM SIGKDD explorations newsletter 19, 1 (2017), 22–36.
[32]
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn. Res. 15, 1 (Jan. 2014), 1929–1958.
[33]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research 9, 86 (2008), 2579–2605. http://jmlr.org/papers/v9/vandermaaten08a.html
[34]
Rutvik Vijjali, Prathyush Potluri, S. Kumar, and S. Teki. 2020. Two Stage Transformer Model for COVID-19 Fake News Detection and Fact Checking. ArXiv abs/2011.13253(2020).
[35]
Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman. 2019. GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net. https://openreview.net/forum?id=rJ4km2R5t7
[36]
Xinyi Zhou, Reza Zafarani, Kai Shu, and Huan Liu. 2019. Fake News: Fundamental Theories, Detection Strategies and Challenges. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining(Melbourne VIC, Australia) (WSDM ’19). Association for Computing Machinery, New York, NY, USA, 836–837. https://doi.org/10.1145/3289600.3291382
[37]
Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, and Sanja Fidler. 2015. Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV)(ICCV ’15). IEEE Computer Society, USA, 19–27.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '22: Companion Proceedings of the Web Conference 2022
April 2022
1338 pages
ISBN:9781450391306
DOI:10.1145/3487553
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 August 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. BERT
  2. COVID-19
  3. Fake-News Detection
  4. Transformers

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

WWW '22
Sponsor:
WWW '22: The ACM Web Conference 2022
April 25 - 29, 2022
Virtual Event, Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 196
    Total Downloads
  • Downloads (Last 12 months)45
  • Downloads (Last 6 weeks)2
Reflects downloads up to 11 Sep 2024

Other Metrics

Citations

Cited By

View all

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media