Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Leveraging relevant summarized information and multi-layer classification to generalize the detection of misleading headlines

Published: 01 May 2023 Publication History

Abstract

Disinformation is an important problem facing society nowadays. Given the rapid and easy access to information, news stories quickly go viral, the vast majority of which are misleading and with no prospect of verification. Specifically, the headline of a correctly designed news item must correspond to a summary of the main information of that news item and it should be neutral. However, many headlines circulating on the Internet use false or distorted information, seeking to confuse or mislead the reader. Misleading headlines indicate a dissonance between the headline and the content of the news story. From a computational perspective, this problem is being tackled as a Stance Detection problem between the headline and the body text of the news item. This paper contributes to the fight against the spread of misleading information by presenting a generic and flexible multi-level hierarchical classification. The approach is based on two stages that enable the detection of the stance between the news headline and the body text. The proposed architecture, called HeadlineStanceChecker+ uses the headline and only the essential information of the news item (not the full body text) as inputs. To extract this essential information, different summarization approaches (extractive and abstractive) are analyzed in order to determine the most relevant information for the task. The experimentation has been carried out using the Fake News Challenge (FNC-1) dataset. A 94.49% accuracy was obtained using extractive summaries, which were more helpful than abstractive ones. HeadlineStanceChecker+ improves the accuracy results of existing state-of-the-art systems. In conclusion, using automatic extractive summaries together with the two-stage generic architecture is an effective solution to the problem.

Highlights

HeadlineStanceChecker+ is an enhanced system for misleading headlines detection.
Misleading headlines detection is tackled as a stance detection problem.
Summarization supports stance detection by only selecting relevant information.
A hierarchical classifier effectively determines the stance of a headline.
Results show that extractive approaches perform better for stance detection task.

References

[1]
Rubin V.L., Disinformation and misinformation triangle: A conceptual model for “fake news” epidemic, causal factors and interventions, J. Doc. 75 (2019) 1013–1034.
[2]
Fallis D., The varieties of disinformation, Philos. Inf. Qual. (2014) 135–161.
[3]
Shao C., Ciampaglia G.L., Varol O., Flammini A., Menczer F., The spread of fake news by social bots, 2017, CoRR abs/1707.07592.
[4]
Franklin B., Hamer M., Hanna M., Kinsey M., Richardson J.E., Inverted pyramid, SAGE Key Concepts: Key Concepts J. Stud. (2005) 122.
[5]
van Dijk T., News as Discourse, in: Communication Series, L. Erlbaum Associates, 1988.
[6]
Reis J., Benevenuto F., de Melo P.O.S.V., Prates R., Kwak H., An J., Breaking the news: First impressions matter on online news, Proceedings of the 9th International Conference on Web and Social Media, ICWSM 2015, AAAI Press, 2015, pp. 357–366.
[7]
Conroy N.J., Rubin V.L., Chen Y., Automatic deception detection: Methods for finding fake news, in: Proceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community, American Society for Information Science, 2015, pp. 82:1–82:4.
[8]
Kuiken J., Schuth A., Spitters M., Marx M., Effective headlines of newspaper articles in a digital environment, Digit. J. 5 (10) (2017) 1300–1314.
[9]
Gabielkov M., Ramachandran A., Chaintreau A., Legout A., Social clicks: What and who gets read on Twitter?, ACM SIGMETRICS Perform. Eval. Rev. 44 (2016) 179–192.
[10]
Ecker U.K., Lewandowsky S., Chang E.P., Pillai R., The effects of subtle misinformation in news headlines, J. Exp. Psychol.: Appl. 20 (4) (2014) 323.
[11]
Chen Y., Conroy N.K., Rubin V.L., News in an online world: The need for an “automatic crap detector”, Proc. Assoc. Inf. Sci. Technol. 52 (1) (2015) 1–4.
[12]
Lutz B., Adam M.T.P., Feuerriegel S., Pröllochs N., Neumann D., Affective information processing of fake news: Evidence from neuroIS, in: Information Systems and Neuroscience, Springer International Publishing, 2020, pp. 121–128.
[13]
Wei W., Wan X., Learning to identify ambiguous and misleading news headlines, in: Proceedings of the 26th International Joint Conference on Artificial Intelligence, AAAI Press, 2017, pp. 4172–4178.
[14]
Chen Y., Conroy N.J., Rubin V.L., Misleading online content: Recognizing clickbait as “false news”, in: Proceedings of the ACM on Workshop on Multimodal Deception Detection, Association for Computational Linguistics, 2015, pp. 15–19.
[15]
S. Chesney, M. Liakata, M. Poesio, M. Purver, Incongruent headlines: Yet another way to mislead your readers, in: Proceedings of the 2017 Emnlp Workshop: Natural Language Processing Meets Journalism, 2017, pp. 56–61.
[16]
Ferreira W., Vlachos A., Emergent: a novel data-set for stance classification, in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, 2016, pp. 1163–1168.
[17]
Tay Y., Dehghani M., Bahri D., Metzler D., Efficient transformers: A survey, ACM Comput. Surv. 55 (6) (2023) 109:1–109:28.
[18]
Cai J., Li J., Li W., Wang J., Deeplearning model used in text classification, in: 2018 15th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP, IEEE, 2018, pp. 123–126.
[19]
Saquete E., Tomás D., Moreda P., Martínez-Barco P., Palomar M., Fighting post-truth using natural language processing: A review and open challenges, Expert Syst. Appl. 141 (2020).
[20]
Zarrella G., Marsh A., MITRE at SemEval-2016 task 6: Transfer learning for stance detection, in: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), Association for Computational Linguistics, 2016, pp. 458–463.
[21]
Ghosh S., Singhania P., Singh S., Rudra K., Ghosh S., Stance detection in web and social media: a comparative study, in: International Conference of the Cross-Language Evaluation Forum for European Languages, Springer, 2019, pp. 75–87.
[22]
Babakar M., Bakos N., Daumé H., Mantzarlis A., Seddah D., Vlachos A., Wardle C., Fake news challenge - I, 2016, http://www.fakenewschallenge.org/.
[23]
Silverman C., Lies, damn lies and viral content, 2015.
[24]
Baird S., Sibley D., Pan Y., Talos targets disinformation with fake news challenge victory, 2017, https://blog.talosintelligence.com/2017/06/talos-fake-news-challenge.html.
[25]
Hanselowski A., PVS A., Schiller B., Caspelherr F., Description of the system developed by team athene in the FNC-1, 2017.
[26]
Riedel B., Augenstein I., Spithourakis G.P., Riedel S., A simple but tough-to-beat baseline for the Fake News Challenge stance detection task, Comput. Res. Repos. (2017) CoRR abs/1707.03264.
[27]
Zhang Q., Liang S., Lipani A., Ren Z., Yilmaz E., From stances’ imbalance to their hierarchical representation and detection, in: The World Wide Web Conference, ACM, 2019, pp. 2323–2332.
[28]
Dulhanty C., Deglint J.L., Daya I.B., Wong A., Taking a stance on fake news: Towards automatic disinformation assessment via deep bidirectional transformer language models for stance detection, 2019, CoRR abs/1911.11951.
[29]
B. Pouliquen, R. Steinberger, C. Best, Automatic detection of quotations in multilingual news, in: Proceedings of Recent Advances in Natural Language Processing 2007, 2007, pp. 487–492.
[30]
Vlachos A., Riedel S., Identification and verification of simple claims about statistical properties, in: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2015, pp. 2596–2601.
[31]
Li Y., Sosea T., Sawant A., Nair A.J., Inkpen D., Caragea C., P-stance: A large dataset for stance detection in political domain, in: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, Association for Computational Linguistics, 2021, pp. 2355–2365.
[32]
Conforti C., Berndt J., Pilehvar M.T., Giannitsarou C., Toxvaerd F., Collier N., STANDER: an expert-annotated dataset for news stance detection and evidence retrieval, in: Cohn T., He Y., Liu Y. (Eds.), Findings of the Association for Computational Linguistics: EMNLP 2020, in: Findings of ACL, vol. EMNLP 2020, Association for Computational Linguistics, 2020, pp. 4086–4101.
[33]
Dernoncourt F., Ghassemi M., Chang W., A repository of corpora for summarization, in: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), European Language Resources Association (ELRA), 2018.
[34]
Nenkova A., Automatic text summarization of newswire: Lessons learned from the document understanding conference, in: Proceedings of the 20th National Conference on Artificial Intelligence - Vol. 3, AAAI Press, 2005, pp. 1436–1441.
[35]
Mackie S., McCreadie R., Macdonald C., Ounis I., Experiments in newswire summarisation, in: Ferro N., Crestani F., Moens M.-F., Mothe J., Silvestri F., Di Nunzio G.M., Hauff C., Silvello G. (Eds.), Advances in Information Retrieval, Springer International Publishing, 2016, pp. 421–435.
[36]
Duan Y., Jatowt A., Across-time comparative summarization of news articles, in: Proceedings of the 12th ACM International Conference on Web Search and Data Mining, Association for Computing Machinery, 2019, pp. 735–743.
[37]
Zhu C., Yang Z., Gmyr R., Zeng M., Huang X., Make lead bias in your favor: A simple and effective method for news summarization, 2019, CoRR abs/1912.11602.
[38]
Huang Y., Yu Z., Guo J., Xiang Y., Yu Z., Xian Y., Abstractive document summarization via multi-template decoding, Appl. Intell. 52 (9) (2022) 9650–9663.
[39]
Shi K., Cai X., Yang L., Zhao J., Pan S., StarSum: A star architecture based model for extractive summarization, IEEE/ACM Trans. Audio Speech Lang. Process. 30 (2022) 3020–3031.
[40]
Banko M., Mittal V.O., Witbrock M.J., Headline generation based on statistical translation, in: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, Association for Computational Linguistics, 2000, pp. 318–325.
[41]
B. Dorr, D. Zajic, R. Schwartz, Hedge Trimmer: A Parse-and-Trim Approach to Headline Generation, in: Proceedings of the North American Chapter of the Association for Computational Linguistics, Text Summarization Workshop 2003, 2003, pp. 1–8.
[42]
D. Zajic, B. Dorr, R. Schwartz, Automatic Headline Generation for Newspaper Stories, in: Proceedings of the Workshop on Automatic Summarization 2002, 2002, pp. 78–85.
[43]
Tan J., Wan X., Xiao J., From neural sentence summarization to headline generation: A coarse-to-fine approach, in: Proceedings of the 26th International Joint Conference on Artificial Intelligence, IJCAI ’17, AAAI Press, 2017, pp. 4109–4115.
[44]
Gavrilov D., Kalaidin P., Malykh V., Self-attentive model for headline generation, in: Advances in Information Retrieval, Springer International Publishing, 2019, pp. 87–93.
[45]
Singh R.K., Khetarpaul S., Gorantla R., Allada S.G., SHEG: summarization and headline generation of news articles using deep learning, Neural Comput. Appl. 33 (8) (2021) 3251–3265.
[46]
Gambhir M., Gupta V., Recent automatic text summarization techniques: A survey, Artif. Intell. Rev. 47 (1) (2017) 1–66.
[47]
Yadav D., Desai J., Yadav A.K., Automatic text summarization methods: A comprehensive review, SN Comput. Sci. 4 (33) (2023).
[48]
N. Moratanch, S. Chitrakala, A survey on extractive text summarization, in: 2017 International Conference on Computer, Communication and Signal Processing, ICCCSP, 2017, pp. 1–6.
[49]
Mihalcea R., Tarau P., TextRank: Bringing order into text, in: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2004, pp. 404–411.
[50]
Miller D., Leveraging BERT for extractive text summarization on lectures, 2019, CoRR abs/1906.04165.
[51]
Jani D., Patel N., Yadav H., Suthar S., Patel S., A concise review on automatic text summarization, in: Nayak J., Behera H., Naik B., Vimal S., Pelusi D. (Eds.), Computational Intelligence in Data Mining, Springer Nature Singapore, 2022, pp. 523–536.
[52]
Lewis M., Liu Y., Goyal N., Ghazvininejad M., Mohamed A., Levy O., Stoyanov V., Zettlemoyer L., BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension, in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, 2020, pp. 7871–7880.
[53]
Lin C.-Y., ROUGE: A package for automatic evaluation of summaries, in: Text Summarization Branches Out, Association for Computational Linguistics, 2004, pp. 74–81.
[54]
Esmaeilzadeh S., Peh G.X., Xu A., Neural abstractive text summarization and fake news detection, Comput. Res. Repos. (2019) CoRR abs/1904.00788.
[55]
Hartl P., Kruschwitz U., Applying automatic text summarization for fake news detection, in: Proceedings of the Thirteenth Language Resources and Evaluation Conference, European Language Resources Association, 2022, pp. 2702–2713.
[56]
Sepúlveda-Torres R., Vicente M., Saquete E., Lloret E., Palomar M., Exploring summarization to enhance headline stance detection, in: International Conference on Applications of Natural Language to Information Systems, Springer, 2021, pp. 243–254.
[57]
Vicente M., Sepúlveda-Torres R., Barros C., Saquete E., Lloret E., Can text summarization enhance the headline stance detection task? benefits and drawbacks, in: International Conference on Document Analysis and Recognition, Springer, 2021, pp. 53–67.
[58]
Sepúlveda-Torres R., Vicente M.E., Saquete E., Lloret E., Palomar M., HeadlineStanceChecker: Exploiting summarization to detect headline disinformation, J. Web Semant. 71 (2021).
[59]
Lorena A.C., De Carvalho A.C., Gama J.M., A review on the combination of binary classifiers in multiclass problems, Artif. Intell. Rev. 30 (1–4) (2008) 19–37.
[60]
Kumar S., Ghosh J., Crawford M.M., Hierarchical fusion of multiple classifiers for hyperspectral data analysis, Pattern Anal. Appl. 2 (5) (2002) 210–220.
[61]
Freitas A., Carvalho A., A tutorial on hierarchical classification with applications in bioinformatics, Res. Trends Data Min. Technol. Appl. (2007) 175–208.
[62]
Silla C.N., Freitas A.A., A survey of hierarchical classification across different application domains, Data Mining and Knowledge Discovery 22 (1) (2010) 31–72. 2010 22:1.
[63]
Naik A., Rangwala H., Large Scale Hierarchical Classification: State of the Art, in: SpringerBriefs in Computer Science, Springer International Publishing, 2018.
[64]
Ma Y., Liu X., Zhao L., Liang Y., Zhang P., Jin B., Hybrid embedding-based text representation for hierarchical multi-label text classification, Expert Syst. Appl. 187 (2022).
[65]
Silva-Palacios D., Ferri C., Ramírez-Quintana M.J., Probabilistic class hierarchies for multiclass classification, J. Comput. Sci. 26 (2018) 254–263.
[66]
Devlin J., Chang M.-W., Lee K., Toutanova K., BERT: Pre-training of deep bidirectional transformers for language understanding, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, 2019, pp. 4171–4186.
[67]
Passalis N., Tefas A., Learning bag-of-embedded-words representations for textual information retrieval, Pattern Recognit. 81 (2018) 254–267.
[68]
Metcalf L., Casey W., Metrics, similarity, and sets, in: Cybersecurity and Applied Mathematics, Elsevier, 2016, pp. 3–22.
[69]
Vicente M.E., Pastor E.L., Relevant content selection through positional language models: An exploratory analysis, Process. Del Leng. Natural 65 (2020) 75–82.
[70]
Gibbs A.L., Su F.E., On choosing and bounding probability metrics, Internat. Statist. Rev. 70 (3) (2002) 419–435.
[71]
González-Castro V., Alaiz-Rodríguez R., Alegre E., Class distribution estimation based on the hellinger distance, Inform. Sci. 218 (2013) 146–164.
[72]
Lebret R., Collobert R., Word embeddings through hellinger PCA, in: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, 2014, pp. 482–490.
[73]
Hu Z., Hong L.J., Kullback-Leibler divergence constrained distributionally robust optimization, Optim. Online (2013).
[74]
Raiber F., Kurland O., Kullback-Leibler divergence revisited, in: Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval, Association for Computing Machinery, 2017, pp. 117–-124.
[75]
T. Mikolov, K. Chen, G. Corrado, J. Dean, Efficient Estimation of Word Representations in Vector Space, in: Y. Bengio, Y. LeCun (Eds.), 1st International Conference on Learning Representations, ICLR 2013, Scottsdale, Arizona, USA, May 2-4, 2013, Workshop Track Proceedings, 2013.
[76]
Reimers N., Gurevych I., Sentence-BERT: Sentence embeddings using siamese BERT-networks, in: Inui K., Jiang J., Ng V., Wan X. (Eds.), Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019, Hong Kong, China, November 3-7, 2019, Association for Computational Linguistics, 2019, pp. 3980–3990.
[77]
Wagner W., Steven Bird, Ewan Klein and Edward Loper: Natural language processing with python, analyzing text with the natural language toolkit - O’Reilly media, Beijing, 2009, ISBN 978-0-596-51649-9, Lang. Resour. Eval. 44 (4) (2010) 421–424.
[78]
Liu Y., Ott M., Goyal N., Du J., Joshi M., Chen D., Levy O., Lewis M., Zettlemoyer L., Stoyanov V., RoBERTa: A robustly optimized BERT pretraining approach, 2019, CoRR abs/1907.11692.
[79]
Wang A., Singh A., Michael J., Hill F., Levy O., Bowman S., GLUE: A multi-task benchmark and analysis platform for natural language understanding, in: Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Association for Computational Linguistics, 2018, pp. 353–355.
[80]
Lai G., Xie Q., Liu H., Yang Y., Hovy E., RACE: Large-scale reading comprehension dataset from examinations, in: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2017, pp. 785–794.
[81]
Slovikovskaya V., Attardi G., Transfer learning from transformers to fake news challenge stance detection (FNC-1) task, in: Proceedings of the 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020, European Language Resources Association, 2020, pp. 1211–1218.
[82]
Cai J., Luo J., Wang S., Yang S., Feature selection in machine learning: A new perspective, Neurocomputing 300 (2018) 70–79.
[83]
Nasiri H., Alavi S.A., A novel framework based on deep learning and ANOVA feature selection method for diagnosis of COVID-19 cases from chest X-ray images, Comput. Intell. Neurosci. 2022 (2022).
[84]
Lin H., Ding H., Predicting ion channels and their types by the dipeptide mode of pseudo amino acid composition, J. Theoret. Biol. 269 (1) (2011) 64–69.
[85]
Brownlee J., How to choose a feature selection method for machine learning, Mach. Learn. Mast. 10 (2019).
[86]
Hanselowski A., PVS A., Schiller B., Caspelherr F., Chaudhuri D., Meyer C.M., Gurevych I., A retrospective analysis of the fake news challenge stance-detection task, in: Proceedings of the 27th International Conference on Computational Linguistics, Association for Computational Linguistics, 2018, pp. 1859–1874.
[87]
Biewald L., Experiment tracking with weights and biases, 2020, Software available from wandb.com.
[88]
O. Levy, T. Zesch, I. Dagan, I. Gurevych, Recognizing Partial Textual Entailment, in: Annual Meeting of the Association for Computational Linguistics, 2013.
[89]
Harabagiu S., Hickl A., Lacatusu F., Negation, contrast and contradiction in text processing, in: Proceedings of the 21st National Conference on Artificial Intelligence - Vol. 1, AAAI ’06, AAAI Press, 2006, pp. 755–762.
[90]
Beltagy I., Peters M.E., Cohan A., Longformer: The long-document transformer, 2020, CoRR abs/2004.05150.
[91]
Lacoste A., Luccioni A., Schmidt V., Dandres T., Quantifying the carbon emissions of machine learning, 2019, CoRR abs/1910.09700.

Index Terms

  1. Leveraging relevant summarized information and multi-layer classification to generalize the detection of misleading headlines
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Data & Knowledge Engineering
    Data & Knowledge Engineering  Volume 145, Issue C
    May 2023
    534 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 01 May 2023

    Author Tags

    1. Natural language processing
    2. Fake news
    3. Misleading headlines
    4. Stance detection

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media