Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Multilingual News Feed Analysis using Intelligent Linguistic Particle Filtering Techniques

Published: 10 March 2023 Publication History

Abstract

Analyzing real-time news feeds and their impacts in the real world is a complex task in the social networking arena. Particularly, countries with a multilingual environment have various patterns and perceptions of news reports considering the diversity of the people. Multilingual and multimodal news analysis is an emerging trend for evaluating news source neutralities. Therefore, in this work, four new deep news particle filtering techniques were developed, including generic news analysis, sequential importance re-sampling (SIR)-based news particle filtering analysis, reinforcement learning (RL)-based multimodal news analysis, and deep Convolution neural network (DCNN)-based multi-news filtering approach, for news classification. Results indicate that these techniques, which primarily employ particle filtering with multilevel sampling strategies, produce 15% to 20% better performance than conventional news analysis techniques.

References

[1]
Jie Ding, Jiazhong Chen, Jinxing Lin, and Guoping Jiang. 2019. Particle filtering-based recursive identification for controlled auto-regressive systems with quantized output. IET Control Theory & Applications 13, 14 (2019), 2181–2187.
[2]
Edward Herbst and Frank Schorfheide. 2019. Tempered particle filtering. Journal of Econometrics 210, 1 (2019), 26–44.
[3]
Mani Razi, Robert M. Kirby, and Akil Narayan. 2019. Fast predictive multi-fidelity prediction with models of quantized fidelity levels. Journal of Computational Physics 376 (2019), 992–1008.
[4]
Alexandre Bovet and Hernán A. Makse. 2019. Influence of fake news in Twitter during the 2016 US presidential election. Nature Communications Journal 10, 1 (2019), 1–14.
[5]
Sawinder Kaur, Parteek Kumar, and Ponnurangam Kumaraguru. 2019. Automating fake news detection system using multi-level voting model. Springer, Soft Computing (2019), 1–21.
[6]
Kun Ma, Ziqiang Yu, Ke Ji, and Bo Yang. 2019. Stream-based live public opinion monitoring approach with adaptive probabilistic topic model. Springer, Soft Computing 23, 16 (2019), 7451–7470.
[7]
Juan Antonio Morente-Molinera, Gang Kou, C. Pang, Francisco Javier Cabrerizo, and Enrique Herrera-Viedma. 2019. An automatic procedure to create fuzzy ontologies from users’ opinions using sentiment analysis procedures and multi-granular fuzzy linguistic modelling methods. Information Sciences 476 (2019), 222–238.
[8]
Roberto A. De Santis. 2019. Impact of the asset purchase programme on euro area government bond yields using market news. Economic Modelling (2019).
[9]
Ray Moynihan, Bero Lisa, Ross-Degnan Dennis, David Henry, Kirby Lee, Judy Watkins, Connie Mah, and Stephen B. Soumerai. 2000. Coverage by the news media of the benefits and risks of medications. New England Journal of Medicine 342, 22 (2000), 1645–1650.
[10]
Joseph Engelberg, R. David McLean, and Jeffrey Pontiff. 2018. Anomalies and news. The Journal of Finance 73, 5 (2018), 1971–2001.
[11]
Amr Ahmed, Qirong Ho, Jacob Eisenstein, Eric Xing, Alexander J. Smola, and Choon Hui Teo. 2011. Unified analysis of streaming news. In Proceedings of the 20th International Conference on World wide Web. ACM, (2011). 267–276.
[12]
Dong-Ho Lee, Yu-Ri Kim, Hyeong-Jun Kim, Seung-Myun Park, and Yu-Jun Yang. 2019. Fake news detection using deep learning. Journal of Information Processing Systems 15, 5 (2019).
[13]
Johannes Kiesel, Maria Mestre, Rishabh Shukla, Emmanuel Vincent, Payam Adineh, David Corney, Benno Stein, and Martin Potthast. 2019. Semeval-2019 task 4: Hyperpartisan news detection. In Proceedings of the 13th International Workshop on Semantic Evaluation. 829–839.
[14]
Wataru Souma, Irena Vodenska, and Hideaki Aoyama. 2019. Enhanced news sentiment analysis using deep learning methods. Journal of Computational Social Science 2, 1 (2019), 33–46.
[15]
Kevin Lerman and Ariel Gilder. 2009. System and method for forecasting fluctuations in future data and particularly for forecasting security prices by news analysis. U.S. Patent Application 12/150,960, filed January 22, 2009.
[16]
Xinyi Zhou, Reza Zafarani, Kai Shu, and Huan Liu. 2019. Fake news: Fundamental theories, detection strategies and challenges. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining. ACM, (2019), 836–837.
[17]
Brett K. Beaulieu-Jones, Zhiwei Steven Wu, Chris Williams, Ran Lee, Sanjeev P. Bhavnani, James Brian Byrd, and Casey S. Greene. 2019. Privacy-preserving generative deep neural networks support clinical data sharing. Circulation: Cardiovascular Quality and Outcomes 12, 7 (2019), e005122.
[18]
Julio C. S. Reis, André Correia, Fabrício Murai, Adriano Veloso, Fabrício Benevenuto, and Erik Cambria. 2019. Supervised learning for fake news detection. IEEE Intelligent Systems 34, 2 (2019), 76–81.
[19]
Shadikur Rahman, Syeda Sumbul Hossain, and Saiful Islam, Mazharul Islam Chowdhury, Fatama Binta Rafiq, and Khalid Been Md Badruzzaman. 2019. Context-based news headlines analysis using machine learning approach. In International Conference on Computational Collective Intelligence. Springer, Cham, 167–178.
[20]
Vaibhav Khatavkar, Makarand Velankar, and Parag Kulkarni. 2019. Multi-perspective analysis of news articles using machine learning algorithms. International Journal of Computer Applications 975, 8887 (2019).
[21]
N. Jamal, C. Xianqiao, F. Al-Turjman, and F. Ullah. 2021. A deep learning–based approach for emotions classification in big corpus of imbalanced tweets. Transactions on Asian and Low-Resource Language Information Processing 20, 3 (2021), 1–16.
[22]
S. Rakeshkumar, S. Muthramalingam, and F. Al-Turjman. 2021. Multimodal news feed evaluation system with deep reinforcement learning approaches. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 20, 1 (2021), 1–12.
[23]
F. Ullah, S. Jabbar, and L. Mostarda. 2021. An intelligent decision support system for software plagiarism detection in academia. International Journal of Intelligent Systems.
[24]
K. Razzaq Malik, M. Habib, S. Khalid, F. Ullah, M. Umar, T. Sajjad, and A. Ahmad. 2017. Data compatibility to enhance sustainable capabilities for autonomous analytics in IoT. Sustainability 9, 6 (2017), 877.
[25]
U. Khadam, M. M. Iqbal, L. Mostarda, and F. Ullah. 2020. An efficient framework for text document security and privacy. In International Symposium on Security and Privacy in Social Networks and Big Data. Springer, Singapore, 132–140.
[26]
S. Amin, M. I. Uddin, S. Hassan, A. Khan, N. Nasser, A. Alharbi, and H. Alyami. 2020. Recurrent neural networks with TF-IDF embedding technique for detection and classification in tweets of Dengue disease. IEEE Access 8, 131522–131533.
[27]
V. Poleneni, J. K. Rao, and S. A. Hidayathulla. 2021. COVID-19 prediction using ARIMA model. In 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence). IEEE, 860–865.
[28]
L. Garg, E. Chukwu, N. Nasser, C. Chakraborty, and G. Garg. 2020. Anonymity preserving IoT-based COVID-19 and other infectious disease contact tracing model. IEEE Access 8 (2020), 159402–159414.
[29]
A. Onan. 2022. Bidirectional convolutional recurrent neural network architecture with group-wise enhancement mechanism for text sentiment classification. Journal of King Saud University-Computer and Information Sciences 34, 5 (2022), 2098–2117.
[30]
A. Onan. 2019. Consensus clustering-based undersampling approach to imbalanced learning. Scientific Programming. (2019).
[31]
A. Onan, S. Korukoğlu, and H. Bulut. 2016. Ensemble of keyword extraction methods and classifiers in text classification. Expert Systems with Applications 57 (2016), 232–247.
[32]
A. Onan. 2019. Two-stage topic extraction model for bibliometric data analysis based on word embeddings and clustering. IEEE Access 7 (2019), 145614–145633.
[33]
A. Onan and S. Korukoğlu. 2017. A feature selection model based on genetic rank aggregation for text sentiment classification. Journal of Information Science 43, 1 (2017), 25–38.
[34]
A. Onan, S. Korukoğlu, and H. Bulut. 2017. A hybrid ensemble pruning approach based on consensus clustering and multi-objective evolutionary algorithm for sentiment classification. Information Processing & Management 53, 4 (2017), 814–833.
[35]
A. Onan. 2021. Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks. Concurrency and Computation: Practice and Experience 33, 23 (2021), e5909.
[36]
A. Onan. 2020. Mining opinions from instructor evaluation reviews: A deep learning approach. Computer Applications in Engineering Education 28, 1 (2020), 117–138.
[37]
A. Onan. 2021. Sentiment analysis on massive open online course evaluations: A text mining and deep learning approach. Computer Applications in Engineering Education 29, 3 (2021), 572–589.
[38]
An ensemble scheme based on language function analysis and feature engineering for text genre classification. Journal of Information Science 44 1, 28–47.
[39]
A. Onan and M. A. Toçoğlu. 2021. A term weighted neural language model and stacked bidirectional LSTM based framework for sarcasm identification. IEEE Access 9 (2021), 7701–7722.
[40]
A. Onan. 2019. Topic-enriched word embeddings for sarcasm identification. In Computer Science On-line Conference. Springer, Cham, 293–304.
[41]
A. Onan. 2018. Biomedical text categorization based on ensemble pruning and optimized topic modelling. Computational and Mathematical Methods in Medicine (2018).
[42]
D. B. Claro, M. Souza, C. Castellã Xavier, and L. Oliveira.2019. Multilingual open information extraction: Challenges and opportunities. Information 10, 7 (2019), 228.
[43]
X. Wang, Q. Liu, T. Gui, Q. Zhang, Y. Zou, X. Zhou, ... and X. J. Huang. 2021. TextFlint: Unified multilingual robustness evaluation toolkit for natural language processing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations. 347–355.
[44]
A. Lytos, T. Lagkas, P. Sarigiannidis, V. Argyriou, and G. Eleftherakis. 2022. Modelling argumentation in short text: A case of social media debate. Simulation Modelling Practice and Theory 115 (2022), 102446.
[45]
M. Souza and R. Wassermann. 2021. Belief Contraction in Non-classical logics as hyperintensional belief change. In Proceedings of the International Conference on Principles of Knowledge Representation and Reasoning 18, 1 (2021), 588–598.
[46]
C. C. Xavier and M. Souza. 2020. A basic approach for extracting and analyzing data from Twitter. In Special Topics in Multimedia, IoT and Web Technologies. Springer, Cham, 185–211.
[47]
N. C. Dang, M. N. Moreno-García, and F. De la Prieta. 2020. Sentiment analysis based on deep learning: A comparative study. Electronics 9, 3 (2020), 483.
[48]
Z. Kastrati, F. Dalipi, A. S. Imran, K. Pireva Nuci, and M. A. Wani. 2021. Sentiment analysis of students’ feedback with NLP and deep learning: A systematic mapping study. Applied Sciences 11, 9 (2021), 3986.
[49]
X. Wang, S. Dou, L. Xiong, Y. Zou, Q. Zhang, T. Gui, ... and X. Huang. 2022. MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective. arXiv preprint arXiv:2204.04391.
[50]
P. Goel, V. Goel, and A. K. Gupta. 2020. Multilingual data analysis to classify sentiment analysis for tweets using NLP and classification algorithm. In Advances in Data and Information Sciences. Springer, Singapore, 271–280.
[51]
A. Lytos, T. Lagkas, P. Sarigiannidis, and K. Bontcheva. 2019. The evolution of argumentation mining: From models to social media and emerging tools. Information Processing & Management 56, 6 (2019), 102055.

Index Terms

  1. Multilingual News Feed Analysis using Intelligent Linguistic Particle Filtering Techniques

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian and Low-Resource Language Information Processing
    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 3
    March 2023
    570 pages
    ISSN:2375-4699
    EISSN:2375-4702
    DOI:10.1145/3579816
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 March 2023
    Online AM: 18 November 2022
    Accepted: 11 October 2022
    Revised: 14 August 2022
    Received: 25 February 2022
    Published in TALLIP Volume 22, Issue 3

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Machine learning and deep learning
    2. artificial intelligence
    3. news feeds
    4. particle filtering
    5. sampling

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 164
      Total Downloads
    • Downloads (Last 12 months)73
    • Downloads (Last 6 weeks)13
    Reflects downloads up to 18 Aug 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media