research-article

Multilingual News Feed Analysis using Intelligent Linguistic Particle Filtering Techniques

Authors:

Rakesh Kumar S,

Gayathri Nagasubramanian,

Muthuramalingam S,

Fadi Al-Turjman

ACM Transactions on Asian and Low-Resource Language Information Processing, Volume 22, Issue 3

Article No.: 87, Pages 1 - 19

https://doi.org/10.1145/3569899

Published: 10 March 2023

Abstract

Analyzing real-time news feeds and their real-world impact is a complex task in the social networking arena. In particular, multilingual countries show varied patterns and perceptions of news reports, reflecting the diversity of their people. Multilingual and multimodal news analysis is an emerging approach to evaluating the neutrality of news sources. In this work, four new deep news particle filtering techniques were developed for news classification: generic news analysis, sequential importance re-sampling (SIR)-based news particle filtering analysis, reinforcement learning (RL)-based multimodal news analysis, and a deep convolutional neural network (DCNN)-based multi-news filtering approach. Results indicate that these techniques, which primarily employ particle filtering with multilevel sampling strategies, perform 15% to 20% better than conventional news analysis techniques.
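For readers unfamiliar with the SIR scheme named in the abstract, the following is a minimal, generic sketch of one sequential importance re-sampling update (propagate, reweight, resample). It is a textbook illustration and not the authors' news filtering method: the scalar state, the random-walk `transition`, the Gaussian `likelihood`, and all numeric values are assumptions chosen only to make the example runnable.

```python
import math
import random


def sir_step(particles, weights, observation, transition, likelihood, rng):
    """One generic SIR update: propagate, reweight, resample."""
    # Propagate each particle through the (assumed) transition model.
    moved = [transition(p, rng) for p in particles]
    # Reweight by how well each particle explains the observation.
    raw = [w * likelihood(observation, p) for w, p in zip(weights, moved)]
    total = sum(raw)
    if total == 0.0:
        # Degenerate case: every weight vanished, so fall back to uniform.
        norm = [1.0 / len(moved)] * len(moved)
    else:
        norm = [r / total for r in raw]
    # Multinomial resampling: draw N particles in proportion to weight.
    resampled = rng.choices(moved, weights=norm, k=len(moved))
    # After resampling, all particles carry equal weight again.
    uniform = [1.0 / len(resampled)] * len(resampled)
    return resampled, uniform


def transition(x, rng):
    # Hypothetical state model: a slow random walk.
    return x + rng.gauss(0.0, 0.1)


def likelihood(z, x):
    # Hypothetical observation model: unnormalized Gaussian, std 0.5.
    return math.exp(-0.5 * ((z - x) / 0.5) ** 2)


rng = random.Random(0)
n = 500
particles = [rng.uniform(-5.0, 5.0) for _ in range(n)]
weights = [1.0 / n] * n

# Made-up observations scattered around a true value of 1.0.
for z in [1.0, 1.1, 0.9, 1.0]:
    particles, weights = sir_step(
        particles, weights, z, transition, likelihood, rng
    )

estimate = sum(particles) / len(particles)
print(f"posterior mean estimate: {estimate:.3f}")
```

In a news setting, the state and observation models would be replaced by whatever representation of a news item and its evidence the application defines; the resampling step is what keeps the particle set concentrated on the hypotheses that the incoming data supports.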

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction

Published In

ACM Transactions on Asian and Low-Resource Language Information Processing Volume 22, Issue 3

March 2023

570 pages

ISSN:2375-4699

EISSN:2375-4702

DOI:10.1145/3579816

Editor:
Imed Zitouni
Google, USA

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 March 2023

Online AM: 18 November 2022

Accepted: 11 October 2022

Revised: 14 August 2022

Received: 25 February 2022

Published in TALLIP Volume 22, Issue 3
