Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3639233.3639342acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnlpirConference Proceedingsconference-collections
research-article

Impact of Emojis in Emotion Analysis on Code-Mixed Text

Published: 05 March 2024 Publication History

Abstract

With the maturity of sentiment analysis technology, researchers are gradually not satisfied with only excavating positive and negative attitudes from the text but hope to obtain more delicate emotions. In addition, the development of social media makes people from different backgrounds like to express their emotions and feelings on social platforms. However, textual communication from social media is too colloquial, leading to the code-mixed text phenomenon, where sentences contain different languages, which poses difficulties for text analysis research. We observed that emojis in text contain emotion content and are universal across various language texts. This paper proposes a deep learning method for multi-class code-mixed emotion analysis using emojis. The proposed method has achieved good results on media text datasets and attempts to solve the difficulty of mixed sentiment classification with different codes. In this work, experiments were carried out on Hindi-English and Chinese-English code-mixed datasets, and the results have shown that integrating emoji representation with multilingual language model yielded 4% improvement on emotion analysis. Emojis should be preserved in emotion analysis regardless of which language the text is in.

References

[1]
Lal, Y.K., Kumar, V., Dhar, M., Shrivastava, M. and Koehn, P. 2019. De-Mixing Sentiment from Code-Mixed Text. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop (Florence, Italy, 2019), 371–377.
[2]
Santosh, T.Y.S.S. and Aravind, K.V.S. 2019. Hate Speech Detection in Hindi-English Code-Mixed Social Media Text. Proceedings of the ACM India Joint International Conference on Data Science and Management of Data (Kolkata India, Jan. 2019), 310–313.
[3]
Tang, T., Tang, X. and Yuan, T. 2020. Fine-Tuning BERT for Multi-Label Sentiment Analysis in Unbalanced Code-Switching Text. IEEE Access. 8, (2020), 193248–193256.
[4]
School of Computer Science & Technology, Kean University, Union, NJ 07083, USA, Rashid, A. and Huang, C. 2021. Sentiment Analysis on Consumer Reviews of Amazon Products. International Journal of Computer Theory and Engineering. 13, 2 (2021), 35–41.
[5]
Homaid, M.S., Bisandu, D.B., Moulitsas, I. and Jenkins, K. 2022. Analysing the Sentiment of Air-Traveller: A Comparative Analysis. International Journal of Computer Theory and Engineering. 14, 2 (2022), 48–53.
[6]
Vijay, T., Chawla, A., Dhanka, B. and Karmakar, P. 2020. Sentiment Analysis on COVID-19 Twitter Data. 2020 5th IEEE International Conference on Recent Advances and Innovations in Engineering (ICRAIE) (Jaipur, India, Dec. 2020), 1–7.
[7]
Wani, M.A., Agarwal, N., Jabin, S. and Hussain, S.Z. 2018. User emotion analysis in conflicting versus non-conflicting regions using online social networks. Telematics and Informatics. 35, 8 (Dec. 2018), 2326–2336.
[8]
Bostan, L.A.M. and Klinger, R. An Analysis of Annotated Corpora for Emotion Classification in Text. 16.
[9]
Sunitha, D., Patra, R.K., Babu, N.V., Suresh, A. and Gupta, S.C. 2022. Twitter sentiment analysis using ensemble based deep learning model towards COVID-19 in India and European countries. Pattern Recognition Letters. 158, (Jun. 2022), 164–170.
[10]
Sarkar, K. and Chakraborty, S. 2015. A Sentiment Analysis System for Indian Language Tweets. Mining Intelligence and Knowledge Exploration. R. Prasath, A.K. Vuppala, and T. Kathirvalavakumar, eds. Springer International Publishing. 694–702.
[11]
Shanmugalingam, K., Sumathipala, S. and Premachandra, C. 2018. Word Level Language Identification of Code Mixing Text in Social Media using NLP. 2018 3rd International Conference on Information Technology Research (ICITR) (Moratuwa, Sri Lanka, Dec. 2018), 1–5.
[12]
Appidi, A.R., Srirangam, V.K., Suhas, D. and Shrivastava, M. 2020. Creation of Corpus and analysis in Code-Mixed Kannada-English Twitter data for Emotion Prediction. Proceedings of the 28th International Conference on Computational Linguistics (Barcelona, Spain (Online), 2020), 6703–6709.
[13]
Yulianti, E., Kurnia, A., Adriani, M. and Duto, Y.S. 2021. Normalisation of Indonesian-English Code-Mixed Text and its Effect on Emotion Classification. International Journal of Advanced Computer Science and Applications. 12, 11 (2021).
[14]
Baali, M. and Ghneim, N. 2019. Emotion analysis of Arabic tweets using deep learning approach. Journal of Big Data. 6, 1 (Dec. 2019), 89.
[15]
Bălan, O., Moise, G., Petrescu, L., Moldoveanu, A., Leordeanu, M. and Moldoveanu, F. 2019. Emotion Classification Based on Biophysical Signals and Machine Learning Techniques. Symmetry. 12, 1 (Dec. 2019), 21.
[16]
Park, S.-H., Bae, B.-C. and Cheong, Y.-G. 2020. Emotion Recognition from Text Stories Using an Emotion Embedding Model. 2020 IEEE International Conference on Big Data and Smart Computing (BigComp) (Busan, Korea (South), Feb. 2020), 579–583.
[17]
H. Manguri, K., N. Ramadhan, R. and R. Mohammed Amin, P. 2020. Twitter Sentiment Analysis on Worldwide COVID-19 Outbreaks. Kurdistan Journal of Applied Research. (May 2020), 54–65.
[18]
Department of Software Engineering, Bahcesehir University, Besiktas, Istanbul, Turkey., Ayvaz, S. and Shiha, M.O. 2017. The Effects of Emoji in Sentiment Analysis. International Journal of Computer and Electrical Engineering. 9, 1 (2017), 360–369.
[19]
Eisner, B., Rocktäschel, T., Augenstein, I., Bošnjak, M. and Riedel, S. 2016. emoji2vec: Learning Emoji Representations from their Description. arXiv.
[20]
Chen, Y., Yuan, J., You, Q. and Luo, J. 2018. Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM. Proceedings of the 26th ACM international conference on Multimedia (Seoul Republic of Korea, Oct. 2018), 117–125.
[21]
Lou, Y., Zhang, Y., Li, F., Qian, T. and Ji, D. 2020. Emoji-Based Sentiment Analysis Using Attention Networks. ACM Transactions on Asian and Low-Resource Language Information Processing. 19, 5 (Sep. 2020), 1–13.
[22]
Liu, C., Fang, F., Lin, X., Cai, T., Tan, X., Liu, J. and Lu, X. 2021. Improving sentiment analysis accuracy with emoji embedding. Journal of Safety Science and Resilience. 2, 4 (Dec. 2021), 246–252.
[23]
Barry, E., Jameel, S. and Raza, H. 2022. Emojional: Emoji Embeddings. Advances in Computational Intelligence Systems. T. Jansen, R. Jensen, N. Mac Parthaláin, and C.-M. Lin, eds. Springer International Publishing. 312–324.
[24]
Ghosh, S., Priyankar, A., Ekbal, A. and Bhattacharyya, P. 2023. Multitasking of sentiment detection and emotion recognition in code-mixed Hinglish data. Knowledge-Based Systems. 260, (Jan. 2023), 110182.
[25]
Wang, Z., Li, S., Wu, F., Sun, Q. and Zhou, G. 2018. Overview of NLPCC 2018 Shared Task 1: Emotion Detection in Code-Switching Text. Natural Language Processing and Chinese Computing. M. Zhang, V. Ng, D. Zhao, S. Li, and H. Zan, eds. Springer International Publishing. 429–433.

Index Terms

  1. Impact of Emojis in Emotion Analysis on Code-Mixed Text

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    NLPIR '23: Proceedings of the 2023 7th International Conference on Natural Language Processing and Information Retrieval
    December 2023
    336 pages
    ISBN:9798400709227
    DOI:10.1145/3639233
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 March 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. BERT
    2. Code-mixed text
    3. Emoji
    4. Emotion analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    NLPIR 2023

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 36
      Total Downloads
    • Downloads (Last 12 months)36
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 01 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media