Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Context-aware Emotion Detection from Low-resource Urdu Language Using Deep Neural Network

Published: 08 May 2023 Publication History

Abstract

Emotion detection (ED) plays a vital role in determining individual interest in any field. Humans use gestures, facial expressions, and voice pitch and choose words to describe their emotions. Significant work has been done to detect emotions from the textual data in English, French, Chinese, and other high-resource languages. However, emotion classification has not been well studied in low-resource languages (i.e., Urdu) due to the lack of labeled corpora. This article presents a publicly available Urdu Nastalique Emotions Dataset (UNED) of sentences and paragraphs annotated with different emotions and proposes a deep learning (DL)-based technique for classifying emotions in the UNED corpus. Our annotated UNED corpus has six emotions for both paragraphs and sentences. We perform extensive experimentation to evaluate the quality of the corpus and further classify it using machine learning and DL approaches. Experimental results show that the developed DL-based model performs better than generic machine learning approaches with an F1 score of 85% on the UNED sentence-based corpus and 50% on the UNED paragraph-based corpus.

References

[1]
Ahmad Abbasi, Abdul Rehman Javed, Chinmay Chakraborty, Jamel Nebhen, Wisha Zehra, and Zunera Jalil. 2021. ElStream: An ensemble learning approach for concept drift detection in dynamic social big data stream learning. IEEE Access 9 (2021), 66408–66419.
[2]
Malak Abdullah and Samira Shaikh. 2018. Teamuncc at SemEval-2018 task 1: Emotion detection in English and Arabic tweets using deep learning. In Proceedings of the 12th International Workshop on Semantic Evaluation. 350–357.
[3]
Akiko Aizawa. 2003. An information-theoretic perspective of tf–idf measures. Information Processing & Management 39, 1 (2003), 45–65.
[4]
Kholoud Alsmearat, Mohammed Shehab, Mahmoud Al-Ayyoub, Riyad Al-Shalabi, and Ghassan Kanaan. 2015. Emotion analysis of Arabic articles and its impact on identifying the author’s gender. In 2015 IEEE/ACS 12th International Conference of Computer Systems and Applications (AICCSA’15). IEEE, 1–6.
[5]
Nourah Alswaidan and Mohamed El Bachir Menai. 2019. KSU at SemEval-2019 task 3: Hybrid features for emotion recognition in textual conversation. In Proceedings of the 13th International Workshop on Semantic Evaluation. 247–250.
[6]
Kamran Amjad, Maria Ishtiaq, Samar Firdous, and Muhammad Amir Mehmood. 2017. Exploring Twitter news biases using Urdu-based sentiment lexicon. In 2017 International Conference on Open Source Systems & Technologies (ICOSST’17). IEEE, 48–53.
[7]
M. U. Arshad, M. F. Bashir, A. Majeed, W. Shahzad, and M. O. Beg. 2019. Corpus for emotion detection on Roman Urdu. In 2019 22nd International Multitopic Conference (INMIC’19). 1–6.
[8]
Ron Artstein and Massimo Poesio. 2008. Inter-coder agreement for computational linguistics. Computational Linguistics 34, 4 (2008), 555–596.
[9]
P. Ashokkumar, Siva G. Shankar, Gautam Srivastava, Praveen Kumar Reddy Maddikunta, and Thippa Reddy Gadekallu. 2021. A two-stage text feature selection algorithm for improving text classification. ACM Transactions on Asian and Low-resource Language Information Processing 20, 3 (2021).
[10]
Egils Avots and Gholamreza Anbarjafari. 2019. Multimodal database of emotional speech, video and gestures. In Pattern Recognition and Information Forensics: ICPR 2018 International Workshops, CVAUI, IWCF, and MIPPSNA, Revised Selected Papers, Vol. 11188. Springer, 153.
[11]
Gilbert Badaro, Obeida El Jundi, Alaa Khaddaj, Alaa Maarouf, Raslan Kain, Hazem Hajj, and Wassim El-Hajj. 2018. EMA at SemEval-2018 task 1: Emotion mining for Arabic. In Proceedings of the 12th International Workshop on Semantic Evaluation. 236–244.
[12]
Yves Bestgen. 2019. CECL at SemEval-2019 task 3: Using surface learning for detecting emotion in textual conversations. In Proceedings of the 13th International Workshop on Semantic Evaluation. 148–152.
[13]
Abdessalam Bouchekif, Praveen Joshi, Latifa Bouchekif, and Haithem Afli. 2019. EPITA-ADAPT at SemEval-2019 task 3: Detecting emotions in textual conversations using deep learning models combination. In Proceedings of the 13th International Workshop on Semantic Evaluation. 215–219.
[14]
Jinkun Chen, Cong Liu, and Ming Li. 2017. Automatic emotional spoken language text corpus construction from written dialogs in fictions. In 2017 7th International Conference on Affective Computing and Intelligent Interaction (ACII’17). IEEE, 319–324.
[15]
Xiyao Cheng, Ying Chen, Bixiao Cheng, Shoushan Li, and Guodong Zhou. 2017. An emotion cause corpus for Chinese microblogs with multiple-user structures. ACM Transactions on Asian and Low-resource Language Information Processing (TALLIP) 17, 1 (2017), 6.
[16]
Giovanni Costantini, Iacopo Iaderola, Andrea Paoloni, and Massimiliano Todisco. 2014. Emovo corpus: An Italian emotional speech database. In International Conference on Language Resources and Evaluation (LREC’14). European Language Resources Association (ELRA), 3501–3504.
[17]
Kodati Dheeraj and Tene Ramakrishnudu. 2021. Negative emotions detection on online mental-health related patients texts using the deep learning with MHA-BCNN model. Expert Systems with Applications (2021), 115265.
[18]
Hyo Jin Do and Ho-Jin Choi. 2015. Korean Twitter emotion classification using automatically built emotion lexicons and fine-grained features. In Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation: Posters. 142–150.
[19]
Raïssa Yapan Dougnon, Philippe Fournier-Viger, Jerry Chun-Wei Lin, and Roger Nkambou. 2015. Accurate online social network user profiling. In Joint German/Austrian Conference on Artificial Intelligence (Künstliche Intelligenz). Springer, 264–270.
[20]
Raïssa Yapan Dougnon, Philippe Fournier-Viger, Jerry Chun-Wei Lin, and Roger Nkambou. 2016. Inferring social network user profiles using a partial social graph. Journal of Intelligent Information Systems 47, 2 (2016), 313–344.
[21]
Changde Du, Changying Du, Jinpeng Li, Wei-long Zheng, Bao-liang Lu, and Huiguang He. 2017. Semi-supervised Bayesian deep multi-modal emotion recognition. arXiv preprint arXiv:1704.07548 (2017).
[22]
Samar Fathy, Nahla El-Haggar, and Mohamed H. Haggag. 2017. A hybrid model for emotion detection from text. International Journal of Information Retrieval Research (IJIRR) 7, 1 (2017), 32–48.
[23]
Zhiwei Guo, Keping Yu, Yu Li, Gautam Srivastava, and Jerry Chun-Wei Lin. 2021. Deep learning-embedded social internet of things for ambiguity-aware social recommendations. IEEE Transactions on Network Science and Engineering (2021).
[24]
Muhammad Hassan and Muhammad Shoaib. 2018. Opinion within opinion: Segmentation approach for urdu sentiment analysis. International Arab Journal on Information Technology 15, 1 (2018), 21–28.
[25]
Muhammad Humayoun, Rao Muhammad Adeel Nawab, Muhammad Uzair, Saba Aslam, and Omer Farzand. 2016. Urdu summary corpus. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC’16). 796–800.
[26]
Younghee Jung, Kinam Park, Taemin Lee, Jeongmin Chae, and Soonyoung Jung. 2017. A corpus-based approach to classifying emotions using Korean linguistic features. Cluster Computing 20, 1 (2017), 583–595.
[27]
Sawit Kasuriya, Thanaruk Theeramunkong, Chai Wutiwiwatchai, and Piyawat Sukhummek. 2019. Developing a Thai emotional speech corpus from Lakorn (EMOLA). Language Resources and Evaluation 53 (March2019), 1–39.
[28]
Dacher Keltner. 2004. Ekman, emotional expression, and the art of empirical epiphany. Journal of Research in Personality 38, 1 (2004), 37–44.
[29]
Hema Krishnan, M. Sudheep Elayidom, and T. Santhanakrishnan. 2017. Emotion detection of tweets using naïve Bayes classifier. Emotion (2017).
[30]
Marloes Kuijper, Mike van Lenthe, and Rik van Noord. 2018. Ug18 at SemEval-2018 task 1: Generating additional training data for predicting emotion intensity in spanish. arXiv preprint arXiv:1805.10824 (2018).
[31]
Jiyoung Lee and Yun Jung Choi. 2018. Understanding social viewing through discussion network and emotion: A focus on South Korean presidential debates. Telematics and Informatics 35, 5 (2018), 1382–1391.
[32]
Mirko Mazzoleni, Gabriele Maroni, and Fabio Previdi. 2017. Unsupervised learning of fundamental emotional states via word embeddings. In 2017 IEEE Symposium Series on Computational Intelligence (SSCI’17). IEEE, 1–6.
[33]
Mohamed Meddeb, Hichem Karray, and Adel M. Alimi. 2017. Building and analysing emotion corpus of the Arabic speech. In 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR’17). IEEE, 134–139.
[34]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[35]
Junko Minato, David B. Bracewell, Fuji Ren, and Shingo Kuroiwa. 2006. Statistical analysis of a Japanese emotion corpus for natural language processing. In International Conference on Intelligent Computing. Springer, 924–929.
[36]
Junko Minato, David B. Bracewell, Fuji Ren, and Shingo Kuroiwa. 2008. Japanese emotion corpus analysis and its use for automatic emotion word identification. Engineering Letters 16, 1 (2008).
[37]
Saif M. Mohammad and Peter D. Turney. 2013. Crowdsourcing a word–emotion association lexicon. Computational Intelligence 29, 3 (2013), 436–465.
[38]
Neelam Mukhtar and Mohammad Abid Khan. 2018. Urdu sentiment analysis using supervised machine learning approach. International Journal of Pattern Recognition and Artificial Intelligence 32, 2 (2018), 1851001.
[39]
Neelam Mukhtar, Mohammad Abid Khan, Nadia Chiragh, and Shah Nazir. 2018. Identification and handling of intensifiers for enhancing accuracy of Urdu sentiment analysis. Expert Systems 35, 6 (2018), e12317.
[40]
Myriam D. Munezero, Calkin Suero Montero, Erkki Sutinen, and John Pajunen. 2014. Are they different? Affect, feeling, emotion, sentiment, and opinion detection in text. IEEE Transactions on Affective Computing 5, 2 (2014), 101–111.
[41]
Rutvija Pandya and Jayati Pandya. 2015. C5. 0 algorithm to improved decision tree with feature selection and reduced error pruning. International Journal of Computer Applications 117, 16 (2015), 18–21.
[42]
Robert Plutchik. 1980. A general psychoevolutionary theory of emotion. In Theories of Emotion. Elsevier, 3–33.
[43]
Changqin Quan and Fuji Ren. 2009. Construction of a blog emotion corpus for Chinese emotional expression analysis. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3. Association for Computational Linguistics, 1446–1454.
[44]
Ebin Deni Raj, Gunasekaran Manogaran, Gautam Srivastava, and Yulei Wu. 2020. Information granulation-based community detection for social networks. IEEE Transactions on Computational Social Systems 8, 1 (2020), 122–133.
[45]
Zia Ul Rehman and Imran Sarwar Bajwa. 2016. Lexicon-based sentiment analysis for Urdu language. In 2016 6th International Conference on Innovative Computing Technology (INTECH’16). IEEE, 497–501.
[46]
Xin Rong. 2014. Word2vec parameter learning explained. arXiv preprint arXiv:1411.2738 (2014).
[47]
Ali Saeed, Rao Muhammad Adeel Nawab, Mark Stevenson, and Paul Rayson. 2019. A sense annotated corpus for all-words Urdu word sense disambiguation. ACM Transactions on Asian and Low-resource Language Information Processing (TALLIP) 18, 4 (2019), 40.
[48]
Kashfia Sailunaz, Manmeet Dhaliwal, Jon Rokne, and Reda Alhajj. 2018. Emotion detection from text and speech: A survey. Social Network Analysis and Mining 8, 1 (2018), 28.
[49]
Mary Jane C. Samonte, Hector Irvin B. Punzalan, Richard Julian Paul G. Santiago, and Peter Joshua L. Linchangco. 2017. Emotion detection in blog posts using keyword spotting and semantic analysis. In Proceedings of the 3rd International Conference on Communication and Information Processing. ACM, 6–13.
[50]
Kazuki Sato and Tomonobu Ozaki. 2019. Estimation of emotion type and intensity in Japanese tweets using multi-task deep learning. In Workshops of the International Conference on Advanced Information Networking and Applications. Springer, 314–323.
[51]
Abhinav Sethy and Bhuvana Ramabhadran. 2008. Bag-of-word normalized n-gram models. In 9th Annual Conference of the International Speech Communication Association.
[52]
Neel Shah, Gautam Srivastava, David W. Savage, and Vijay Mago. 2020. Assessing canadians health activity and nutritional habits through social media. Frontiers in Public Health 7 (2020), 400.
[53]
Sergey Smetanin. 2019. EmoSense at SemEval-2019 task 3: Bidirectional LSTM network for contextual emotion detection in textual conversations. In Proceedings of the 13th International Workshop on Semantic Evaluation. 210–214.
[54]
Mohamed Soltani, Hafed Zarzour, and Mohamed Chaouki Babahenini. 2018. Facial emotion detection in massive open online courses. In World Conference on Information Systems and Technologies. Springer, 277–286.
[55]
Afraz Zahra Syed, Muhammad Aslam, and Ana Maria Martinez-Enriquez. 2011. Sentiment analysis of Urdu language: Handling phrase-level negation. In Mexican International Conference on Artificial Intelligence. Springer, 382–393.
[56]
Afraz Z. Syed, Muhammad Aslam, and Ana Maria Martinez-Enriquez. 2014. Associating targets with SentiUnits: A step forward in sentiment analysis of Urdu text. Artificial Intelligence Review 41, 4 (2014), 535–561.
[57]
Mansur Alp Tocoglu and Adil Alpkocak. 2014. Emotion extraction from turkish text. In 2014 European Network Intelligence Conference. IEEE, 130–133.
[58]
Mansur Alp Tocoglu and Adil Alpkocak. 2018. TREMO: A dataset for emotion analysis in Turkish. Journal of Information Science 44, 6 (2018), 848–860.
[59]
Anthony J. Viera, Joanne M. Garrett, et al. 2005. Understanding interobserver agreement: The kappa statistic. Family Medicine 37, 5 (2005), 360–363.
[60]
Deepanshu Vijay, Aditya Bohra, Vinay Singh, Syed Sarfaraz Akhtar, and Manish Shrivastava. 2018. Corpus creation and emotion prediction for Hindi-English code-mixed social media text. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop. 128–135.
[61]
Takashi Yamazaki and Minoru Nakayama. 2017. Extracting acoustic features of Japanese speech to classify emotions. In FedCSIS Communication Papers. 141–145.
[62]
Liang Yang and Hongfei Lin. 2012. Construction and application of Chinese emotional corpus. In Workshop on Chinese Lexical Semantics. Springer, 122–133.
[63]
Wisha Zehra, Abdul Rehman Javed, Zunera Jalil, Habib Ullah Khan, and Thippa Reddy Gadekallu. 2021. Cross corpus multi-lingual speech emotion recognition using ensemble learning. Complex & Intelligent Systems (2021), 1–10.
[64]
Dongyu Zhang, Hongfei Lin, Liang Yang, Shaowu Zhang, and Bo Xu. 2018. Construction of a Chinese corpus for the analysis of the emotionality of metaphorical expressions. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 144–150.
[65]
Dongyu Zhang, Hongfei Lin, Puqi Zheng, Liang Yang, and Shaowu Zhang. 2018. The identification of the emotionality of metaphorical expressions based on a manually annotated Chinese corpus. IEEE Access 6 (2018), 71241–71248.
[66]
Jialiang Zhao and Qi Gao. 2017. Annotation and detection of emotion in text-based dialogue systems with CNN. arXiv preprint arXiv:1710.00987 (2017).
[67]
J. R. Landis and G. G. Koch. 1977. The measurement of observer agreement for categorical data. Biometrics, 159–e174.

Cited By

View all
  • (2024)Understanding hate speech: the HateInsights dataset and model interpretabilityPeerJ Computer Science10.7717/peerj-cs.237210(e2372)Online publication date: 2-Oct-2024
  • (2024)Improved optimizer with deep learning model for emotion detection and classificationMathematical Biosciences and Engineering10.3934/mbe.202429021:7(6631-6657)Online publication date: 2024
  • (2024)Augmenting context with power information for green context-awareness in smart environmentsFrontiers in Computer Science10.3389/fcomp.2024.13655006Online publication date: 7-Mar-2024
  • Show More Cited By

Index Terms

  1. Context-aware Emotion Detection from Low-resource Urdu Language Using Deep Neural Network

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian and Low-Resource Language Information Processing
    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 5
    May 2023
    653 pages
    ISSN:2375-4699
    EISSN:2375-4702
    DOI:10.1145/3596451
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 May 2023
    Online AM: 01 April 2022
    Accepted: 23 March 2022
    Revised: 28 December 2021
    Received: 24 September 2021
    Published in TALLIP Volume 22, Issue 5

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Emotion detection (ED)
    2. Urdu Nastalique Emotions Dataset (UNED)
    3. annotated UNED corpus

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)427
    • Downloads (Last 6 weeks)28
    Reflects downloads up to 22 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Understanding hate speech: the HateInsights dataset and model interpretabilityPeerJ Computer Science10.7717/peerj-cs.237210(e2372)Online publication date: 2-Oct-2024
    • (2024)Improved optimizer with deep learning model for emotion detection and classificationMathematical Biosciences and Engineering10.3934/mbe.202429021:7(6631-6657)Online publication date: 2024
    • (2024)Augmenting context with power information for green context-awareness in smart environmentsFrontiers in Computer Science10.3389/fcomp.2024.13655006Online publication date: 7-Mar-2024
    • (2024)A Deep Learning Approach for Automated Depression Assessment Using Roman UrduIEEE Access10.1109/ACCESS.2024.351926412(193387-193401)Online publication date: 2024
    • (2024)Enhancing Emoji-Based Sentiment Classification in Urdu Tweets: Fusion Strategies With Multilingual BERT and Emoji EmbeddingsIEEE Access10.1109/ACCESS.2024.344689712(126587-126600)Online publication date: 2024
    • (2024)Revolutionizing Urdu Sentiment Analysis: Harnessing the Power of XLM-R and GPT-2IEEE Access10.1109/ACCESS.2024.342949612(99779-99793)Online publication date: 2024
    • (2024)Emotion Recognition and Management in the Tourism Industry During Emergency Events Using Improved Convolutional Neural NetworkIEEE Access10.1109/ACCESS.2024.337043112(32660-32667)Online publication date: 2024
    • (2024)Challenges and Opportunities of Text-Based Emotion Detection: A SurveyIEEE Access10.1109/ACCESS.2024.335635712(18416-18450)Online publication date: 2024
    • (2024)A systematic review of trimodal affective computing approachesExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.124852255:PDOnline publication date: 21-Nov-2024
    • (2024)A Transfer-Based Deep Learning Model for Persian Emotion ClassificationMultimedia Tools and Applications10.1007/s11042-024-19668-wOnline publication date: 4-Jul-2024
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media