Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Rule Based Fuzzy Computing Approach on Self-Supervised Sentiment Polarity Classification with Word Sense Disambiguation in Machine Translation for Hindi Language

Published: 09 May 2023 Publication History
  • Get Citation Alerts
  • Abstract

    With increasing globalization, communication among people of diverse cultural backgrounds is also taking place to a very large extent in the present era. Issues like language diversity in various parts of the world can lead to hindrance in communication. The usage of social media and user-generated material has grown at an exponential rate and existing supervised sentiment polarity classification techniques need labelling for the training dataset. In this study, two problems have been analyzed. First, sentiment analysis of the Twitter dataset and sense disambiguation of morphologically rich Hindi language. A rule-based fuzzy logics-based system for self-supervised sentiment classification was used to compute and analyze the self-supervised or completely unsupervised sentiment categorization of a social-media dataset using three types of lexicons.  The combination of fuzzy with three different types of lexicons gives sentiment analysis a new path. The unsupervised fuzzy rules integrate the fuzziness of both negative as well as positive scores, and fuzzy logic-based systems can cope with ambiguity and vagueness. The fuzzy-system uses an unsupervised/self-supervised fuzzy rule-based technique to identify text using natural language processing (NLP) and sense of word. We compared the results of fuzzy rule based self-supervised sentiment classification by using three types of lexicons on five different datasets, with unsupervised as well as supervised sentiment classification techniques. Second, using cross-lingual sense embedding rather than cross-lingual word embedding resolves the ambiguity issue. The word sense embeddings are produced for the source languages to learn multiple or various senses of the words. Different evaluation metrics depict an improved performance for English-Hindi language.

    References

    [1]
    A. Abdi, S. M. Shamsuddin, S. Hasan, and J. Piran. 2019. Deep learning-based sentiment classification of evaluative text based on multi-feature fusion. Information Processing & Management 56, 4 (2019), 1245–1259.
    [2]
    B. Agarwal, N. Mittal, P. Bansal, and S. Garg. 2015. Sentiment analysis using common-sense and context information. Computational Intelligence and Neuroscience (2015).
    [3]
    S. Akhtar, D. Ghosal, A. Ekbal, P. Bhattacharyya, and S. Kurohashi. 2019. All-in-one: Emotion, sentiment and intensity prediction using a multi-task ensemble framework. IEEE Transactions on Affective Computing (2019).
    [4]
    N. Altrabsheh, M. Cocea, and S. Fallahkhair. 2014. Sentiment analysis: Towards a tool for analysing real-time students feedback. In 2014 IEEE 26th International Conference on Tools with Artificial Intelligence. IEEE. 419–423.
    [6]
    M. Artetxe, G. Labaka, E. Agirre, and K. Cho. 2017. Unsupervised neural machine translation. arXiv preprint arXiv:1710.11041.
    [7]
    M. Artetxe, G. Labaka, and E. Agirre. 2018. A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. arXiv preprint arXiv:1805.06297.
    [8]
    Muhammad Z. Asghar et al. 2021. Senti-eSystem: A sentiment-based eSystem-using hybridized fuzzy and deep neural network for measuring customer satisfaction. Software: Practice and Experience 51, 3 (2021), 571–594.
    [9]
    S. Baccianella, A. Esuli, and F. Sebastiani. 2010. SentiWordNet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In LREC 10, 2010, 2200–2204.
    [10]
    L. Barbosa and J. Feng. 2010. Robust sentiment detection on Twitter from biased and noisy data. In Coling 2010: Posters. 36–44.
    [11]
    S. Banerjee and A. Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization. 65–72.
    [12]
    Y. Bengio and Y. LeCun. 2007. Scaling learning algorithms towards AI. Large-Scale Kernel Machines 34, 5 (2007), 1–41.
    [13]
    R. Baruah, R. K. Mundotiya, and A. K. Singh. 2021. Low resource neural machine translation: Assamese to/from other Indo-Aryan (Indic) languages. Transactions on Asian and Low-Resource Language Information Processing 21, 1 (2021), 1–32.
    [14]
    E. Cambria, S. Poria, D. Hazarika, and K. Kwok. 2018. SenticNet 5: Discovering conceptual primitives for sentiment analysis by means of context embeddings. In Proceedings of the AAAI Conference on Artificial Intelligence 32, 1 (2018).
    [15]
    J. Devlin, M. W. Chang, K. Lee, and K. Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
    [16]
    G. Doddington. 2002. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In Proceedings of the Second International Conference on Human Language Technology Research. 138–145.
    [17]
    P. Domingos and M. Pazzani. 1997. On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning 29, 2 (1997), 103–130.
    [18]
    L. C. Duţu, G. Mauris, and P. Bolon. 2017. A fast and accurate rule-base generation method for Mamdani fuzzy systems. IEEE Transactions on Fuzzy Systems 26, 2 (2017), 715–733.
    [19]
    A. Go, R. Bhayani, and L. Huang. 2009. Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford 1, 12 (2009), 2009.
    [20]
    J. Godara, R. Aron, and M. Shabaz. 2022. Sentiment analysis and sarcasm detection from social network to train health-care professionals. World Journal of Engineering 19 1 (2022), 124–133.
    [21]
    C. Hutto and E. Gilbert. 2014. VADER: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the International AAAI Conference on Web and Social Media 8, 1 (2014).
    [23]
    J. S. R. Jang, C. T. Sun, and E. Mizutani. 1997. Neuro-fuzzy and soft computing-a computational approach to learning and machine intelligence [Book Review]. IEEE Transactions on Automatic Control 42, 10 (1997), 1482–1484.
    [24]
    A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov. 2016. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759.
    [25]
    A. Jurek, M. D. Mulvenna, and Y. Bi. 2015. Improved lexicon-based sentiment analysis for social media analytics. Security Informatics 4, 1 (2015), 1–13.
    [26]
    Y. Kim, J. Geng, and H. Ney. 2019. Improving unsupervised word-by-word translation with language model and denoising autoencoder. arXiv preprint arXiv:1901.01590.
    [27]
    D. P. Kingma and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
    [28]
    C. Y. Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out. 74–81.
    [29]
    B. Liu. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5, 1 (2012), 1–167.
    [30]
    V. López, S. Del Río, J. M. Benítez, and F. Herrera. 2015. Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data. Fuzzy Sets and Systems 258 (2015), 5–38.
    [31]
    S. S. Jain, R. Gupta, C. Tiwari, and N. Kaur. 2019. Behaviour of players on IPL based on fuzzy C means. International Journal of Innovative Technology and Exploring Engineering 8, 9S (2019), 150--154.
    [32]
    E. H. Mamdani and S. Assilian. 1975. An experiment in linguistic synthesis with a fuzzy logic controller. International Journal of Man-Machine Studies 7, 1 (1975), 1–13.
    [33]
    A. Montoro, J. A. Olivas, A. Peralta, F. P. Romero, and J. Serrano-Guerrero. 2018. An ANEW based fuzzy sentiment analysis model. In 2018 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE. 1–7.
    [34]
    R. Moraes, João Francisco Valiati, and Wilson P. Gavião Neto. 2013. Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Syst. Appl. 40, 2 (2013), 621–633.
    [35]
    A. Neviarouskaya, H. Prendinger, and M. Ishizuka. 2011. SentiFul: A lexicon for sentiment analysis. IEEE Transactions on Affective Computing 2, 1 (2011), 22–36.
    [36]
    F. Nielsen. 2011. AFINN. Richard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby.
    [37]
    A. Ortigosa, J. M. Martín, and R. M. Carro. 2014. Sentiment analysis in Facebook and its application to e-learning. Computers in Human Behavior 31 (2014), 527–541.
    [38]
    O. Oyebode, F. Alqahtani, and R. Orji. 2020. Using machine learning and thematic analysis methods to evaluate mental health apps based on user reviews. IEEE Access 8 (2020), 111141–111158.
    [39]
    K. Papineni, S. Roukos, T. Ward, and W. J. Zhu. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 311–318.
    [40]
    C. W. Park and D. R. Seo. 2018. Sentiment analysis of Twitter corpus related to artificial intelligence assistants. In 2018 5th International Conference on Industrial Engineering and Applications (ICIEA). IEEE. 495–498.
    [41]
    P. Patwa, G. Aguilar, S. Kar, S. Pandey, S. PYKL, B. Gambäck,… and A. Das. 2020. SemEval-2020 task 9: Overview of sentiment analysis of code-mixed tweets. arXiv e-prints, arXiv-2008.
    [42]
    M. Pelevina, N. Arefyev, C. Biemann, and A. Panchenko. 2017. Making sense of word embeddings. arXiv preprint arXiv:1708.03390.
    [43]
    I. Perikos and I. Hatzilygeroudis. 2013. Recognizing emotion presence in natural language sentences. In International Conference on Engineering Applications of Neural Networks. Springer, Berlin. 30–39.
    [44]
    Y. Qin, X. Wang, and Z. Xu. 2022. Ranking tourist attractions through online reviews: A novel method with intuitionistic and hesitant fuzzy information based on sentiment analysis. International Journal of Fuzzy Systems 24, 2 (2022), 755–777.
    [45]
    A. K. Sahoo, P. K. Sarangi, and C. S. Yadav. 2022. Indian sign language recognition using ensemble based classifier combination. Macromolecular Symposia 401, 1 (2022), 2100286.
    [46]
    N. Saleena. 2018. An ensemble classification system for Twitter sentiment analysis. Procedia Computer Science 132 (2018), 937–946.
    [47]
    J. A. Sanz, A. Fernández, H. Bustince, and F. Herrera. 2013. IVTURS: A linguistic fuzzy rule-based classification system based on a new interval-valued fuzzy reasoning method with tuning and rule selection. IEEE Transactions on Fuzzy Systems 21, 3 (2013), 399–411.
    [48]
    Jesus Serrano-Guerrero, Francisco P. Romero, and Jose A. Olivas. 2021. Fuzzy logic applied to opinion mining: A review. Knowledge-Based Systems 222 (2021), 107018.
    [49]
    M. Shabaz and U. Garg. 2020. Clustering Yelp's sentiment data through various approaches and estimating the error rate. Materials Today: Proceedings (2020).
    [50]
    N. M. Shelke, S. Deshpande, and V. Thakre. 2017. Exploiting expectation maximization algorithm for sentiment analysis of product reviews. In 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT). IEEE. 390–396.
    [51]
    Y. Shelke and C. Chakraborty. 2021. An end-to-end shape-preserving point completion network. IEEE Computer Graphics and Applications 41, 3 (2021), 124–138.
    [52]
    S. Shi, X. Wu, R. Su, and H. Huang. 2022. Low-resource neural machine translation: Methods and trends. Transactions on Asian and Low-Resource Language Information Processing.
    [53]
    C. Simionescu, I. Stoleru, D. Lucaci, G. Balan, I. Bute, and A. Iftene. 2019. UAIC at SemEval-2019 Task 3: Extracting much from little. In Proceedings of the 13th International Workshop on Semantic Evaluation. 355–359.
    [54]
    M. Sivakumar and R. U. Srinivasulu. 2021. Aspect-based sentiment analysis of mobile phone reviews using LSTM and fuzzy logic. International Journal of Data Science and Analytics 12, 4 (2021), 355–367.
    [55]
    M. Snover, N. Madnani, B. Dorr, and R. Schwartz. 2009. Fluency, adequacy, or HTER? Exploring different human judgments with a tunable MT metric. In Proceedings of the Fourth Workshop on Statistical Machine Translation. 259–268.
    [57]
    C. Strapparava and R. Mihalcea. 2007. SemEval-2007 task 14: Affective text. In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). 70–74.
    [58]
    H. Sun, R. Wang, M. Utiyama, B. Marie, K. Chen, E. Sumita, and T. Zhao. 2021. Unsupervised neural machine translation for similar and distant language pairs: An empirical study. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 20, 1 (2021), 1–17.
    [59]
    S. Tan, Y. Li, H. Sun, Z. Guan, X. Yan, J. Bu,… and X. He. 2013. Interpreting the public sentiment variations on Twitter. IEEE Transactions on Knowledge and Data Engineering 26, 5 (2013), 1158–1170.
    [60]
    M. Thelwall, K. Buckley, and G. Paltoglou. 2012. Sentiment strength detection for the social web. Journal of the American Society for Information Science and Technology 63, 1 (2012), 163–173.
    [61]
    A. Tripathy, A. Agrawal, and S. K. Rath. 2016. Classification of sentiment reviews using n-gram machine learning approach. Expert Systems with Applications 57 (2016), 117–126.
    [62]
    Adrià Torrens Urrutia, M. Dolores Jiménez-López, and Vilém Novák. 2021. Fuzzy natural logic for sentiment analysis: A proposal. International Symposium on Distributed Computing and Artificial Intelligence. Springer, Cham, 2021.
    [63]
    S. Vashishtha and S. Susan. 2019. Fuzzy rule based unsupervised sentiment analysis from social media posts. Expert Systems with Applications 138 (2019), 112834.
    [64]
    S. Vashishtha and S. Susan. 2020. Fuzzy interpretation of word polarity scores for unsupervised sentiment analysis. In 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT). IEEE. 1–6.
    [65]
    A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, ... and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems (2017), 30.
    [66]
    I. P. Windasari, F. N. Uzzi, and K. I. Satoto. 2017. Sentiment analysis on Twitter posts: An analysis of positive or negative opinion on GoJek. In 2017 4th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE). IEEE. 266–269.
    [67]
    M. Wöllmer, F. Weninger, T. Knaup, B. Schuller, C. Sun, K. Sagae, and L. P. Morency. 2013. YouTube movie reviews: Sentiment analysis in an audio-visual context. IEEE Intelligent Systems 28, 3 (2013), 46–53.
    [68]
    F. Xu, Z. Pan, and R. Xia. 2020. E-commerce product review sentiment classification based on a naïve Bayes continuous learning framework. Information Processing & Management 57, 5 (2020), 102221.
    [69]
    A. Yadav, C. K. Jha, A. Sharan, and V. Vaish. 2020. Sentiment analysis of financial news using unsupervised approach. Procedia Computer Science 167 (2020), 589–598.
    [70]
    Y. Yan, H. Yang, and H. M. Wang. 2017. Two simple and effective ensemble classifiers for Twitter sentiment analysis. In 2017 Computing Conference. IEEE. 1386–1393.
    [71]
    L. Yang, Y. Li, J. Wang, and R. S. Sherratt. 2020. Sentiment analysis for E-commerce product reviews in Chinese based on sentiment lexicon and deep learning. IEEE Access 8 (2020), 23522–23530.
    [72]
    S. Yoo, J. Song, and O. Jeong. 2018. Social media contents based sentiment analysis and prediction system. Expert Systems with Applications 105 (2018), 102–111.
    [73]
    L. A. Zadeh. 2015. Fuzzy logic—a personal perspective. Fuzzy Sets and Systems, 281, 4–20.

    Cited By

    View all
    • (2024)Dynamic decoding and dual synthetic data for automatic correction of grammar in low-resource scenarioPeerJ Computer Science10.7717/peerj-cs.212210(e2122)Online publication date: 5-Jul-2024
    • (2024)Deep learning based next word prediction aided assistive gaming technology for people with limited vocabularyEntertainment Computing10.1016/j.entcom.2024.10066150(100661)Online publication date: May-2024
    • (2023)Real Time Object Detection with Image Recognition and Web Scraper2023 3rd International Conference on Technological Advancements in Computational Sciences (ICTACS)10.1109/ICTACS59847.2023.10390064(280-283)Online publication date: 1-Nov-2023
    • Show More Cited By

    Index Terms

    1. Rule Based Fuzzy Computing Approach on Self-Supervised Sentiment Polarity Classification with Word Sense Disambiguation in Machine Translation for Hindi Language

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 5
      May 2023
      653 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3596451
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 09 May 2023
      Online AM: 22 February 2023
      Accepted: 06 November 2022
      Revised: 07 October 2022
      Received: 02 June 2022
      Published in TALLIP Volume 22, Issue 5

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Sentiment analysis
      2. fuzzy sets
      3. lexicon
      4. unsupervised sentiment classification
      5. self-learning

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)127
      • Downloads (Last 6 weeks)5
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Dynamic decoding and dual synthetic data for automatic correction of grammar in low-resource scenarioPeerJ Computer Science10.7717/peerj-cs.212210(e2122)Online publication date: 5-Jul-2024
      • (2024)Deep learning based next word prediction aided assistive gaming technology for people with limited vocabularyEntertainment Computing10.1016/j.entcom.2024.10066150(100661)Online publication date: May-2024
      • (2023)Real Time Object Detection with Image Recognition and Web Scraper2023 3rd International Conference on Technological Advancements in Computational Sciences (ICTACS)10.1109/ICTACS59847.2023.10390064(280-283)Online publication date: 1-Nov-2023
      • (2023)Optimizing Organic Food Sustainability Through Digital Platforms for Enhanced SEO2023 3rd International Conference on Technological Advancements in Computational Sciences (ICTACS)10.1109/ICTACS59847.2023.10389827(608-613)Online publication date: 1-Nov-2023
      • (2023)Handwritten Documents Conversion To Digital Documents2023 9th International Conference on Smart Computing and Communications (ICSCC)10.1109/ICSCC59169.2023.10335069(6-11)Online publication date: 17-Aug-2023
      • (2023)Machine Learning Approaches to Predict the Teams for Fantasy Leagues2023 International Conference on Recent Advances in Information Technology for Sustainable Development (ICRAIS)10.1109/ICRAIS59684.2023.10367068(54-58)Online publication date: 6-Nov-2023
      • (2023)Design and Application of Online Translation System Based on Web2023 IEEE International Conference on Paradigm Shift in Information Technologies with Innovative Applications in Global Scenario (ICPSITIAGS)10.1109/ICPSITIAGS59213.2023.10527596(100-105)Online publication date: 28-Dec-2023
      • (2023)Talk House a Drop-in Audio Webapp: A cross between a podcast and a conference call2023 IEEE 2nd International Conference on Industrial Electronics: Developments & Applications (ICIDeA)10.1109/ICIDeA59866.2023.10295249(38-43)Online publication date: 29-Sep-2023
      • (2023)Car Recognition System Using Convolutional Neural Network2023 6th International Conference on Contemporary Computing and Informatics (IC3I)10.1109/IC3I59117.2023.10398091(379-384)Online publication date: 14-Sep-2023
      • (2023)Knowledge Based Nested Frames of Discernment for Target Integrated Identification2023 China Automation Congress (CAC)10.1109/CAC59555.2023.10450752(1454-1459)Online publication date: 17-Nov-2023
      • Show More Cited By

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Full Text

      View this article in Full Text.

      Full Text

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media