Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3582197.3582198acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicitConference Proceedingsconference-collections
research-article

Performance Comparison of Deep Learning Text Embeddings in Sentiment Analysis Tasks with Online Consumer Reviews

Published: 30 March 2023 Publication History

Abstract

In order to investigate the effect of various natural language processing models on different data processing, this paper adopted the consumer reviews of two well-known Internet retailing websites: Yelp and Zappos, and used four text embedding methods: word2vec, Glove, BERT, and GPT-2 and two text classification methods: SVM and Neural Network (NN) for text classification, in order to compare the performance of the combinations of these text mining techniques. The result shows that BERT is the best-performing text embedding method overall in both datasets when used with both SVM and NN. It is also found that NN is better than SVM for overall text classification. As an exploratory experiment, we aim to provide a three-dimensional comparison to find the most suitable algorithm for consumer review data, and the implication is that BERT and NN can achieve satisfactory results in most of the scenarios.

References

[1]
Yin Zhang, Jinyang Du, Xiao Ma, Haoyu Wen & Giancarlo Fortino Cognitive Computation. 2021. Aspect-based sentiment analysis for user reviews. Cognitive Computation, 2021, 13(5): 1114-1127. https://doi.org/10.1007/s12559-021-09855-4
[2]
Li Jiaxuan, Pang Cheong-Iao Patrick, Xiao Yundan and Wong Dennis. 2022. Changes in Doctor-Patient Relationships in China during COVID-19: A Text Mining Analysis. International Journal of Environmental Research and Public Health, 19(20), 13446, MDPI.
[3]
Satanik Mitra, Mamata Jenamani. 2021. Helpfulness of online consumer reviews: A multi-perspective approach. Information Processing & Management, 2021, 58(3): 102538. https://doi.org/10.1016/Jipm.2021.102538
[4]
Corradini Enrico, Nocera Antonino, Ursino Domenico, Virgili Luca. 2021. Investigating negative reviews and detecting negative influencers in Yelp through a multi-dimensional social network based model. International Journal of Information Management, 2021, 60: 102377. https://doi.org/10.1016/Jijinfomgt.2021.102377
[5]
Akkinapally Yugendhar, Syed Mahamood Ali. 2017. Evaluation of implementing holacracy, a comprehensive study on Zappos.International Journal of Engineering and Management Research (IJEMR), 2017, 7(5): 163-171.
[6]
Alexander W. Olson, Fernando Calderón-Figuero, Olimpia Bidian Daniel Silver Scott Sanner. 2020. Reading the city through its neighbourhoods: Deep text embeddings of Yelp reviews as a basis for determining similarity and change. Cities, 2021, 110: 103045. https://doi.org/.10.1016/Jcities.2020.103045
[7]
Sajadul Hassan Kumhar, Mudasir M Kirmani, Jitendra Sheetlani, Mudasir Hassan.2021. Word embedding generation for Urdu language using Word2vec model Materials Today: Proceedings, 2021. https://doi.org/10.1016/Jmatpr.2020.11.766
[8]
Piyush Gupta, Inika Roy, Gunnika Batra, Arun Kumar Dubey. 2021. Decoding emotions in text Using GloVe embeddings. 2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS). IEEE, 2021: 36-40.10.1109/ICCCIS51004.2021.9397132
[9]
Alessandro Solbiati, Kevin Heffernan, Georgios Damaskinos, Shivani Poddar, Shubham Modi, Jacques Cali.2021 Unsupervised topic segmentation of meetings with BERT embeddings. arXiv preprint arXiv:2106.12978, 2021. https://doi.org/10.48550/arXiv.2106.12978
[10]
Pavel Khloponin, Leila Kosseim. 2021. Using Document Embeddings for Background Linking of News Articles. International Conference on Applications of Natural Language to Information Systems. Springer, Cham, 2021: 317-329. https://doi.org/10.1007/978-3-030-80599-9_28
[11]
Size Li, Yiqun Yu, Xiaolan Bian, Li Yao, Min Li, Yan-Ru Lou, Jing Yuan, Hai-Shu Lin, Lucy Liu, Bing Han, Xiaoqiang Xiang. Prediction of oral hepatotoxic dose of natural products derived from traditional Chinese medicines based on SVM classifier and PBPK modeling. Archives of Toxicology, 2021, 95(5): 1683-1701.
[12]
Pm Lavanya, E.Sasikala .2021. Deep learning techniques on text classification using Natural language processing (NLP) in social healthcare network: A comprehensive survey.2021 3rd International Conference on Signal Processing and Communication (ICPSC),2021: 603-609.
[13]
Xiaorui Shao, Chang-So Kim, and Kwak Dong Ryul .2019. A Study on Customers' Sentiment Analysis Based on Big Data Using Twitter Data. IACSIT Press, 2019(1).
[14]
Li Jiaxuan and Pang Cheong-Iao Patrick. 2022. Have Patients’ Attitudes towards Doctors Changed with COVID-19? Insights from Online Doctor Reviews in Beijing, China. Proceedings of 13th International Conference on E-business, Management and Economics (ICEME 2022), Beijing, China, 16-18 Jul 2022.
[15]
Marouane Birjali, Mohammed Kasri, Abderrahim Beni-Hssane. 2021.A comprehensive survey on sentiment analysis: Approaches, challenges and trends. Knowledge-Based Systems, 2021, 226: 107134. https://doi.org/10.1016/Jknosys.2021.107134
[16]
Ruifan Li, Hao Chen, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang, Eduard Hovy.2021. Dual graph convolutional networks for aspect-based sentiment analysis.Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021: 6319-6329.
[17]
Li-Chen Cheng, Yen-Liang Chen, Yuan-Yu Liao. 2021.Aspect-based sentiment analysis with component focusing multi-head co-attention networks. Neurocomputing, 2022, 489: 9-17. https://doi.org/10.1016/Jneucom.2022.03.027
[18]
Patrick C.-I. Pang. 2022. Performance Evaluation of Text Embeddings with Online Consumer Reviews in Retail Sectors. Proceedings of IEEE/ACIS 22nd International Hybrid Conference on Computer and Information Science (ICIS 2022).
[19]
Wenbo Zheng, Lan Yan, Chao Gou, Zhi-Cheng Zhang, Jun Jason Zhang, Ming Hu, Fei-Yue Wang. 2021. Pay attention to doctor–patient dialogues: multi-modal knowledge graph attention image-text embedding for COVID-19 diagnosis. Information Fusion, 2021, 75: 168-185.
[20]
Paul-Ambroise Duquenne, Hongyu Gong, Holger Schwenk. 2021.Multimodal and multilingual embeddings for large-scale speech mining. Advances in Neural Information Processing Systems, 2021, 34: 15748-15761.
[21]
Husien A. Alhammi and Kais Haddar. 2020. Building a Libyan Dialect Lexicon-Based Sentiment Analysis System Using Semantic Orientation of Adjective-Adverb Combinations . International Journal of Computer Theory and Engineering,12(6):145-150, 2020.
[22]
Aamir Rashid and Ching-yu Huang .2021. Sentiment Analysis on Consumer Reviews of Amazon Products, International Journal of Computer Theory and Engineering, 13(2):35-41
[23]
Ahmed K. Abbasa, Najim A. Al-haideria, Ali A. Bashikh. 2019. Implementing artificial neural networks and support vector machines to predict lost circulation. Egyptian Journal of Petroleum, 2019, 28(4): 339-347.
[24]
Amrutha Ragothaman and Ching-Yu Huang .2021. Sentiment Analysis on Covid-19 Twitter Data, International Journal of Computer Theory and Engineering, 13(4):100-107
[25]
Mohammed Salih Homaid, Desmond Bala Bisandu, Irene Moulitsas, and Karl Jenkins . 2022. Analysing the Sentiment of Air-Traveller: A Comparative Analysis, International Journal of Computer Theory and Engineering, 14(2):48-53
[26]
Pedro L. Rodriguez and Arthur Spirling. 2021. Word embeddings: What works, what doesn't, and how to tell the difference for applied research. The Journal of Politics, 2022, 84(1): 101-115.
[27]
Jeffrey Pennington, Richard Socher, Christopher D. Manning. 2014. Glove: Global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). 2014: 1532-1543.
[28]
Pei Liu, Xuemin Wang, Chao Xiang, Weiye Meng. 2020. A survey of text data augmentation. 2020 International Conference on Computer Communication and Network Security (CCNS). IEEE, 2020: 191-19
[29]
Youcef Benmahamed, Omar Kherif, Madjid Teguar, Ahmed Boubakeur and Sherif S. M. Ghoneim. 2021. Accuracy improvement of transformer faults diagnostic based on DGA data using SVM-BA classifier. Energies, 2021, 14(10): 2970. https://doi.org/10.3390/en14102970
[30]
Abeer Talib Ali, Mohammed Zakariah, Prashant Kumar Shukla, Aseel Alhadlaq. 2022. Human-Computer Interaction for Recognizing Speech Emotions Using Multilayer Perceptron Classifier. Journal of Healthcare Engineering, 2022, 2022.
[31]
Yong Yu, Xiaosheng Si, .Changhua Hu, Jianxun Zhang. 2019. A review of recurrent neural networks: LSTM cells and network architectures. Neural computation, 2019, 31(7): 1235-1270.
[32]
Dazi Li, Jianxun Liu, Zhiyu Zhang, Mingjie Yan, Yining Dong and Jun Liu. 2022. Fractional calculus & machine learning methods based rubber stress-strain relationship prediction, Molecular Simulation, 48:10, 944-954.

Cited By

View all
  • (2024)Natural language processing for analyzing online customer reviews: a survey, taxonomy, and open research challengesPeerJ Computer Science10.7717/peerj-cs.220310(e2203)Online publication date: 19-Jul-2024
  • (2023)BBPM: A Study of Information Pre-retrieval Models Based on Medical BERT Model2023 9th International Conference on Computer and Communications (ICCC)10.1109/ICCC59590.2023.10507322(2389-2393)Online publication date: 8-Dec-2023

Index Terms

  1. Performance Comparison of Deep Learning Text Embeddings in Sentiment Analysis Tasks with Online Consumer Reviews

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICIT '22: Proceedings of the 2022 10th International Conference on Information Technology: IoT and Smart City
      December 2022
      385 pages
      ISBN:9781450397438
      DOI:10.1145/3582197
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 March 2023

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. business analytics
      2. deep learning
      3. online consumer reviews
      4. text embedding

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      ICIT 2022
      ICIT 2022: IoT and Smart City
      December 23 - 25, 2022
      Shanghai, China

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)25
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 09 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Natural language processing for analyzing online customer reviews: a survey, taxonomy, and open research challengesPeerJ Computer Science10.7717/peerj-cs.220310(e2203)Online publication date: 19-Jul-2024
      • (2023)BBPM: A Study of Information Pre-retrieval Models Based on Medical BERT Model2023 9th International Conference on Computer and Communications (ICCC)10.1109/ICCC59590.2023.10507322(2389-2393)Online publication date: 8-Dec-2023

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media