Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3368926.3369675acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

Vietnamese Sentiment Analysis for Hotel Review based on Overfitting Training and Ensemble Learning

Published: 04 December 2019 Publication History

Abstract

In this study, we propose a machine learning model in analyzing customer opinions based on Vietnamese text: the case of hotel service, classifying a review as a positive or a negative. In particular, our solution focuses on improvement: preprocessing, standardization, training data relabelling with Error Analysis method. Besides, training data is enhanced with emotional dictionary; 5-Fold Cross Validation and Confusion Matrix are used to control overfitting and underfitting and to test the model; Hyperparameter Tuning method is used to optimize model parameters; Ensemble Methods are used to combine several machine learning techniques into the most efficient predictive model. Used data is collected from website booking.com.

References

[1]
Ali Hasan, S. M. (n.d.) (2018). "Machine Learning-Based Sentiment Analysis for Twitter Accounts".
[2]
Alia Karim Abdul Hassan, A. B. (2017). Reviews Sentiment analysis for collaborative recommender system.
[3]
Anh - Nguyê Thi Lan (2013), "Nghiên cúu thuât toán hoc máy svm và úng dung trong bài toán khai phá y kiě ph´n hôi cúa khách hàng trên website", Luân vărn Thac sĩ, Hoc viě Công nghê Buu chính Viě thông.
[4]
Chang, C.L. (2011). "SVM: "A Library for Support Vector Machine".
[5]
Pang, B., Lee, L. and Vaithyanathan, S. (2012), "Thumbs up: Sentiment Classification Using Machine Learning Techniques.", Proceedings of the ACL-02 conference on Empirical methods in natural language processing 10, pp. 79--86.
[6]
Phu X. V. Nguyen, Tham T. T. Hong, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen (2018). "Deep Learning versus Traditional Classifiers on Vietnamese Students' Feedback Corpus". 2018 5th NAFOSTED Conference on Information and Computer Science (NICS);
[7]
Severyn, A. and Moschitti, A. (2015), "Twitter Sentiment Analysis with Deep Convolutional Neural Networks.", Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 959--962.
[8]
Thành, N.T. (2013). "Sentiment classification for vietnamese user reviews and its application to a sentiment analysis system". Luân vărn Thac sĩ, Đai hoc Công nghê, Đai hoc Quoc gia Hà Nôi.
[9]
Tang, D., Qin, B. and Liu, T. (2015), "Deep Learning for Sentiment Analysis: Successful Approaches and Future Challenges.", Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 5(6), pp. 292--303.
[10]
Tsytsarau, M. and Palpanas, T. (2012), "Survey on Mining Subjective Data on the Web.", Data Mining and Knowledge Discovery, 24(3), pp. 478--514.
[11]
Thin-Dang Van, Kiet-Van Nguyen, Ngan Luu-Thuy Nguyen (2018). "A Supervised Method For Aspect Based Sentiment Analysis". VLSP - 2018;
[12]
Thien Khai Tran, Tuoi Thi Phan (2019). "Deep Learning Application to Ensemble Learning - The Simple, but Effective, Approach to Sentiment Classifying". Applied Sciences - 2019;
[13]
Tran Sy BANG, Virach SORNLERTLAMVANICH (2018). "Sentiment Classification for Hotel Booking Review Based on Sentence Dependency Structure and Sub-Opinion Analysis". IEICE TRANS. INF. & SYST., VOL.E101--D, NO.4 APRIL 2018.
[14]
Quynh-Trang Thi Pham, Xuan-Truong Nguyen, Van-Hien Tran, Thi-Cham Nguyen, Mai-Vu Tran (2016). "Vietnamese Sentiment Analysis for Product Reviews". VLSP - 2016;
[15]
Vikram Elango, G. N. (2018), "Sentiment Analysis for Hotel Reviews". CS 229 Machine Learning, Final Projects, Autumn 2014.

Cited By

View all
  • (2023)Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language ModelsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/358913122:6(1-27)Online publication date: 4-Apr-2023
  • (2023)A Study of Vietnamese Sentiment Classification with Ensemble Pre-Trained Language ModelsVietnam Journal of Computer Science10.1142/S219688882350017311:01(137-165)Online publication date: 7-Dec-2023
  • (2023)A Comprehensive Ensemble Deep Learning Method for Identifying Native Advertising in News Articles2023 IEEE 8th International Conference On Software Engineering and Computer Systems (ICSECS)10.1109/ICSECS58457.2023.10256392(164-169)Online publication date: 25-Aug-2023
  • Show More Cited By

Index Terms

  1. Vietnamese Sentiment Analysis for Hotel Review based on Overfitting Training and Ensemble Learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      SoICT '19: Proceedings of the 10th International Symposium on Information and Communication Technology
      December 2019
      551 pages
      ISBN:9781450372459
      DOI:10.1145/3368926
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      In-Cooperation

      • SOICT: School of Information and Communication Technology - HUST
      • NAFOSTED: The National Foundation for Science and Technology Development

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 December 2019

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Ensemble learning
      2. Error analysis
      3. Hotel review classification
      4. Sentiment analysis
      5. Vietnamese sentiment classification

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      SoICT 2019

      Acceptance Rates

      Overall Acceptance Rate 147 of 318 submissions, 46%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)17
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 30 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language ModelsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/358913122:6(1-27)Online publication date: 4-Apr-2023
      • (2023)A Study of Vietnamese Sentiment Classification with Ensemble Pre-Trained Language ModelsVietnam Journal of Computer Science10.1142/S219688882350017311:01(137-165)Online publication date: 7-Dec-2023
      • (2023)A Comprehensive Ensemble Deep Learning Method for Identifying Native Advertising in News Articles2023 IEEE 8th International Conference On Software Engineering and Computer Systems (ICSECS)10.1109/ICSECS58457.2023.10256392(164-169)Online publication date: 25-Aug-2023
      • (2022)Sentiment Analysis based on word vector representation for short comments in Vietnamese language2022 9th NAFOSTED Conference on Information and Computer Science (NICS)10.1109/NICS56915.2022.10013426(165-169)Online publication date: 31-Oct-2022
      • (2022)Natural language processing applied to tourism research: A systematic review and future research directionsJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2022.10.01034:10(10125-10144)Online publication date: Nov-2022
      • (2022)A Text Classification for Vietnamese Feedback via PhoBERT-Based Deep LearningProceedings of Seventh International Congress on Information and Communication Technology10.1007/978-981-19-2394-4_24(259-272)Online publication date: 12-Jul-2022
      • (2022)Hyperparameter TuningApplied Data Science in Tourism10.1007/978-3-030-88389-8_12(231-251)Online publication date: 31-Jan-2022
      • (2021)Applying Sentiment Product Reviews and Visualization for BI Systems in Vietnamese E-Commerce Website: Focusing on Vietnamese ContextElectronics10.3390/electronics1020248110:20(2481)Online publication date: 12-Oct-2021
      • (2021)A Novel Approach for Enhancing Vietnamese Sentiment ClassificationAdvances and Trends in Artificial Intelligence. From Theory to Practice10.1007/978-3-030-79463-7_9(99-111)Online publication date: 26-Jul-2021

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media