Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/WI-IAT.2013.47acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
Article

A Novel Hybrid HDP-LDA Model for Sentiment Analysis

Published: 17 November 2013 Publication History

Abstract

Sentiment analysis studies the public opinions towards an entity, and it is an important research area in data mining. Recently, a lot of sentiment analysis models have been proposed, including supervised and unsupervised approaches. However, the role of supervised models has been undermined by the phenomenon of big data, and the unsupervised ones are drawing more and more attention. But, most current unsupervised methods are based on Latent Dirichlet Allocation (LDA), and they need to specify the number of aspects in advance, making them subjective. In addition, these methods treat factual words and opinioned words the same, and assume that one sentence contains only one aspect, all of which make the existing unsupervised methods unsatisfactory. To solve these problems, this paper proposes a novel hybrid Hierarchical Dirichlet Process-Latent Dirichlet Allocation (HDP-LDA) model. This model can automatically determine the number of aspects, distinguish factual words from opinioned words, and further effectively extracts the aspect specific sentiment words. Experiment result shows that our model can clearly capture the aspects people mentioned and the specific sentiment words they use in each aspect, improving the performance of sentiment analysis efficiently. At last, we compared our model with the influential topic models, namely, JST, AUSM and Maxine-LDA, on the online restaurant review, and found our model performs very well.

References

[1]
Carbonell, J., Subjective Understanding: Computer Models of Belief Systems. PhD thesis, Yale, 1979.
[2]
Wilks, Y. and J. Bien, Beliefs, Points of View and Multiple Environments Proceedings of the international NATO symposium on artificial and human intelligence, 1984: p. 147-171.
[3]
Teh, Y.W., et al., Hierarchical Dirichlet processes. Journal of the American Statistical Association, 2006. 101(476): p. 1566-1581.
[4]
Blei, D.M., A.Y. Ng, and M.I. Jordan, Latent Dirichlet allocation. Journal of Machine Learning Research, 2003. 3(4-5): p. 993-1022.
[5]
Zhao, W.X., et al., Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, page 56-65, 2010.
[6]
Jo, Y. and A. Oh, Aspect and Sentiment Unification Model for Online Review Analysis. WSDM'11 February 9-12, Hong Kong, China, 2011.
[7]
Xu, X., et al., Towards Jointly Extracting Aspects and Aspect-Specific Sentiment Knowledge KCIKM'12, October 29-November, 2012 Maui, HI, USA, 2012.
[8]
Hu, M. and B. Liu, Mining and Summarizing Customer Reviews. KDD'04, August 22-25, 2004, Seattle, Washington, USA, 2004.
[9]
Jin, W. and H.H. Ho, A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining. Proceedings of the 26th International Conference on Machine Learning, Montreal, Canada, 2009.
[10]
Wu, Y., et al., Phrase Dependency Parsing for Opinion Mining. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Process, pages 1533-1541, 2009.
[11]
Mei, Q., et al., Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs. WWW 2007, May 8-12, 2007, Banff, Alberta, Canada, 2007.
[12]
Hofmann, T., Probabilistic latent semantic indexing. Processdings of the 22nd Annal International ACM SIGIR Conference on Research and Development in Information Retrieval, 1999.
[13]
Lin, C. and Y. He, Joint Sentiment/Topic Model for Sentiment Analysis. CIKM'09, Novermber 2-6, 2009, Hong Kong, China, 2009.
[14]
Brody, S. and N. Elhadad, An Unsuperivised Aspect-Sentiment Model for Online Review. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL, pages 804-812, Los Angeles, California, June, 2010, 2010.
[15]
Blei, D., COS 597C: Bayesian nonparametrics. Lecture Notes in Priceton University. http://www. cs.princeton. edu/courses/archive/fall07/cos597C/scribe/20070921.pdf, 2007.
[16]
Pitman, J., Combinatorial Stochastic Processes. Lecture Notes for St. Flour Course, July 2002, 2002.
[17]
Ganu, G., N. Elhadad, and A. Marian, Beyond the Stars: Improving Rating Predictions using Revew Text Cotent. Twelfth International Workshop on the Web and Databases, Providence, Rhode, Island, USA, 2009.

Cited By

View all
  • (2021)Deep spatio-temporal emotion analysis of geo-tagged tweets for predicting location based communal emotion during COVID-19 Lock-downJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-21054441:2(3251-3264)Online publication date: 1-Jan-2021
  • (2016)Science communication and dissemination in different culturesJournal of the Association for Information Science and Technology10.1002/asi.2346167:6(1473-1486)Online publication date: 1-Jun-2016

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WI-IAT '13: Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 01
November 2013
609 pages
ISBN:9780769551456

Sponsors

Publisher

IEEE Computer Society

United States

Publication History

Published: 17 November 2013

Check for updates

Author Tags

  1. Aspect Detection
  2. Hierarchical Dirichlet Process
  3. Latent Dirichlet Allocation
  4. Probabilistic Model
  5. Sentiment Analysis

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Deep spatio-temporal emotion analysis of geo-tagged tweets for predicting location based communal emotion during COVID-19 Lock-downJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-21054441:2(3251-3264)Online publication date: 1-Jan-2021
  • (2016)Science communication and dissemination in different culturesJournal of the Association for Information Science and Technology10.1002/asi.2346167:6(1473-1486)Online publication date: 1-Jun-2016

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media