Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1367497.1367514acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Opinion integration through semi-supervised topic modeling

Published: 21 April 2008 Publication History

Abstract

Web 2.0 technology has enabled more and more people to freely express their opinions on the Web, making the Web an extremely valuable source for mining user opinions about all kinds of topics. In this paper we study how to automatically integrate opinions expressed in a well-written expert review with lots of opinions scattering in various sources such as blogspaces and forums. We formally define this new integration problem and propose to use semi-supervised topic models to solve the problem in a principled way. Experiments on integrating opinions about two quite different topics (a product and a political figure) show that the proposed method is effective for both topics and can generate useful aligned integrated opinion summaries. The proposed method is quite general. It can be used to integrate a well written review with opinions in an arbitrary text collection about any topic to potentially support many interesting applications in multiple domains.

References

[1]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, 2003.
[2]
D. Dave and S. Lawrence. Mining the peanut gallery: opinion extraction and semantic classification of product reviews.
[3]
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statist. Soc. B, 39:1--38, 1977.
[4]
M. Gamon, A. Aue, S. Corston-Oliver, and E. K. Ringger. Pulse: Mining customer opinions from free text. In IDA, volume 3646 of Lecture Notes in Computer Science, pages 121--132, 2005.
[5]
T. Hofmann. Probabilistic latent semantic analysis. In Proc. of Uncertainty in Artificial Intelligence, UAI'99, Stockholm.
[6]
T. Hofmann. Probabilistic latent semantic indexing. In Proceedings of SIGIR '99, pages 50--57.
[7]
M. Hu and B. Liu. Mining and summarizing customer reviews. In KDD, pages 168--177.
[8]
M. Hu and B. Liu. Mining opinion features in customer reviews. In AAAI, pages 755--760.
[9]
W. Li and A. McCallum. Pachinko allocation: Dag-structured mixture models of topic correlations. In ICML '06: Proceedings of the 23rd international conference on Machine learning, pages 577--584.
[10]
B. Liu, M. Hu, and J. Cheng. Opinion observer: analyzing and comparing opinions on the web. In WWW '05: Proceedings of the 14th international conference on World Wide Web, pages 342--351.
[11]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: Modeling facets and opinions in weblogs. In Proceedings of the World Wide Conference 2007, pages 171--180.
[12]
Q. Mei, C. Liu, H. Su, and C. Zhai. A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In WWW '06: Proceedings of the 15th international conference on World Wide Web, pages 533--542.
[13]
Q. Mei and C. Zhai. A mixture model for contextual text mining. In KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 649--655.
[14]
G. Mishne and M. de Rijke. MoodViews: Tools for blog mood analysis. In AAAI 2006 Spring Symposium on Computational Approaches to Analysing Weblogs (AAAI-CAAW 2006), pages 153--154.
[15]
G. Mishne and N. Glance. Predicting movie sales from blogger sentiment. In AAAI 2006 Spring Symposium on Computational Approaches to Analysing Weblogs (AAAI-CAAW 2006).
[16]
Opinmind. http://www.opinmind.com.
[17]
A.-M. Popescu and O. Etzioni. Extracting product features and opinions from reviews. In HLT '05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 339--346.
[18]
M. F. Porter. An algorithm for su±x stripping. pages 313--316, 1997.
[19]
T. Tao and C. Zhai. Regularized estimation of mixture models for robust pseudo-relevance feedback. In SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 162--169.
[20]
C. Zhai and J. Lafferty. Model-based feedback in the language modeling approach to information retrieval. In Proceedings of CIKM 2001, pages 403--410.
[21]
C. Zhai, A. Velivelli, and B. Yu. A cross-collection mixture model for comparative text mining. In Proceedings of KDD '04, pages 743--748.
[22]
L. Zhuang, F. Jing, and X.-Y. Zhu. Movie review mining and summarization. In CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management, pages 43--50.

Cited By

View all
  • (2023)Aspect Extraction from E-Commerce Reviews Based on Word-Word Relationship Classification2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT)10.1109/ACAIT60137.2023.10528431(226-231)Online publication date: 10-Nov-2023
  • (2023)An integrated latent Dirichlet allocation and Word2vec method for generating the topic evolution of mental models from global to localExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118695212:COnline publication date: 1-Feb-2023
  • (2022)Opinion Mining and Sentiment AnalysisMachine Learning for Text10.1007/978-3-030-96623-2_15(491-514)Online publication date: 10-Feb-2022
  • Show More Cited By

Index Terms

  1. Opinion integration through semi-supervised topic modeling

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '08: Proceedings of the 17th international conference on World Wide Web
    April 2008
    1326 pages
    ISBN:9781605580852
    DOI:10.1145/1367497
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 April 2008

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. expert review
    2. opinion integration
    3. probabilistic topic modeling
    4. semi-supervised

    Qualifiers

    • Research-article

    Conference

    WWW '08
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)33
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Aspect Extraction from E-Commerce Reviews Based on Word-Word Relationship Classification2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT)10.1109/ACAIT60137.2023.10528431(226-231)Online publication date: 10-Nov-2023
    • (2023)An integrated latent Dirichlet allocation and Word2vec method for generating the topic evolution of mental models from global to localExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118695212:COnline publication date: 1-Feb-2023
    • (2022)Opinion Mining and Sentiment AnalysisMachine Learning for Text10.1007/978-3-030-96623-2_15(491-514)Online publication date: 10-Feb-2022
    • (2021)Bert-Based Latent Semantic Analysis (Bert-LSA): A Case Study on Geospatial Data Technology and Application Trend AnalysisApplied Sciences10.3390/app11241189711:24(11897)Online publication date: 14-Dec-2021
    • (2021) Semi-supervised Text Classification Based On Graph Attention Neural Networks * 2021 4th International Conference on Artificial Intelligence and Big Data (ICAIBD)10.1109/ICAIBD51990.2021.9459003(325-330)Online publication date: 28-May-2021
    • (2021)MASAD: A large-scale dataset for multimodal aspect-based sentiment analysisNeurocomputing10.1016/j.neucom.2021.05.040455(47-58)Online publication date: Sep-2021
    • (2021)Mining and classifying customer reviews: a surveyArtificial Intelligence Review10.1007/s10462-021-09955-554:8(6343-6389)Online publication date: 1-Mar-2021
    • (2021)A Semi-automated Approach for Identification of Trends in Android Ransomware LiteratureMachine Learning for Networking10.1007/978-3-030-70866-5_18(265-283)Online publication date: 3-Mar-2021
    • (2020)Minimally Supervised Categorization of Text with MetadataProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401168(1231-1240)Online publication date: 25-Jul-2020
    • (2020)Integration of Fuzzy and Deep Learning in Three-Way Decisions2020 International Conference on Data Mining Workshops (ICDMW)10.1109/ICDMW51313.2020.00019(71-78)Online publication date: Nov-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media