Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1277741.1277845acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

ARSA: a sentiment-aware model for predicting sales performance using blogs

Published: 23 July 2007 Publication History
  • Get Citation Alerts
  • Abstract

    Due to its high popularity, Weblogs (or blogs in short) present a wealth of information that can be very helpful in assessing the general public's sentiments and opinions. In this paper, we study the problem of mining sentiment information from blogs and investigate ways to use such information for predicting product sales performance. Based on an analysis of the complex nature of sentiments, we propose Sentiment PLSA (S-PLSA), in which a blog entry is viewed as a document generated by a number of hidden sentiment factors. Training an S-PLSA model on the blog data enables us to obtain a succinct summary of the sentiment information embedded in the blogs. We then present ARSA, an autoregressive sentiment-aware model, to utilize the sentiment information captured by S-PLSA for predicting product sales performance. Extensive experiments were conducted on a movie data set. We compare ARSA with alternative models that do not take into account the sentiment information, as well as a model with a different feature selection method. Experiments confirm the effectiveness and superiority of the proposed approach.

    References

    [1]
    J. Bar-Ilan. An outsider's view on "topic-oriented blogging". In WWW Alt.'04 pages 28--34, 2004.
    [2]
    D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research 2003.
    [3]
    Angelo Dalli. System for spatio-temporal analysis of online news and blogs. In WWW '06 pages 929--930, 2006.
    [4]
    A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of Royal Statistical Society B(39):1--38, 1977.
    [5]
    Miles Efron. The liberal media and right-wing conspiracies: using cocitation information to estimate political orientation in web documents. In CIKM '04 pages 390--398, 2004.
    [6]
    Walter Enders. Applied Econometric Time Series Wiley, New York, 2nd edition, 2004.
    [7]
    Daniel Gruhl, R. Guha, Ravi Kumar, Jasmine Novak, and Andrew Tomkins. The predictive power of online chatter. In KDD '05 pages 78--87, 2005.
    [8]
    Daniel Gruhl, R. Guha, David Liben-Nowell, and Andrew Tomkins. Information di. usion through blogspace. In WWW'04 pages 491--501, 2004.
    [9]
    Thomas Hofmann. Probabilistic latent semantic analysis. In UAI'99 1999.
    [10]
    Wolfgang Jank, Galit Shmueli, and Shanshan Wang. Dynamic, real-time forecasting of online auctions via functional models. In KDD '06 pages 580--585, 2006.
    [11]
    Jaap Kamps and Maarten Marx. Words with attitude. In Proc. of the First International Conference on Global WordNet pages 332--341, 2002.
    [12]
    Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. On the bursty evolution of blogspace. In WWW '03 pages 568--576, 2003.
    [13]
    Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. Structure and evolution of blogspace. Commun. ACM 47(12):35--39, 2004.
    [14]
    Zhiwei Li, Bin Wang, Mingjing Li, and Wei-Ying Ma. A probabilistic model for retrospective news event detection. In SIGIR '05 pages 106--113, 2005.
    [15]
    Bing Liu, Minqing Hu, and Junsheng Cheng. Opinion observer: analyzing and comparing opinions on the web. In WWW '05 pages 342--351, 2005.
    [16]
    Qiaozhu Mei, Chao Liu, Hang Su, and ChengXiang Zhai. A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In WWW '06 pages 533--542, 2006.
    [17]
    Qiaozhu Mei and ChengXiang Zhai. A mixture model for contextual text mining. In KDD '06 pages 649--655, 2006.
    [18]
    Bo Pang and Lillian Lee. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In ACL '04 pages 271--278, 2004.
    [19]
    Bo Pang and Lillian Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In ACL '05 pages 115--124, 2005.
    [20]
    Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan. Thumbs up? sentiment classification using machine learning techniques. In Proc. of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2002.
    [21]
    Technorati. URL:http://technorati. com/about/. Retrieved on January 27, 2007.
    [22]
    B. L. Tseng, J. Tatemura, and Y. Wu. Tomographic clustering to visualize blog communities as mountain views. In Proc. of 2nd Annual Workshop on the Weblogging Ecosystem 2005.
    [23]
    Peter D. Turney. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classi. cation of reviews. In ACL '02 pages 417--424, 2001.
    [24]
    Casey Whitelaw, Navendu Garg, and Shlomo Argamon. Using appraisal groups for sentiment analysis. In CIKM '05 pages 625--631, 2005.
    [25]
    Zhu Zhang and Balaji Varadarajan. Utility scoring of product reviews. In CIKM '06 pages 51--57, 2006.

    Cited By

    View all
    • (2024)Comparison of Machine Learning Approaches for Sentiment Analysis in SlovakElectronics10.3390/electronics1304070313:4(703)Online publication date: 9-Feb-2024
    • (2024)Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and ChallengesACM Computing Surveys10.1145/364847156:7(1-33)Online publication date: 14-Feb-2024
    • (2023)A Method for Predicting Movie Box-Office using Machine Learning2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC)10.1109/ICESC57686.2023.10192928(1228-1232)Online publication date: 6-Jul-2023
    • Show More Cited By

    Index Terms

    1. ARSA: a sentiment-aware model for predicting sales performance using blogs

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
      July 2007
      946 pages
      ISBN:9781595935977
      DOI:10.1145/1277741
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 23 July 2007

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. autoregressive model
      2. blog
      3. sentiment mining

      Qualifiers

      • Article

      Conference

      SIGIR07
      Sponsor:
      SIGIR07: The 30th Annual International SIGIR Conference
      July 23 - 27, 2007
      Amsterdam, The Netherlands

      Acceptance Rates

      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)42
      • Downloads (Last 6 weeks)3
      Reflects downloads up to

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Comparison of Machine Learning Approaches for Sentiment Analysis in SlovakElectronics10.3390/electronics1304070313:4(703)Online publication date: 9-Feb-2024
      • (2024)Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and ChallengesACM Computing Surveys10.1145/364847156:7(1-33)Online publication date: 14-Feb-2024
      • (2023)A Method for Predicting Movie Box-Office using Machine Learning2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC)10.1109/ICESC57686.2023.10192928(1228-1232)Online publication date: 6-Jul-2023
      • (2023)Machine learning-based new approach to films reviewSocial Network Analysis and Mining10.1007/s13278-023-01042-713:1Online publication date: 2-Mar-2023
      • (2023)Sentiment analysis using fuzzy logic: A comprehensive literature reviewWIREs Data Mining and Knowledge Discovery10.1002/widm.150913:5Online publication date: 20-Jun-2023
      • (2022)Enhancing Short-Term Sales Prediction with Microblogs: A Case Study of the Movie Box OfficeFuture Internet10.3390/fi1405014114:5(141)Online publication date: 4-May-2022
      • (2022)Sentiment Analysis of Twitter DataApplied Sciences10.3390/app12221177512:22(11775)Online publication date: 19-Nov-2022
      • (2022)A Machine Learning Approach to Predict the Outcome of Urinary Calculi Treatment Using Shock Wave Lithotripsy: Model Development and Validation StudyInteractive Journal of Medical Research10.2196/3335711:1(e33357)Online publication date: 16-Mar-2022
      • (2022)Equity returns and sentimentDependence Modeling10.1515/demo-2022-010910:1(159-176)Online publication date: 14-Jun-2022
      • (2022)Domain Adaptation with Pre-trained Transformers for Query-Focused Abstractive Text SummarizationComputational Linguistics10.1162/coli_a_0043448:2(279-320)Online publication date: 9-Jun-2022
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media