DOI: 10.1145/2124295.2124371
research-article

A large-scale sentiment analysis for Yahoo! answers

Published: 08 February 2012

Abstract

Sentiment extraction from online web documents has recently been an active research topic due to its potential use in commercial applications. By sentiment analysis, we refer to the problem of assigning a quantitative positive/negative mood to a short bit of text. Most studies in this area are limited to the identification of sentiments and do not investigate the interplay between sentiments and other factors. In this work, we use a sentiment extraction tool to investigate the influence of factors such as gender, age, education level, the topic at hand, or even the time of the day on sentiments in the context of a large online question answering site. We start our analysis by looking at direct correlations, e.g., we observe more positive sentiments on weekends, very neutral ones in the Science & Mathematics topic, a trend for younger people to express stronger sentiments, or people in military bases to ask the most neutral questions. We then extend this basic analysis by investigating how properties of the (asker, answerer) pair affect the sentiment present in the answer. Among other things, we observe a dependence on the pairing of some inferred attributes estimated by a user's ZIP code. We also show that the best answers differ in their sentiments from other answers, e.g., in the Business & Finance topic, best answers tend to have a more neutral sentiment than other answers. Finally, we report results for the task of predicting the attitude that a question will provoke in answers. We believe that understanding factors influencing the mood of users is not only interesting from a sociological point of view, but also has applications in advertising, recommendation, and search.
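As a rough illustration of the first analysis step described in the abstract (scoring short texts and then comparing average mood across groups such as day of the week), here is a minimal Python sketch. It is not the sentiment extraction tool used in the paper: the word list `TOY_LEXICON`, the functions `signed_score` and `mean_score_by`, and the sample records are all hypothetical and chosen only to make the idea concrete.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy lexicon (word -> polarity weight). An assumption for
# illustration only; it is not taken from the paper or from any real tool.
TOY_LEXICON = {"love": 2, "great": 2, "good": 1, "bad": -1, "awful": -2, "hate": -2}


def signed_score(text):
    """Return a signed mood score for a short text.

    Positive values mean more positive words, negative values more
    negative words, and 0 means neutral under the toy lexicon.
    """
    words = [w.strip(".,!?").lower() for w in text.split()]
    return sum(TOY_LEXICON.get(w, 0) for w in words)


def mean_score_by(records, key):
    """Average the signed score of each record's text, grouped by `key`."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[key]].append(signed_score(rec["text"]))
    return {group: mean(scores) for group, scores in groups.items()}


if __name__ == "__main__":
    # Made-up sample records, not data from the study.
    sample = [
        {"day": "Sat", "text": "I love this, great answer!"},
        {"day": "Mon", "text": "This is bad."},
        {"day": "Mon", "text": "What is the boiling point of water?"},
    ]
    print(mean_score_by(sample, "day"))
```

The same grouping function could be applied to any attribute attached to a record (topic, inferred age band, and so on); the choice of attribute names here is an assumption.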

        Published In

        WSDM '12: Proceedings of the fifth ACM international conference on Web search and data mining
        February 2012
        792 pages
        ISBN:9781450307475
        DOI:10.1145/2124295

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. attitude
        2. collaborative question answering
        3. prediction
        4. sentiment analysis
        5. sentimentality

        Qualifiers

        • Research-article

        Conference

        Acceptance Rates

        Overall Acceptance Rate 498 of 2,863 submissions, 17%

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (last 12 months): 48
        • Downloads (last 6 weeks): 2
        Reflects downloads up to 30 Aug 2024


        Cited By

        • (2024) Textual Pre-Trained Models for Age Screening Across Community Question-Answering. IEEE Access 12, 30030-30038. DOI: 10.1109/ACCESS.2024.3368929. Online publication date: 2024
        • (2023) Improving Question Intent Identification by Exploiting Its Synergy With User Age. IEEE Access 11, 112044-112059. DOI: 10.1109/ACCESS.2023.3322457. Online publication date: 2023
        • (2023) Textual Pre-Trained Models for Gender Identification Across Community Question-Answering Members. IEEE Access 11, 3983-3995. DOI: 10.1109/ACCESS.2023.3235735. Online publication date: 2023
        • (2023) A deep penetration network for sentence classification. Information Fusion 95, 174-185. DOI: 10.1016/j.inffus.2023.02.015. Online publication date: Jul-2023
        • (2023) Secure Authentication and Reliable Cloud Storage Scheme for IoT-Edge-Cloud Integration. Journal of Grid Computing 21:3. DOI: 10.1007/s10723-023-09672-z. Online publication date: 27-Jun-2023
        • (2023) Visual Low-Code Language for Orchestrating Large-Scale Distributed Computing. Journal of Grid Computing 21:3. DOI: 10.1007/s10723-023-09666-x. Online publication date: 4-Jul-2023
        • (2023) Smart Caching in a Data Lake for High Energy Physics Analysis. Journal of Grid Computing 21:3. DOI: 10.1007/s10723-023-09664-z. Online publication date: 12-Jul-2023
        • (2022) Using Sentiment Analysis for Evaluating e-WOM. Research Anthology on Implementing Sentiment Analysis Across Multiple Disciplines, 1360-1383. DOI: 10.4018/978-1-6684-6303-1.ch070. Online publication date: 10-Jun-2022
        • (2022) Self-presentation and emotional contagion on Facebook: new experimental measures of profiles' emotional coherence. PSICOLOGIA DI COMUNITA', 13-33. DOI: 10.3280/PSC2022-002002. Online publication date: Oct-2022
        • (2022) Towards More Gender-Inclusive Q&As. Proceedings of the ACM on Human-Computer Interaction 6:CSCW2, 1-23. DOI: 10.1145/3555567. Online publication date: 11-Nov-2022
