Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2063576.2063726acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach

Published: 24 October 2011 Publication History

Abstract

Twitter is one of the biggest platforms where massive instant messages (i.e. tweets) are published every day. Users tend to express their real feelings freely in Twitter, which makes it an ideal source for capturing the opinions towards various interesting topics, such as brands, products or celebrities, etc. Naturally, people may anticipate an approach to receiving the common sentiment tendency towards these topics directly rather than through reading the huge amount of tweets about them. On the other side, Hashtags, starting with a symbol "#" ahead of keywords or phrases, are widely used in tweets as coarse-grained topics. In this paper, instead of presenting the sentiment polarity of each tweet relevant to the topic, we focus our study on hashtag-level sentiment classification. This task aims to automatically generate the overall sentiment polarity for a given hashtag in a certain time period, which markedly differs from the conventional sentence-level and document-level sentiment analysis. Our investigation illustrates that three types of information is useful to address the task, including (1) sentiment polarity of tweets containing the hashtag; (2) hashtags co-occurrence relationship and (3) the literal meaning of hashtags. Consequently, in order to incorporate the first two types of information into a classification framework where hashtags can be classified collectively, we propose a novel graph model and investigate three approximate collective classification algorithms for inference. Going one step further, we show that the performance can be remarkably improved using an enhanced boosting classification setting in which we employ the literal meaning of hashtags as semi-supervised information. Experimental results on a real-life data set consisting of 29,195 tweets and 2,181 hashtags show the effectiveness of the proposed model and algorithms.

References

[1]
R. Angelova and G. Weikum. Graph-based text classification: learn from your neighbors. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '06, pages 485--492, 2006.
[2]
L. Barbosa and J. Feng. Robust sentiment detection on twitter from biased and noisy data. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING '10, pages 36--44, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics.
[3]
D. Blaheta. Handling noisy training and testing data. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10, pages 111--116. Association for Computational Linguistics, 2002.
[4]
C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20:273--297, 1995.
[5]
D. Davidov, O. Tsur, and A. Rappoport. Enhanced sentiment learning using twitter hashtags and smileys. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING '10, pages 241--249, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics.
[6]
D. Davidov, O. Tsur, and A. Rappoport. Semi-supervised recognition of sarcastic sentences in twitter and amazon. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning, CoNLL '10, pages 107--116, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics.
[7]
X. Ding and B. Liu. The utility of linguistic rules in opinion mining. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, pages 811--812, New York, NY, USA, 2007. ACM.
[8]
M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '04, pages 168--177, New York, NY, USA, 2004. ACM.
[9]
L. Jiang, M. Yu, M. Zhou, X. Liu, and T. Zhao. Target-dependent twitter sentiment classification. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 151--160, Portland, Oregon, USA, June 2011. Association for Computational Linguistics.
[10]
C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In Proceeding of the 18th ACM conference on Information and knowledge management, CIKM '09, pages 375--384, New York, NY, USA, 2009. ACM.
[11]
Q. Lu and L. Getoor. Link-based classification. In Machine Learning, Proceedings of the Twentieth International Conference, ICML '03, pages 496--503. AAAI Press, 2003.
[12]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th international conference on World Wide Web, WWW '07, pages 171--180, New York, NY, USA, 2007. ACM.
[13]
P. Melville, N. Shah, L. Mihalkova, and R. Mooney. Experiments on ensembles with missing and noisy data. Multiple Classifier Systems, pages 293--302, 2004.
[14]
T. Nasukawa and J. Yi. Sentiment analysis: capturing favorability using natural language processing. In Proceedings of the 2nd International Conference on Knowledge Capture, K-CAP '03, pages 70--77, 2003.
[15]
J. Neville and D. Jensen. Iterative classification in relational data. In Proceedings of the AAAI 2000 Workshop Learning Statistical Models from Relational Data, AAAI '00, pages 42--49. AAAI Press, 2000.
[16]
B. Pang and L. Lee. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, ACL '04, pages 271--278, Stroudsburg, PA, USA, 2004. Association for Computational Linguistics.
[17]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? sentiment classification using machine learning techniques. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '02, 2002.
[18]
J. Pearl. Reverend bayes on inference engines: A distributed hierarchical approach. In Proceedings of the National Conference on Artificial Intelligence, AAAI '82, pages 133--136, 1982.
[19]
A. Rosenfeld, R. A. Hummel, and S. W. Zucker. Scene labeling by relaxation operations. IEEE Transactions on Systems, Man, and Cybernetics, SMC-6(6):420--433, 1976.
[20]
P. Sen and L. Getoor. Link-based classification. In Technical Report, 2007.
[21]
B. Taskar, P. Abbeel, and D. Koller. Discriminative probabilistic models for relational data. In Proceedings of the 18th Conference in Uncertainty in Artificial Intelligence, UAI '02, pages 485--492. Morgan Kaufmann, 2002.
[22]
T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 347--354. Association for Computational Linguistics, 2005.
[23]
T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity: an exploration of features for phrase-level sentiment analysis. Computational Linguistics, 35(3):399--433, 2009.
[24]
Y. Yang, S. Slattery, and R. Ghani. A study of approaches to hypertext categorization. J. Intell. Inf. Syst., 18:219--241, March 2002.
[25]
J. S. Yedidia, W. T. Freeman, and Y. Weiss. Generalized belief propagation. In Advances in Neural Information Processing Systems 13, Papers from Neural Information Processing Systems, NIPS '00, pages 689--695. MIT Press, 2000.
[26]
J. S. Yedidia, W. T. Freeman, and Y. Weiss. Constructing free-energy approximations and generalized belief propagation algorithms. IEEE Transactions on Information Theory, 51(7):2282--2312, 2005.
[27]
H. Yu and V. Hatzivassiloglou. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the 2003 conference on Empirical methods in natural language processing-Volume 10, pages 129--136. Association for Computational Linguistics, 2003.
[28]
W. Zhang, C. Yu, and W. Meng. Opinion retrieval from blogs. In Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, CIKM '07, pages 831--840, New York, NY, USA, 2007. ACM.
[29]
X. Zhu, X. Wu, and Y. Yang. Effective classification of noisy data streams with attribute-oriented dynamic classifier selection. Knowledge and Information Systems, 9(3):339--363, 2006.
[30]
L. Zhuang, F. Jing, and X. Y. Zhu. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, CIKM '06, pages 43--50, New York, NY, USA, 2006. ACM.

Cited By

View all
  • (2024)Twitter and the projection of political personalities in IndiaCommonwealth & Comparative Politics10.1080/14662043.2024.234885862:2(147-171)Online publication date: 9-Jun-2024
  • (2024)Utilizing cognitive signals generated during human reading to enhance keyphrase extraction from microblogsInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10361461:2Online publication date: 1-Mar-2024
  • (2024)Unifying context with labeled property graphExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.122269239:COnline publication date: 1-Apr-2024
  • Show More Cited By

Index Terms

  1. Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
    October 2011
    2712 pages
    ISBN:9781450307178
    DOI:10.1145/2063576
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 October 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. graph-based classification
    2. hashtag
    3. sentiment analysis
    4. twitter

    Qualifiers

    • Research-article

    Conference

    CIKM '11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)164
    • Downloads (Last 6 weeks)20
    Reflects downloads up to 10 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Twitter and the projection of political personalities in IndiaCommonwealth & Comparative Politics10.1080/14662043.2024.234885862:2(147-171)Online publication date: 9-Jun-2024
    • (2024)Utilizing cognitive signals generated during human reading to enhance keyphrase extraction from microblogsInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10361461:2Online publication date: 1-Mar-2024
    • (2024)Unifying context with labeled property graphExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.122269239:COnline publication date: 1-Apr-2024
    • (2024)Characteristics of opinions in the societal and non-societal domainsSocial Network Analysis and Mining10.1007/s13278-024-01306-w14:1Online publication date: 3-Aug-2024
    • (2024)Beyond the use of a novel Ensemble based Random Forest-BERT Model (Ens-RF-BERT) for the Sentiment Analysis of the hashtag COVID19 tweetsSocial Network Analysis and Mining10.1007/s13278-024-01240-x14:1Online publication date: 16-Apr-2024
    • (2024)Explainable empirical risk minimizationNeural Computing and Applications10.1007/s00521-023-09269-336:8(3983-3996)Online publication date: 1-Mar-2024
    • (2024)A Comprehensive Study on Reddit Users’ Attitude Toward ChatGPTInnovations in Data Analytics10.1007/978-981-97-4928-7_32(409-424)Online publication date: 10-Sep-2024
    • (2023)Sentiment Analysis of Comment Data Based on BERT-ETextCNN-ELSTMElectronics10.3390/electronics1213291012:13(2910)Online publication date: 3-Jul-2023
    • (2023)Sentiment of the Tweets on Russo-Ukrainian War: the Social Network Analysis2023 46th MIPRO ICT and Electronics Convention (MIPRO)10.23919/MIPRO57284.2023.10159770(1089-1095)Online publication date: 22-May-2023
    • (2023)What Tweets and YouTube comments have in common? Sentiment and graph analysis on data related to US elections 2020PLOS ONE10.1371/journal.pone.027054218:1(e0270542)Online publication date: 31-Jan-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media