Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2539150.2539190acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

Automatic Domain-Specific Sentiment Lexicon Generation with Label Propagation

Published: 02 December 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Nowadays, the advance of social media has led to the explosive growth of opinion data. Therefore, sentiment analysis has attracted a lot of attentions. Currently, sentiment analysis applications are divided into two main approaches, the lexicon-based approach and the machine-learning approach. However, both of them face the challenge of obtaining a large amount of human-labeled training data and corpus. For the lexicon-based approach, it requires a sentiment lexicon to determine the opinion polarity. There are many existing benchmark sentiment lexicons, but they cannot cover all the domain-specific words meanings. Thus, automatic generation of a domain-specific sentiment lexicon becomes an important task. We propose a framework to automatically generate sentiment lexicon. First, we determine the semantic similarity between two words in the entire unlabeled corpus. We treat the words as nodes and similarities as weighted edges to construct word graphs. A graph-based semi-supervised label propagation method finally assigns the polarity to unlabeled words through the proposed propagation process. Experiments conducted on the microblog data, Twitter, show that our approach leads to a better performance than baseline approaches and general-purpose sentiment dictionaries.

    References

    [1]
    A. E. a. F. Sebastiani, "SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining," in In Proceedings of the 5th Conference on Language Resources and Evaluation(LREC), 2006, pp. 417--422.
    [2]
    S.-M. Kim and E. Hovy, "Identifying and analyzing judgment opinions," presented at the Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (ACL), New York, New York, 2006.
    [3]
    J. Kamps, M. Marx, R. J. Mokken, and M. D. Rijke, "Using wordnet to measure semantic orientation of adjectives," in National Institute for, ed, 2004, pp. 1115--1118.
    [4]
    M. Hu and B. Liu, "Mining and summarizing customer reviews," presented at the Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, Seattle, WA, USA, 2004.
    [5]
    D. Rao and D. Ravichandran, "Semi-supervised polarity lexicon induction," presented at the Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, Athens, Greece, 2009.
    [6]
    V. Hatzivassiloglou and K. R. McKeown, "Predicting the semantic orientation of adjectives," 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, pp. 174--181, 1997.
    [7]
    P. D. Turney, "Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews," presented at the Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, Pennsylvania, 2002.
    [8]
    P. D. Turney and M. L. Littman, "Measuring praise and criticism: Inference of semantic orientation from association," ACM Trans. Inf. Syst., vol. 21, pp. 315--346, 2003.
    [9]
    T. Loughran and B. McDonald, "When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks," Journal of Finance, 2010.
    [10]
    Y. Lu, M. Castellanos, U. Dayal, and C. Zhai, "Automatic construction of a context-aware sentiment lexicon: an optimization approach," presented at the Proceedings of the 20th international conference on World wide web, Hyderabad, India, 2011.
    [11]
    B. O'Connor, R. Balasubramanyan, B. R. Routledge, and N. A. Smith, "From tweets to polls: Linking text sentiment to public opinion time series," Fourth International AAAI Conference on Weblogs and Social Media, 2010.
    [12]
    J. Bollen, A. Pepe, and H. Mao, "Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena," CoRR, pp.-1--1, 2009.
    [13]
    J. Bollen, H. Mao, and X. Zeng, "Twitter mood predicts the stock market," Journal of Computational Science, vol. 2, pp. 1--8, 2011.
    [14]
    G. P. C. Fung, J. X. Yu, and W. Lam, "News Sensitive Stock Trend Prediction," presented at the Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, 2002.
    [15]
    V. Lavrenko, M. Schmill, D. Lawrie, P. Ogilvie, D. Jensen, and J. Allan, "Mining of Concurrent Text and Time Series," In proceedings of the 6 th ACM SIGKDD Int'l Conference on Knowledge Discovery and Data Mining Workshop on Text Mining, 2000.
    [16]
    G. Gidofalvi, "Using News Articles to Predict Stock Price Movements," Department of Computer Science and Engineering University of California, San Diego, June 15 2001.
    [17]
    E. J. Ruiz, V. Hristidis, C. Castillo, and A. Gionis, "Correlating Financial Time Series with Micro-Blogging Activity," Proceedings of the fifth ACM international conference on Web search and data mining (WSDM), pp. 513--522 2012.
    [18]
    H. Mao, S. Counts, and J. Bollen, "Predicting Financial Markets- Comparing Survey, news, twitter and search engine data," 2011.
    [19]
    K. Gimpel, N. Schneider, B. O'Connor, D. Das, D. Mills, J. Eisenstein, et al., "Part-of-speech tagging for Twitter: annotation, features, and experiments," presented at the Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2(ACL), Portland, Oregon, 2011.
    [20]
    G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. J. Miller, "Introduction to WordNet: An On-line Lexical Database," International Journal of Lexicography(special issue), pp. 3(4):235--312, 1990.
    [21]
    M.-c. D. Marneff, B. Maccartney, and C. D. Manning, "GeneratingTyped Dependency Parses from Phrase Structure Parses," presented at the Proceedings of the conference on Language Resources and Evaluation (LREC), 2006.
    [22]
    A. Islam and D. Inkpen, "Second order co-occurrence PMI for determining the semantic similarity of words," in Proceedings of the International Conference on Language Resources and Evaluation (LREC), ed, 2006, pp. 1033--1038.

    Cited By

    View all
    • (2024)Multi-feature fusion stock prediction based on knowledge graphThe Electronic Library10.1108/EL-02-2023-005342:3(455-482)Online publication date: 17-Jun-2024
    • (2024)Dynamic datasets and market environments for financial reinforcement learningMachine Learning10.1007/s10994-023-06511-w113:5(2795-2839)Online publication date: 26-Feb-2024
    • (2023)Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis ResearchIEEE Transactions on Affective Computing10.1109/TAFFC.2020.303816714:1(108-132)Online publication date: 1-Jan-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    IIWAS '13: Proceedings of International Conference on Information Integration and Web-based Applications & Services
    December 2013
    753 pages
    ISBN:9781450321136
    DOI:10.1145/2539150
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • @WAS: International Organization of Information Integration and Web-based Applications and Services

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 02 December 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Sentiment Analysis
    2. Sentiment Lexicon
    3. Twitter

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    IIWAS '13

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 27 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Multi-feature fusion stock prediction based on knowledge graphThe Electronic Library10.1108/EL-02-2023-005342:3(455-482)Online publication date: 17-Jun-2024
    • (2024)Dynamic datasets and market environments for financial reinforcement learningMachine Learning10.1007/s10994-023-06511-w113:5(2795-2839)Online publication date: 26-Feb-2024
    • (2023)Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis ResearchIEEE Transactions on Affective Computing10.1109/TAFFC.2020.303816714:1(108-132)Online publication date: 1-Jan-2023
    • (2023)HSAM: Hybrid Sentiment Analysis Model for COVID-19 Contact Tracing Applications2023 IEEE World AI IoT Congress (AIIoT)10.1109/AIIoT58121.2023.10174300(0083-0090)Online publication date: 7-Jun-2023
    • (2023)Improving sentiment domain adaptation for Arabic using an unsupervised self-labeling frameworkInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10333860:3Online publication date: 1-May-2023
    • (2023)Building Domain-Specific Sentiment Lexicon Using Random Walk-Based Model on Common-Sense Semantic NetworkInternational Conference on Innovative Computing and Communications10.1007/978-981-99-3010-4_17(193-204)Online publication date: 1-Aug-2023
    • (2022)Semi-supervised learning approaches for predicting South African political sentiment for local government electionsProceedings of the 23rd Annual International Conference on Digital Government Research10.1145/3543434.3543484(129-137)Online publication date: 15-Jun-2022
    • (2022)Hybrid Onion Layered System for the Analysis of Collective Subjectivity in Social NetworksIEEE Access10.1109/ACCESS.2022.321746710(115435-115468)Online publication date: 2022
    • (2022)Systematic literature review of arabic aspect-based sentiment analysisJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2022.07.00134:9(6524-6551)Online publication date: 1-Oct-2022
    • (2022)Bidirectional LSTM-Based Sentiment Analysis of Context-Sensitive Lexicon for Imbalanced TextIntelligent System Design10.1007/978-981-19-4863-3_27(283-297)Online publication date: 28-Oct-2022
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media