Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2124295.2124366acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Find me opinion sources in blogosphere: a unified framework for opinionated blog feed retrieval

Published: 08 February 2012 Publication History
  • Get Citation Alerts
  • Abstract

    This paper aims to find blog feeds having a principal inclination towards making opinionated comments on the given topic, so that we can subscribe to them to track influential and interesting opinions in the blogosphere. One major challenge is assigning topic-related opinion scores to blog feeds, which is embodied in two aspects. Firstly, we should identify whether the blog feed has a principal on-topic opinionated inclination. This inclination should be collectively revealed by all posts of the feed. We should fully consider evidences from all the posts of the feed to identify salient information among many posts of the feed. Secondly, we should capture topic-related opinions in the blog feed while ignoring irrelevant opinions.
    In this paper, we propose a unified framework for opinionated blog feed retrieval, which combines topic relevance and opinion scores with a generative model. Furthermore, we propose a language modeling approach to estimating opinion scores that is seamlessly integrated into the framework, where two language models, Topic-specific Opinion Model (TOM) and Topic-biased Feed Model (TFM), work collectively to reflect whether the blog feed shows a principal on-topic opinionated inclination. To estimate TFM, we propose a topic-biased random walk to exploit both content and structural information to capture topic-biased salient information in the feed. As for TOM estimation, we propose to use a generative mixture model with prior guidance to effectively capture topic-specific opinion expressing language usage. The conducted experiments in the context of the TREC 2009-2010 Blog Track show the effectiveness of our proposed approaches.

    References

    [1]
    Balog, K., Rijke, M., and Weerkamp, W. 2008. Bloggers as Experts Feed Distillation using Expert Retrieval Models. In Proceedings of SIGIR 2008.
    [2]
    Elsas, J. L., Arguello, J., Callan, J., and Carbonell, G. J. 2008. Retrieval and feedback models for blog feed search. In Proceedings of SIGIR '08.
    [3]
    Esuli, A., and Sebastiani, F.2005. Determining the semantic orientation of terms through gloss classification. In Proceedings of CIKM 2005.
    [4]
    Gerani, S., Carman, M. J., and Crestani, F. 2010. Proximity-based opinion retrieval. In Proceeding of SIGIR '10.
    [5]
    He, B., Macdonald, C., He, J., and Ounis, I. 2008. An effective statistical approach to blog post opinion retrieval. In Proceeding of CIKM '08.
    [6]
    Huang, X., and Croft, W. B. 2009. A unified relevance model for opinion retrieval. In Proceedings of CIKM 2009, pages 947--956.
    [7]
    Jia, L., Yu, C. 2010. UIC at TREC 2010 Faceted Blog Distillation. In Proceedings of TREC 2010.
    [8]
    Jiang, P., Zhang, C., Yang, Q., and Niu, Z. 2010. Blog Opinion Retrieval Based on Topic-Opinion Mixture Model. In Proceedings of PAKDD 2010.
    [9]
    Keikha, M., Mahdabi, P., Gerani, S., Inches, G., Carman, M., Crestani, F., and Parapar, J. 2010. University of Lugano at TREC 2010. In Proceedings of TREC 2010.
    [10]
    Li, S., Gao, H., Sun, H., Chen, F., Feng, O., Gao, S., Zhang, H., Li, X., Tan, C., Xu, W., Chen, G., and Guo, J. 2009. A Study of Faceted Blog Distillation - PRIS at TREC 2009 Blog Track. In Proceedings of TREC 2009.
    [11]
    Macdonald, C., Ounis, I., and Soboroff, I. 2007. Overview of the TREC 2007 Blog track. In Proceedings of TREC 2007.
    [12]
    Macdonald, C., and Ounis, I. 2008. Key blog distillation: ranking aggregates. In Proceedings CIKM 2008.
    [13]
    Macdonald, C., Ounis, I., and Soboroff, I. 2009. Overview of the TREC-2009 Blog Track. In Proceedings of TREC 2009.
    [14]
    Macdonald, C., Santos, R. L. T., Ounis, I., and Soboroff, I. 2010. Blog track research at TREC. SIGIR Forum 44, 58--75.
    [15]
    McCreadie, R., Macdonald, C., Ounis, I., Peng, J., and Santos, R. 2009. University of Glasgow at TREC 2009: Experiments with Terrier. In Proceedings of TREC 2009.
    [16]
    Mei, Q., Ling, X., Wondra, M., Su, H., and Zhai, C. 2007. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of WWW '07, 171--180.
    [17]
    Na, S.-H., Lee, Y., Nam, S.-H., and Lee, J.-H. 2009. Improving opinion retrieval based on query-specific sentiment lexicon. In Proceedings of ECIR 2009, pages 734--738.
    [18]
    Ounis, I., Macdonald, C., Rijke, M. de., Mishne, G., and Soboroff, I. 2006. Overview of the TREC 2006 Blog track. In Proceedings of the 15th Text REtrieval Conference.
    [19]
    Ounis, I., Macdonald, C., and Soboroff, I. 2008. Overview of the TREC-2008 Blog Track. In Proceedings of TREC'08.
    [20]
    Ounis, I., Macdonald, C., and Soboroff, I. 2010. Overview of the TREC-2010 Blog Track (Preliminary). In Proceedings of TREC 2010.
    [21]
    Sanderson, J. 2008. The Blog is Serving Its Purpose: Self-Presentation Strategies on 38pitches.com. Journal of Computer-Mediated Communication. Jul 2008, Vol. 13, No. 4: 912--936.
    [22]
    Santos, R. L. T., He, B., Macdonald, C., and Ounis, I. 2009. Integrating proximity to subjective sentences for blog opinion retrieval. In Proceedings of ECIR 2009, pages 325--336.
    [23]
    Seki, K., and Uehara, K. 2009. Adaptive subjective triggers for opinionated document retrieval. In Proceedings WSDM '09, 25--33.
    [24]
    Song, L., Cheng, X., Guo, Y., Liu, L., and Ding, G. 2009. ContentEx: A Framework for Automatic Content Extraction Programs. In Proceedings of ISI'2009.
    [25]
    Vechtomova, O. 2010. Facet-based opinion retrieval from blogs. Information Processing and Management, 46(1):71--88.
    [26]
    Wei, F., Li, W., Lu, Q., and He, Y. 2008. Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization. In Proceedings of SIGIR '08.
    [27]
    Xu, X., Meng, T., Cheng, X., and Liu, Y. 2011. A probabilistic model for opinionated blog feed retrieval. In Proceedings of the 20th international conference companion on World wide web (WWW '11).
    [28]
    Zhai C., and Lafferty, J. 2001. Model-based feedback in the language modeling approach to information retrieval. In Proceedings of CIKM 2001.
    [29]
    Zhai, C., and Lafferty, J. 2004. A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems, Vol. 22, No. 2, 179--214.
    [30]
    Zhai, C., Velivelli, A., and Yu, B. 2004. A Cross-Collection Mixture Model for Comparative Text Mining. In Proceedings of KDD 2004.
    [31]
    Zhang, W., Yu, C., and Meng, W. 2007. Opinion retrieval from blogs. In Proceedings of CIKM 2007, pages 831--840.
    [32]
    Zhang, M., and Ye, X. 2008. A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval. In Proceedings of SIGIR 2008, pages 411--418.
    [33]
    Zhou, Z., Zhang, X.,Vines, P. 2010. RMIT at TREC 2010 Blog Track: Faceted Blog Distillation Task. Online Proceedings of TREC 2010.

    Cited By

    View all
    • (2014)SORMProceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 0110.1109/WI-IAT.2014.19(78-85)Online publication date: 11-Aug-2014

    Index Terms

    1. Find me opinion sources in blogosphere: a unified framework for opinionated blog feed retrieval

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WSDM '12: Proceedings of the fifth ACM international conference on Web search and data mining
      February 2012
      792 pages
      ISBN:9781450307475
      DOI:10.1145/2124295
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 February 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. mixture model
      2. opinionated blog feed retrieval
      3. topic-biased random walk
      4. topic-related opinionatedness

      Qualifiers

      • Research-article

      Conference

      Acceptance Rates

      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 11 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2014)SORMProceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 0110.1109/WI-IAT.2014.19(78-85)Online publication date: 11-Aug-2014

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media