Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

On the Relationship between Novelty and Popularity of User-Generated Content

Published: 01 September 2012 Publication History

Abstract

This work deals with the task of predicting the popularity of user-generated content. We demonstrate how the novelty of newly published content plays an important role in affecting its popularity. More specifically, we study three dimensions of novelty. The first one, termed contemporaneous novelty, models the relative novelty embedded in a new post with respect to contemporary content that was generated by others. The second type of novelty, termed self novelty, models the relative novelty with respect to the user’s own contribution history. The third type of novelty, termed discussion novelty, relates to the novelty of the comments associated by readers with respect to the post content. We demonstrate the contribution of the new novelty measures to estimating blog-post popularity by predicting the number of comments expected for a fresh post. We further demonstrate how novelty based measures can be utilized for predicting the citation volume of academic papers.

References

[1]
Agarwal, N., Liu, H., Tang, L., and Yu, P. S. 2008. Identifying the inuential bloggers in a community. In Proceedings of the International Conference on Web Search and Web Data Mining (WSDM’08). ACM, New York, NY, 207--218.
[2]
Agichtein, E., Castillo, C., Donato, D., Gionis, A., and Mishne, G. 2008. Finding high-quality content in social media. In Proceedings of the International Conference on Web Search and Web Data Mining (WSDM’08). ACM, New York, NY, 183--194.
[3]
Allan, J., Wade, C., and Bolivar, A. 2003. Retrieval and novelty detection at the sentence level. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’03). ACM, New York, NY, 314--321.
[4]
Amitay, E., Yogev, S., and Yom-Tov, E. 2007. Serial sharers: Detecting split identities of web authors. In Proceedings of the SIGIR’07 Workshop on Plagiarism Analysis, Authorship Identification, and Near-Duplicate Detection.
[5]
Benedetto, D., Caglioti, E., and Loreto, V. 2002. Language trees and zipping. Phys. Rev. Lett. 88, 4.
[6]
Bischoff, K., Firan, C. S., Georgescu, M., Nejdl, W., and Paiu, R. 2009. Social knowledge-driven music hit prediction. In Proceedings of the 5th International Conference on Advanced Data Mining and Applications (ADMA’09). Springer, 43--54.
[7]
Carmel, D., Roitman, H., and Yom-Tov, E. 2010. On the relationship between novelty and popularity of user-generated content. In Proceedings of the 19th ACM International Conference on Information and Knowledge (CIKM’10).
[8]
Castillo, C., Donato, D., and Gionis, A. 2007. Estimating number of citations using author reputation. In Proceedings of the 14th International Conference on String Processing and Information Retrieval (SPIRE’07). Springer, 107--117.
[9]
Cha, M., Kwak, H., Rodriguez, P., Ahn, Y.-Y., and Moon, S. 2007. I tube, you tube, everybody tubes: Analyzing the world’s largest user generated content video system. In Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement (IMC’07). ACM, New York, NY, 1--14.
[10]
Chai, K., Potdar, V., and Dillon, T. 2009. Content quality assessment related frameworks for social media. In Proceedings of the International Conference on Computational Science and Its Applications (ICCSA’09). Springer, 791--805.
[11]
Cilibrasi, R. and Vitányi, P. M. B. 2005. Clustering by compression. IEEE Trans. Inf. Theory 51, 1523--1545.
[12]
Cilibrasi, R., Vitányi, P., and De Wolf, R. 2004. Algorithmic clustering of music based on string compression. Comput. Music J. 28, 4, 49--67.
[13]
De Choudhury, M., Sundaram, H., John, A., and Seligmann, D. D. 2009. What makes conversations interesting? Themes, participants and consequences of conversations in online social media. In Proceedings of the 18th International Conference on World Wide Web (WWW’09). ACM, New York, NY, 331--340.
[14]
Dietz, L., Bickel, S., and Scheffer, T. 2007. Unsupervised prediction of citation inuences. In Proceedings of the 24th International Conference on Machine Learning (ICML’07). ACM, New York, NY, 233--240.
[15]
Duda, R., Hart, P., and Stork, D. 2001. Pattern Classification. John Wiley and Sons, Inc, New York, NY.
[16]
Gehrke, J., Ginsparg, P., and Kleinberg, J. M. 2003. Overview of the 2003 KDD Cup. SIGKDD Explor. 5, 2, 149--151.
[17]
Gruhl, D., Guha, R., Liben-Nowell, D., and Tomkins, A. 2004. Information diffusion through blogspace. In Proceedings of the 13th International Conference on World Wide Web (WWW’04). ACM, New York, NY, 491--501.
[18]
Hasan Dalip, D., André Gonçalves, M., Cristo, M., and Calado, P. 2009. Automatic quality assessment of content created collaboratively by web communities: A case study of Wikipedia. In Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital libraries (JCDL’09). ACM, New York, NY, 295--304.
[19]
Hsu, C.-F., Khabiri, E., and Caverlee, J. 2009. Ranking comments on the social web. In Proceedings of the International Conference on Computational Science and Engineering. Vol. 4, 90--97.
[20]
Huh, J., Jones, L., Erickson, T., Kellogg, W. A., Bellamy, R. K. E., and Thomas, J. C. 2007. Blogcentral: The role of internal blogs at work. In CHI’07 Extended Abstracts on Human Factors in Computing Systems. ACM, New York, NY, 2447--2452.
[21]
Kempe, D., Kleinberg, J., and Tardos, E. 2003. Maximizing the spread of inuence through a social network. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’03). ACM, New York, NY, 137--146.
[22]
Khabiri, E., Hsu, C.-F., and Caverlee, J. 2009. Analyzing and predicting community preference of socially generated metadata: A case study on comments in the digg community. In Proceedings of the AAAI International Conference on Weblogs and Social Media (ICWSM’09).
[23]
Kumar, R., Novak, J., Raghavan, P., and Tomkins, A. 2005. On the bursty evolution of blogspace. World Wide Web 8, 2, 159--178.
[24]
Lerman, K. and Hogg, T. 2010. Using a model of social dynamics to predict popularity of news. In Proceedings of the 19th International Conference on World Wide Web (WWW’10). ACM, New York, NY, 621--630.
[25]
Mishne, G. and Glance, N. 2006. Leave a reply: An analysis of weblog comments. In Proceedings of the 3rd Annual Workshop on the Weblogging Ecosystem (WWW’06).
[26]
Mishne, G., Carmel, D., and Lempel, R. 2005. Blocking blog spam with language model disagreement. In Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web (AIRWeb).
[27]
Nardi, B. A., Schiano, D. J., Gumbrecht, M., and Swartz, L. 2004. Why we blog. Comm. ACM 47, 12, 41--46.
[28]
Roitman, H., Carmel, D., and Yom-Tov, E. 2008. Maintaining dynamic channel profiles on the web. Proc. VLDB Endow. 1, 151--162.
[29]
Salomon, D. 2004. Data Compression: The Complete Reference. Springer.
[30]
Soboroff, I. and Harman, D. 2005. Novelty detection: The TREC experience. In Proceedings of the Human Language Technology Conference (HLT’05). Association for Computational Linguistics, Morristown, NJ, 105--112.
[31]
Song, X., Chi, Y., Hino, K., and Tseng, B. 2007. Identifying opinion leaders in the blogosphere. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM’07). ACM, New York, NY, 971--974.
[32]
Szabo, G. and Huberman, B. A. 2010. Predicting the popularity of online content. Comm. ACM 53, 80--88.
[33]
Tsagkias, M., Weerkamp, W., and de Rijke, M. 2009. Predicting the volume of comments on online news stories. In Proceeding of the 18th ACM Conference on Information and Knowledge Management (CIKM’09). ACM, New York, NY, 1765--1768.

Cited By

View all
  • (2025)Quickly Finding the Semantically Optimal Presentation Order for a Set of Text ArtifactsInformation10.3390/info1601005916:1(59)Online publication date: 16-Jan-2025
  • (2025)What does leveling up bring? A study on the ex post incentives of the hierarchical privilege levels in online brand communitiesComputers in Human Behavior10.1016/j.chb.2025.108557165(108557)Online publication date: Apr-2025
  • (2024)Boosting Platform Engagement through Diversity: An Algorithmic Feedback ApproachSSRN Electronic Journal10.2139/ssrn.4769961Online publication date: 2024
  • Show More Cited By

Index Terms

  1. On the Relationship between Novelty and Popularity of User-Generated Content

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Intelligent Systems and Technology
    ACM Transactions on Intelligent Systems and Technology  Volume 3, Issue 4
    September 2012
    410 pages
    ISSN:2157-6904
    EISSN:2157-6912
    DOI:10.1145/2337542
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 September 2012
    Accepted: 01 May 2011
    Revised: 01 March 2011
    Received: 01 November 2010
    Published in TIST Volume 3, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Popularity
    2. novelty
    3. user-generated content

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)66
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 06 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Quickly Finding the Semantically Optimal Presentation Order for a Set of Text ArtifactsInformation10.3390/info1601005916:1(59)Online publication date: 16-Jan-2025
    • (2025)What does leveling up bring? A study on the ex post incentives of the hierarchical privilege levels in online brand communitiesComputers in Human Behavior10.1016/j.chb.2025.108557165(108557)Online publication date: Apr-2025
    • (2024)Boosting Platform Engagement through Diversity: An Algorithmic Feedback ApproachSSRN Electronic Journal10.2139/ssrn.4769961Online publication date: 2024
    • (2024)Generative AI and Creator Economy: Investigating the Effects of AI-Generated Voice on Online Video CreationSSRN Electronic Journal10.2139/ssrn.4676705Online publication date: 2024
    • (2024)Who Makes Popular Content? Information Cues from Content Creators for Users’ game Choice: Focusing on User-Created Content Platform “Roblox”Entertainment Computing10.1016/j.entcom.2024.10069750(100697)Online publication date: May-2024
    • (2024)From whales to minnows: The impact of crypto-reward fairness on user engagement in social mediaDecision Support Systems10.1016/j.dss.2024.114289(114289)Online publication date: Jul-2024
    • (2022)Building Stronger Brand Evangelism for Sustainable Marketing through Micro-Influencer-Generated Content on Instagram in the Fashion IndustrySustainability10.3390/su14231577014:23(15770)Online publication date: 27-Nov-2022
    • (2022)The Importance of Social Influencer-Generated Contents for User Cognition and Emotional Attachment: An Information Relevance PerspectiveSustainability10.3390/su1411667614:11(6676)Online publication date: 30-May-2022
    • (2022)Formation mechanism of contributors’ self-identity based on social identity in online knowledge communitiesFrontiers in Psychology10.3389/fpsyg.2022.104652513Online publication date: 15-Dec-2022
    • (2022)How Do Peer Awards Motivate Creative Content? Experimental Evidence from RedditManagement Science10.1287/mnsc.2021.404068:5(3488-3506)Online publication date: May-2022
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media