Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/988672.988738acmconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
Article

Newsjunkie: providing personalized newsfeeds via analysis of information novelty

Published: 17 May 2004 Publication History
  • Get Citation Alerts
  • Abstract

    We present a principled methodology for filtering news stories by formal measures of information novelty, and show how the techniques can be usedto custom-tailor news feeds based on information that a user has already reviewed. We review methods for analyzing novelty and then describe Newsjunkie, a system that personalizes news for users by identifying the novelty of stories in the context of stories they have already reviewed. Newsjunkie employs novelty-analysis algorithms that represent articles as words and named entities. The algorithms analyze inter-andintra-document dynamics by considering how information evolves over timefrom article to article, as well as within individual articles. We review the results of a user study undertaken to gauge the value of the approachover legacy time-based review of newsfeeds, and also to compare the performance of alternate distance metrics that are used to estimate the dissimilarity between candidate new articles and sets of previously reviewed articles.

    References

    [1]
    J. Allan, V. Khandelwal, and R. Gupta. Temporal summaries of news topics. In Proceedings of the 24th International Conference on Research and Development in Information Retrieval, pages 10--18, 2001.
    [2]
    C. Burgess, K. Livesay, and L. Kevin. Explorations in context space: Words, sentences, discourse. Discourse Processes, 25(2&3):211--258, 1998.
    [3]
    J. Carbonell and J. Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st International Conference on Research and Development in Information Retrieval, pages 335--336, 1998.
    [4]
    K. Collins-Thompson, P. Ogilvie, Y. Zhang, and J. Callan. Information filtering, novelty detection, and named-page finding. In Proceedings of the 11th Text REtrieval Conference. National Institute of Standards and Technology, 2002.
    [5]
    T. Cover and J. Thomas. Elements of Information Theory. John Wiley & Sons, 1991.
    [6]
    S. Cronen-Townsend and W. B. Croft. Quantifying query ambiguity. In Proceedings of the Human Language Technology Conference (HLT-2002), pages 94--98, March 2002.
    [7]
    S. Cronen-Townsend, Y. Zhou, and W. B. Croft. Predicting query performance. In Proceedings of the 25th International Conference on Research and Development in Information Retrieval, pages 299--306, August 2002.
    [8]
    S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman. Indexing by latent semantic analysis. Journal of the American Society For Information Science, 41:391--407, 1990.
    [9]
    F. Douglis, T. Ball, Y.-F. Chen, and E. Koutsofios. The AT&T internet difference engine: Tracking and viewing changes on the web. World Wide Web, pages 27--44, January 1998.
    [10]
    C. Fellbaum, editor. WordNet: An Electronic Lexical Database. MIT Press, 1998.
    [11]
    L. Finkelstein, E. Gabrilovich, Y. Matias, E. Rivlin, Z. Solan, G. Wolfman, and E. Ruppin. Placing search in context: The concept revisited. ACM Transactions on Information Systems, 20(1):116--131, January 2002.
    [12]
    R. Gonzalez and P. Wintz. Digital Image Processing. Addison-Wesley, second edition, 1987.
    [13]
    D. Harman. Overview of the TREC 2002 novelty track. In Proceedings of the 11th Text REtrieval Conference, pages 46--55. ACM Press, 2002. NIST Special Publication 500-251.
    [14]
    E. Horvitz, C. Kadie, T. Paek, and D. Hovel. Models of attention in computing and communication: From principles to applications. Communications of the ACM, 46(3):52--59, March 2003. http://research.microsoft.com/~horvitz/cacm-attention.htm.
    [15]
    A. Kilgarriff. Comparing corpora. International Journal of Corpus Linguistics, 6(1):97--133, 2001.
    [16]
    J. Kleinberg. Bursty and hierarchical structure in streams. In Proceedings of the 8th International Conference on Knowledge Discovery and Data Mining, 2002.
    [17]
    L. Lee. Measures of distributional similarity. In Proceedings of the 37th Annual Meeting of the ACL, 1999.
    [18]
    C. D. Manning and H. Schuetze. Foundations of Statistical Natural Language Processing. The MIT Press, 2000.
    [19]
    J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.
    [20]
    P. Resnik. Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11:95--130, 1999.
    [21]
    E. Ristad. A natural law of succession. Technical Report Technical Report CS-TR-495-95, Princeton University, 1995.
    [22]
    G. Salton and C. Buckley. Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513--523, 1988.
    [23]
    B. Schiffman, A. Nenkova, and K. McKeown. Experiments in multidocument summarization. In Proceedings of the Human Language Technology Conference (HLT-2002), March 2002.
    [24]
    R. Swan and D. Jensen. Timemines: Constructing timelines with statistical models of word usage. In Proceedings of the ACM SIGKDD 2000 Workshop on Text Mining, pages 73--80, 2000.
    [25]
    F. Wilcoxon. Individual comparisons by ranking methods. Biometrics, 1:80--83, 1945.
    [26]
    Y. Yang, T. Ault, and T. Pierce. Combining multiple learning strategies for effective cross-validation. In Proceedings of ICML-00, 17th International Conference on Machine Learning, pages 1167--1182, 2000.
    [27]
    Y. Yang, J. Zhang, J. Carbonell, and C. Jin. Topic-conditioned novelty detection. In Proceedings of the Internaltional Conference on Knowledge Discovery and Data Mining, pages 688--693, 2002.

    Cited By

    View all
    • (2024)Where Are the Values? A Systematic Literature Review on News Recommender SystemsACM Transactions on Recommender Systems10.1145/36548052:3(1-40)Online publication date: 28-Mar-2024
    • (2024)Authors' Values and Attitudes Towards AI-bridged Scalable Personalization of Creative Language ArtsProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642529(1-16)Online publication date: 11-May-2024
    • (2024)NRMG: News Recommendation With Multiview Graph Convolutional NetworksIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.326652011:2(2245-2255)Online publication date: Apr-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '04: Proceedings of the 13th international conference on World Wide Web
    May 2004
    754 pages
    ISBN:158113844X
    DOI:10.1145/988672
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 May 2004

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. news
    2. novelty detection
    3. personalization

    Qualifiers

    • Article

    Conference

    WWW04
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)2
    Reflects downloads up to

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Where Are the Values? A Systematic Literature Review on News Recommender SystemsACM Transactions on Recommender Systems10.1145/36548052:3(1-40)Online publication date: 28-Mar-2024
    • (2024)Authors' Values and Attitudes Towards AI-bridged Scalable Personalization of Creative Language ArtsProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642529(1-16)Online publication date: 11-May-2024
    • (2024)NRMG: News Recommendation With Multiview Graph Convolutional NetworksIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.326652011:2(2245-2255)Online publication date: Apr-2024
    • (2024)SciND: a new triplet-based dataset for scientific novelty detection via knowledge graphsInternational Journal on Digital Libraries10.1007/s00799-023-00386-xOnline publication date: 8-Jan-2024
    • (2023)Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord QuestionsProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581569(1-21)Online publication date: 19-Apr-2023
    • (2023)Personalized News Recommendation: Methods and ChallengesACM Transactions on Information Systems10.1145/353025741:1(1-50)Online publication date: 10-Jan-2023
    • (2023)A Survey of Personalized News RecommendationData Science and Engineering10.1007/s41019-023-00228-58:4(396-416)Online publication date: 2-Sep-2023
    • (2023)Predicting document novelty: an unsupervised learning approachKnowledge and Information Systems10.1007/s10115-023-01989-166:3(1709-1728)Online publication date: 12-Oct-2023
    • (2022)Novelty Detection: A Perspective from Natural Language ProcessingComputational Linguistics10.1162/coli_a_0042948:1(77-117)Online publication date: 4-Apr-2022
    • (2022)A Framework for Exploring Computational Models of Novelty in Unstructured TextProceedings of the 6th International Conference on Information System and Data Mining10.1145/3546157.3546164(36-45)Online publication date: 27-May-2022
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media