Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2911451.2914761acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

First Story Detection using Multiple Nearest Neighbors

Published: 07 July 2016 Publication History
  • Get Citation Alerts
  • Abstract

    First Story Detection (FSD) systems aim to identify those news articles that discuss an event that was not reported before. Recent work on FSD has focussed almost exclusively on efficiently detecting documents that are dissimilar from their nearest neighbor. We propose a novel FSD approach that is more effective, by adapting a recently proposed method for news summarization based on 3-nearest neighbor clustering. We show that this approach is more effective than a baseline that uses dissimilarity of an individual document from its nearest neighbor.

    References

    [1]
    J. Allan, V. Lavrenko, D. Malin, and R. Swan. Detections, bounds, and timelines: Umass and TDT-3. In Proceedings of TDT-3 Workshop, pages 167--174, 2000.
    [2]
    J. Allan, R. Papka, and V. Lavrenko. On-line new event detection and tracking. In Proceedings of SIGIR 1998, pages 37--45. ACM, 1998.
    [3]
    M. Karkali, F. Rousseau, A. Ntoulas, and M. Vazirgiannis. Efficient online novelty detection in news streams. In WISE 2013, pages 57--71. Springer, 2013.
    [4]
    R. McCreadie, C. Macdonald, I. Ounis, M. Osborne, and S. Petrovic. Scalable distributed event detection for Twitter. In IEEE Big Data, pages 543--549. IEEE, 2013.
    [5]
    M. Osborne, S. Petrovic, R. McCreadie, C. Macdonald, and I. Ounis. Bieber no more: First story detection using twitter and wikipedia. In SIGIR 2012 TAIA Workshop, 2012.
    [6]
    R. Papka and J. Allan. Topic detection and tracking: Event clustering as a basis for first story detection. In Advances in Information Retrieval, pages 97--126. Springer, 2002.
    [7]
    S. Petrović, M. Osborne, and V. Lavrenko. Streaming first story detection with application to twitter. In Proceedings of NAACL 2010, pages 181--189. ACL, 2010.
    [8]
    S. Petrović, M. Osborne, and V. Lavrenko. Using paraphrases for improving first story detection in news and twitter. In Proceedings of NAACL 2012, pages 338--346, 2012.
    [9]
    J. B. Vuurens, A. P. de Vries, R. Blanco, and P. Mika. Hierarchy construction for news summarizations. In Proceedings of SIGIR 2015 TAIA Workshop, 2015.
    [10]
    J. B. Vuurens, A. P. de Vries, R. Blanco, and P. Mika. Online news tracking for ad-hoc information needs. In Proceedings of ICTIR 2015, pages 221--230. ACM, 2015.
    [11]
    Y. Yang, J. Zhang, J. Carbonell, and C. Jin. Topic-conditioned novelty detection. In Proceedings of SIGKDD 2002, pages 688--693. ACM, 2002.

    Cited By

    View all
    • (2022)Real-Time Detection of First Stories in Twitter Using a FastText ModelArtificial Intelligence for Data Science in Theory and Practice10.1007/978-3-030-92245-0_9(179-218)Online publication date: 2022
    • (2021)A General Framework for First Story Detection Utilizing Entities and Their RelationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.297005133:11(3482-3493)Online publication date: 1-Nov-2021
    • (2019)The importance of unexpectedness: Discovering buzzing stories in anomalous temporal graphsWeb Intelligence10.3233/WEB-19041217:3(177-198)Online publication date: 16-Aug-2019
    • Show More Cited By

    Index Terms

    1. First Story Detection using Multiple Nearest Neighbors

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
      July 2016
      1296 pages
      ISBN:9781450340694
      DOI:10.1145/2911451
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 July 2016

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tag

      1. first story detection

      Qualifiers

      • Short-paper

      Conference

      SIGIR '16
      Sponsor:

      Acceptance Rates

      SIGIR '16 Paper Acceptance Rate 62 of 341 submissions, 18%;
      Overall Acceptance Rate 792 of 3,983 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 09 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Real-Time Detection of First Stories in Twitter Using a FastText ModelArtificial Intelligence for Data Science in Theory and Practice10.1007/978-3-030-92245-0_9(179-218)Online publication date: 2022
      • (2021)A General Framework for First Story Detection Utilizing Entities and Their RelationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.297005133:11(3482-3493)Online publication date: 1-Nov-2021
      • (2019)The importance of unexpectedness: Discovering buzzing stories in anomalous temporal graphsWeb Intelligence10.3233/WEB-19041217:3(177-198)Online publication date: 16-Aug-2019
      • (2019)Stream-based live public opinion monitoring approach with adaptive probabilistic topic modelSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-018-3391-723:16(7451-7470)Online publication date: 1-Aug-2019

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media