Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2983323.2983653acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

A Study of Realtime Summarization Metrics

Published: 24 October 2016 Publication History

Abstract

Unexpected news events, such as natural disasters or other human tragedies, create a large volume of dynamic text data from official news media as well as less formal social media. Automatic real-time text summarization has become an important tool for quickly transforming this overabundance of text into clear, useful information for end-users including affected individuals, crisis responders, and interested third parties. Despite the importance of real-time summarization systems, their evaluation is not well understood as classic methods for text summarization are inappropriate for real-time and streaming conditions.
The TREC 2013-2015 Temporal Summarization (TREC-TS) track was one of the first evaluation campaigns to tackle the challenges of real-time summarization evaluation, introducing new metrics, ground-truth generation methodology and dataset. In this paper, we present a study of TREC-TS track evaluation methodology, with the aim of documenting its design, analyzing its effectiveness, as well as identifying improvements and best practices for the evaluation of temporal summarization systems.

References

[1]
J. Allan, editor. Topic Detection and Tracking: Event-based Information Organization. Inf. Retrieval. 2002.
[2]
J. Allan, R. Gupta, and V. Khandelwal. Temporal summaries of new topics. In Proc of SIGIR, 2001.
[3]
O. Alonso and R. Baeza-Yates. Design and implementation of relevance assessments using crowdsourcing. In Proc. of ECIR, 2011.
[4]
G. Baruah, A. Roegiest, and M. D. Smucker. The effect of expanding relevance judgements with duplicates. In Proc. of SIGIR, 2014.
[5]
B. Carterette, E. Gabrilovich, V. Josifovski, and D. Metzler. Measuring the reusability of test collections. In Proc. of WSDM, 2010.
[6]
Y. Chen, N. J. Conroy, and V. L. Rubin. News in an online world: the need for an automatic crap detector. In Proc. of ASIS&T, 2015.
[7]
H. T. Dang and K. Owczarzak. Overview of the TAC 2008 update summarization task. In Proc. of TAC, 2008.
[8]
M. Dostert and D. Kelly. Users' stopping behaviors and estimates of recall. In Proc. of SIGIR, 2009.
[9]
Q. Guo, F. Diaz, and E. Yom-Tov. Updating users about time critical events. In Inf. Retrieval. Springer, 2013.
[10]
M. Imran, C. Castillo, F. Diaz, and S. Vieweg. Processing social media messages in mass emergency: A survey. ACM Comput. Surv., July 2015.
[11]
B. Keegan, D. Gergle, and N. Contractor. Hot off the wiki: Structures and dynamics of wikipedia's coverage of breaking news events. American Behavioral Scientist, 2013.
[12]
C.-Y. Lin. ROUGE: a Package for Automatic Evaluation of Summaries. In Proc. of ACL, 2004.
[13]
C.-Y. Lin and F. Och. Looking for a few good metrics: Rouge and its evaluation. In NTCIR Workshop, 2004.
[14]
H. P. Luhn. A business intelligence system. IBM J. Res. Dev., 1958.
[15]
R. Mccreadie, C. Macdonald, and I. Ounis. Identifying top news using crowdsourcing. Inf. Retrieval, 2013.
[16]
A. Nenkova and K. McKeown. Automatic summarization. Foundations and Trends in Information Retrieval, 2011.
[17]
P. Thomas and D. Hawking. Evaluation by comparing result sets in context. In Proc. of CIKM, 2006.
[18]
Y. Wang, G. Sherman, J. Lin, and M. Efron. Assessor differences and user preferences in tweet timeline generation. In Proc. of SIGIR, 2015.
[19]
E. Yilmaz, J. A. Aslam, and S. Robertson. A new rank correlation coefficient for information retrieval. In Proc. of SIGIR, 2008.
[20]
Y. Zhang, J. Callan, and T. Minka. Novelty and redundancy detection in adaptive filtering. In Proc. of SIGIR, 2002.

Cited By

View all
  • (2022)Review of automatic text summarization techniques & methodsJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2020.05.00634:4(1029-1046)Online publication date: Apr-2022
  • (2018)Automatic Ground Truth Expansion for Timeline EvaluationThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210034(685-694)Online publication date: 27-Jun-2018
  • (2017)A Comparison of Nuggets and Clusters for Evaluating Timeline SummariesProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3133000(67-76)Online publication date: 6-Nov-2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016
2566 pages
ISBN:9781450340731
DOI:10.1145/2983323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. metrics
  2. real-time summarization
  3. summarization
  4. summarization evaluation
  5. temporal summarization
  6. trec

Qualifiers

  • Research-article

Conference

CIKM'16
Sponsor:
CIKM'16: ACM Conference on Information and Knowledge Management
October 24 - 28, 2016
Indiana, Indianapolis, USA

Acceptance Rates

CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Review of automatic text summarization techniques & methodsJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2020.05.00634:4(1029-1046)Online publication date: Apr-2022
  • (2018)Automatic Ground Truth Expansion for Timeline EvaluationThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210034(685-694)Online publication date: 27-Jun-2018
  • (2017)A Comparison of Nuggets and Clusters for Evaluating Timeline SummariesProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3133000(67-76)Online publication date: 6-Nov-2017
  • (2017)ECIR 2017 Workshop on Exploitation of Social Media for Emergency Relief and Preparedness (SMERP 2017)ACM SIGIR Forum10.1145/3130332.313033851:1(36-41)Online publication date: 2-Aug-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media