DOI:10.1145/345508.345638

Influence of speech recognition errors on topic detection (poster session)

Published: 01 July 2000

Abstract

We investigate the effect of speech-recognition errors on a system for the unsupervised, nearly synchronous clustering of broadcast news stories, using the TDT (Topic Detection and Tracking) Corpora. Two questions are addressed: (1) Are speech recognition errors detrimental to the performance of the system? (2) Can a background collection of contemporaneous clean text improve performance? We investigate both the large-cluster and small-cluster limits.

Published In

SIGIR '00: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
July 2000
396 pages
ISBN:1581132263
DOI:10.1145/345508

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Article

Conference

SIGIR00
Sponsor:
  • Greek Com Soc
  • SIGIR
  • Athens U of Econ & Business

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)3
Reflects downloads up to 23 Dec 2024

Citations

Cited By

  • (2008) ANTS. Proceedings of the 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services, pp. 219-222. DOI:10.1109/WIAMIS.2008.15. Online publication date: 7-May-2008
  • (2007) The use of topic evolution to help users browse and find answers in news video corpus. Proceedings of the 15th ACM international conference on Multimedia, pp. 198-207. DOI:10.1145/1291233.1291278. Online publication date: 29-Sep-2007
  • (2006) News video search with fuzzy event clustering using high-level features. Proceedings of the 14th ACM international conference on Multimedia, pp. 169-172. DOI:10.1145/1180639.1180687. Online publication date: 23-Oct-2006
  • (2004) Automatic Recognition of Spontaneous Speech for Access to Multilingual Oral History Archives. IEEE Transactions on Speech and Audio Processing 12:4, pp. 420-435. DOI:10.1109/TSA.2004.828702. Online publication date: Jul-2004
  • (2002) Cross-Language Access to Recorded Speech in the MALACH Project. Proceedings of the 5th International Conference on Text, Speech and Dialogue, pp. 57-64. DOI:10.5555/647240.718642. Online publication date: 9-Sep-2002
  • (2002) Cross-Language Access to Recorded Speech in the MALACH Project. Text, Speech and Dialogue, pp. 57-64. DOI:10.1007/3-540-46154-X_8. Online publication date: 23-Aug-2002
  • (2000) Topic detection and tracking in English and Chinese. Proceedings of the fifth international workshop on Information retrieval with Asian languages, pp. 165-172. DOI:10.1145/355214.355238. Online publication date: 1-Nov-2000
