Accessing the spoken word

Goldman, Jerry; Renals, Steve; Bird, Steven; de Jong, Franciska; Federico, Marcello; Fleischhauer, Carl; Kornbluh, Mark; Lamel, Lori; Oard, Douglas W.; Stewart, Claire; Wright, Richard

doi:10.1007/s00799-004-0101-0

Regular contribution
Published: 01 August 2005

Volume 5, pages 287–298, (2005)
International Journal on Digital Libraries

Jerry Goldman¹,
Steve Renals²,
Steven Bird³,⁴,
Franciska de Jong⁵,
Marcello Federico⁶,
Carl Fleischhauer⁷,
Mark Kornbluh⁸,
Lori Lamel⁹,
Douglas W. Oard¹⁰,
Claire Stewart¹¹ &
Richard Wright¹²

Abstract

Spoken-word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental proceedings, lectures, and telephone conversations. The collection, access, and preservation of such data are stimulated by political, economic, cultural, and educational needs. This paper outlines the major issues in the field, reviews the current state of technology, examines the rapidly changing policy issues relating to privacy and copyright, and presents issues relating to the collection and preservation of spoken audio content.

Author information

Authors and Affiliations

Department of Political Science, Northwestern University, USA
Jerry Goldman
CSTR and School of Informatics, University of Edinburgh, UK
Steve Renals
LDC, University of Pennsylvania, USA
Steven Bird
Dept. of Computer Science, University of Melbourne, Australia
Steven Bird
CTIT, University of Twente, The Netherlands
Franciska de Jong
ITC-IRST, Trento, Italy
Marcello Federico
Library of Congress, USA
Carl Fleischhauer
MATRIX and Department of History, Michigan State University, USA
Mark Kornbluh
LIMSI-CNRS, Orsay, France
Lori Lamel
College of Information Studies/UMIACS, University of Maryland, USA
Douglas W. Oard
Library, Northwestern University, USA
Claire Stewart
BBC Information and Archives, UK
Richard Wright

Corresponding author

Correspondence to Jerry Goldman.

About this article

Cite this article

Goldman, J., Renals, S., Bird, S. et al. Accessing the spoken word. Int J Digit Libr 5, 287–298 (2005). https://doi.org/10.1007/s00799-004-0101-0

Issue Date: August 2005
DOI: https://doi.org/10.1007/s00799-004-0101-0
