Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3010915.3010972acmotherconferencesArticle/Chapter ViewAbstractPublication PagesozchiConference Proceedingsconference-collections
short-paper

Retrieving radio news broadcasts in Danish: accuracy and categorization of unrecognized words

Published: 29 November 2016 Publication History

Abstract

Digital archives of radio news broadcasts can possibly be made searchable by combining speech recognition with information retrieval. We explore this possibility for the retrieval of news broadcasts in Danish. An average of 84% of the words in the broadcasts was recognized. Most of the unrecognized words were compounds, names, and other words that appear of value to retrieval. Thus, the set of words describing a broadcast has to be expanded to compensate for the recognition errors. We discuss doing this by exploiting the alternative matches from the speech recognizer and by extracting words from a related corpus.

References

[1]
Alapetite, A., Andersen, H.B. and Hertzum, M. Acceptance of speech recognition by physicians: A survey of expectations, experiences, and social influence. International Journal of Human-Computer Studies 67, 1 (2009), 36--49.
[2]
Allen, J. Robust techniques for organizing and retrieving spoken documents. EURASIP Journal on Applied Signal Processing 2003, 2 (2003), 103--114.
[3]
Anick, P.G. Adapting a full-text information retrieval system to the computer troubleshooting domain. In Proc. SIGIR 1994, ACM Press, New York (1994), 349--358.
[4]
Cohen, J. A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 1 (1960), 37--46.
[5]
Datta, R., Joshi, D., Li, J. and Wang, J.Z. Image retrieval: Ideas, influences, and trends of the new age. ACM Computing Surveys 40, 2 (2008), 5:01-5:60.
[6]
Hu, W., Xie, N., Li, L., Zeng, X. and Maybank, S. A survey on visual content-based video indexing and retrieval. IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews 41, 6 (2011), 797--819.
[7]
Jadhav, A., Andrews, D., Fiksdal, A., Kumbamu, A., McCormick, J.B., Mistano, A., Nelsen, L., Ryu, E., Sheth, A., Wu, S. and Pathak, J. Comparative analysis of online health queries originating from personal computers and smart devices on a consumer health information portal. Journal of Medical Internet Research 16, 7 (2014), e160.
[8]
Jiang, D., Pei, J. and Li, H. Mining search and browse logs for web search: A survey. ACM Transactions on Intelligent Systems and Technology 4, 4 (2013), Article 57.
[9]
Larson, M. and Jones, G.J.F. Spoken content retrieval: A survey of techniques and technologies. Foundations and Trends in Information Retrieval 5, 4&5 (2011), 235--422.
[10]
Lazar, J., Feng, J.H. and Hochheiser, H. Research methods in human-computer interaction. Wiley, Chichester, UK (2010).
[11]
Leech, G., Rayson, P. and Wilson, A. Word frequencies in written and spoken English - based on the British National Corpus. Routledge, New York (2001).
[12]
Lund, H., Bogers, T., Larsen, B. and Lykke, M. CHAOS: User-driven development of a metadata scheme for radio broadcast archives. In Proc. iConference 2013 (2013), 990--994.
[13]
Munteanu, C., Baecker, R., Penn, G., Toms, E. and James, D. The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives. In Proc. CHI 2006, ACM Press, New York (2006), 493--502.
[14]
Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., Hannemann, M., Motlicek, P., Qian, Y., Schwartz, P., Silovský, J., Stemmer, G. and Veselý, K. The Kaldi speech recognition toolkit. Paper presented at IEEE Workshop on Automatic Speech Recognition and Understanding (2011).
[15]
Singhal, A. and Pereira, F. Document expansion for speech retrieval. In Proc. SIGIR 1999, ACM Press, New York (1999), 34--41.
[16]
Sparck Jones, K., Jones, G.J.F. and Young, S.J. Experiments in spoken document retrieval. Information Processing & Management 32, 4 (1996), 399--417.
[17]
Yu, D. and Deng, L. Automatic speech recognition: A deep learning approach. Springer, Heidelberg (2015).

Index Terms

  1. Retrieving radio news broadcasts in Danish: accuracy and categorization of unrecognized words

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      OzCHI '16: Proceedings of the 28th Australian Conference on Computer-Human Interaction
      November 2016
      706 pages
      ISBN:9781450346184
      DOI:10.1145/3010915
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      • IEEE-SMCS: Systems, Man & Cybernetics Society
      • Australian Comp Soc: Australian Computer Society
      • Data61: Data61, CSIRO
      • ICACHI: International Chinese Association of Computer Human Interaction
      • Infoxchange: Infoxchange
      • HITLab AU: Human Interface Technology Laboratory Australia
      • James Boag: James Boag
      • Tourism Tasmania: Tourism Tasmania
      • HFESA: Human Factors and Ergonomics Society of Australia Inc.
      • IEEEVIC: IEEE Victorian Section
      • UTAS: University of Tasmania, Australia

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 November 2016

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. audio archives
      2. information retrieval
      3. speech recognition

      Qualifiers

      • Short-paper

      Funding Sources

      • Innovation Fund Denmark

      Conference

      OzCHI '16
      Sponsor:
      • IEEE-SMCS
      • Australian Comp Soc
      • Data61
      • ICACHI
      • Infoxchange
      • HITLab AU
      • James Boag
      • Tourism Tasmania
      • HFESA
      • IEEEVIC
      • UTAS
      OzCHI '16: The 28th Australian Conference on Human-Computer Interaction
      November 29 - December 2, 2016
      Tasmania, Launceston, Australia

      Acceptance Rates

      Overall Acceptance Rate 362 of 729 submissions, 50%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 40
        Total Downloads
      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Jan 2025

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media