Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/646402.689250guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Out-of-Vocabulary Word Modeling and Rejection for Spanish Keyword Spotting Systems

Published: 22 April 2002 Publication History

Abstract

This paper presents a combination of out-of-vocabulary (OOV) word modeling and rejection techniques in an attempt to accept utterances embedding a keyword and reject utterances with nonkeywords. The goal of this research is to develop a robust, task-independent Spanish keyword spotter and to develop a method for optimizing confidence thresholds for a particular context. To model OOV words, we employed both word and sub-word units as fillers, combined with n-gram language models. We also introduce a methodology for optimizing confidence thresholds to control the tradeoffs between acceptance, confirmation, and rejection of utterances. Our experiments are based on a Mexican Spanish auto-attendant system using the SpeechWorks recognizer release 6.5 Second Edition, in which we achieved a reduction in error of 8.9% as compared to the baseline system. Most of the error reduction is attributed to better keyword detection in utterances that contain both keywords and OOV words.

References

[1]
Lleida, J. B., Salavedra, J., Bonafonte, A., Monte, E., Martinez, A.: Out-Of-Vocabulary Word Modeling and Rejection for Keyword Spotting. In Proc. EUROSPEECH, pp. 1265-1268, 1993.
[2]
Manos, A.: A Study on Out-Of-Vocabulary Word Modeling for a Segment-Based Keyword Spotting System. Master Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, April 1996.
[3]
Bazzi, I., Glass, J.: Learning Units for Domain-Independent Out-of-Vocabulary Word Modeling. In Proc. EUROSPEECH, Aalborg, Denmark, September 2001.
[4]
Hazen, J. T., Bazzi, I.: A Comparison and Combination of Methods for OOV Detection and Word Confidence Scoring. In Proc. ICASSP, Salt Lake City, USA, May 2001.
[5]
Qing, G., Yonghong, Y., Zhiwei, L, Baosshen, Y., Quingwei, Z., Juian, L.: Keyword Spotting in Auto-Attendant System. In Proc. ICSLP, Beijing, China, October 2000.
[6]
Benitez, C. M., Rubio, A., Garcia, P., Verdejo, D. J.: Word Verification Using Confidence Measures in Speech Recognition. In Proc. ICASSP, Istanbul, Turkey, June 2000.
[7]
Jouvet, D., Bartkova, K. Mercier, G.: Hypothesis Dependent Threshold Setting for Improved Out-Of-Vocabulary Data Rejection. In Proc. ICASSP, Phoenix, Arizona, USA, March 1999.
[8]
Bouwman, G., Sturm, J., Boves, L.: Effect of OOV rates on Keyphrase Rejection Schemes. In Proc. EUROSPEECH, Aalborg, Denmark, September 2001.
[9]
Zhilong, H., Schalkwyk, J., Barnard, E., Cole, R.: Speech Recognition Using Syllable-Like Units. In Proc. ICSLP, Philadelphia, USA, October 1996.
[10]
Zue, V., Glass, J., Phillips, M., Sennef, S.: The SUMMIT Speech Recognition System: Phonological Modeling and Lexical Access. In Proc. ICASSP, pp. 49-52, 1990.
[11]
Cuayáhuitl, H.: Técnicas para Mejorar el Reconocimiento de Voz en Presencia de Habla Fuera del Vocabulario. Master Thesis, Universidad de las Américas Puebla, Cholula, Puebla, Mexico, May 2000.

Cited By

View all
  • (2018)ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-018-0125-92018:1(1-25)Online publication date: 1-Dec-2018
  • (2017)ALBAYZIN 2016 spoken term detection evaluationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-017-0119-z2017:1(1-23)Online publication date: 1-Dec-2017
  • (2016)Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluationsEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-016-0080-22016:1(1-19)Online publication date: 1-Dec-2016
  • Show More Cited By

Index Terms

  1. Out-of-Vocabulary Word Modeling and Rejection for Spanish Keyword Spotting Systems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    MICAI '02: Proceedings of the Second Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
    April 2002
    545 pages

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 22 April 2002

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 06 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-018-0125-92018:1(1-25)Online publication date: 1-Dec-2018
    • (2017)ALBAYZIN 2016 spoken term detection evaluationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-017-0119-z2017:1(1-23)Online publication date: 1-Dec-2017
    • (2016)Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluationsEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-016-0080-22016:1(1-19)Online publication date: 1-Dec-2016
    • (2012)Comparison of methods for language-dependent and language-independent query-by-example spoken term detectionACM Transactions on Information Systems10.1145/2328967.232897130:3(1-34)Online publication date: 6-Sep-2012
    • (2007)The listening roomProceedings of the 15th ACM international conference on Multimedia10.1145/1291233.1291391(681-690)Online publication date: 29-Sep-2007

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media