research-article

Improving Throat Microphone Speech Recognition by Joint Analysis of Throat and Acoustic Microphone Recordings

Published: 01 September 2009

Abstract

We present a new framework for joint analysis of throat and acoustic microphone (TAM) recordings to improve throat-microphone-only speech recognition. The proposed analysis framework aims to learn joint sub-phone patterns of throat and acoustic microphone recordings through a parallel-branch HMM structure. The joint sub-phone patterns define temporally correlated neighborhoods, in which a linear prediction filter estimates a spectrally rich acoustic feature vector from throat feature vectors. Multimodal speech recognition with throat and throat-driven acoustic features significantly improves throat-only speech recognition performance. Experimental evaluations on a parallel TAM database yield benchmark phoneme recognition rates for throat-only and multimodal TAM speech recognition systems of 46.81% and 60.69%, respectively. The proposed throat-driven multimodal speech recognition system improves the phoneme recognition rate to 52.58%, a significant relative improvement with respect to the throat-only speech recognition benchmark system.
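The abstract reports three phoneme recognition rates but does not state the size of the relative improvement. The short sketch below works it out from those three figures only. The function names and the "gap closed" measure are illustrative choices made here, not quantities defined in the paper.

```python
# Phoneme recognition rates (percent) as reported in the abstract.
THROAT_ONLY = 46.81      # throat-only benchmark
MULTIMODAL_TAM = 60.69   # multimodal TAM benchmark (throat + real acoustic features)
THROAT_DRIVEN = 52.58    # proposed throat-driven multimodal system


def relative_improvement(baseline: float, improved: float) -> float:
    """Improvement over the baseline, as a fraction of the baseline."""
    return (improved - baseline) / baseline


def gap_closed(baseline: float, improved: float, upper: float) -> float:
    """Fraction of the baseline-to-upper gap recovered (illustrative measure)."""
    return (improved - baseline) / (upper - baseline)


absolute_gain = THROAT_DRIVEN - THROAT_ONLY
rel = relative_improvement(THROAT_ONLY, THROAT_DRIVEN)
gap = gap_closed(THROAT_ONLY, THROAT_DRIVEN, MULTIMODAL_TAM)

print(f"absolute gain: {absolute_gain:.2f} points")
print(f"relative improvement: {rel * 100:.2f}%")
print(f"share of gap to multimodal TAM benchmark: {gap * 100:.2f}%")
```

Under these definitions, the proposed system gains 5.77 percentage points over the throat-only benchmark, which is roughly a 12.3% relative improvement and about 41.6% of the distance to the multimodal TAM benchmark.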

Cited By

  • (2016) Source and filter estimation for throat-microphone speech enhancement. IEEE/ACM Transactions on Audio, Speech and Language Processing 24:2 (265-275). DOI: 10.1109/TASLP.2015.2499040. Online publication date: 1-Feb-2016
  • (2010) Adding voice to whisper using a simple heuristic algorithm inferred from empirical observation. Proceedings of the 12th international conference on Computers helping people with special needs: Part I (613-620). DOI: 10.5555/1886667.1886780. Online publication date: 14-Jul-2010
      Published In

      IEEE Transactions on Audio, Speech, and Language Processing  Volume 17, Issue 7
      September 2009
      192 pages

      Publisher

      IEEE Press

      Author Tags

      1. Joint processing of throat and acoustic microphone (TAM) recordings
      2. robust speech recognition
      3. throat microphone speech recognition

      Qualifiers

      • Research-article


