default search action
Douglas D. O'Shaughnessy
Person information
- affiliation: INRS-EMT, Montreal, Canada
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j56]Douglas D. O'Shaughnessy:
Trends and developments in automatic speech recognition research. Comput. Speech Lang. 83: 101538 (2024) - [j55]Douglas D. O'Shaughnessy:
Review of Methods for Automatic Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1776-1789 (2024) - [j54]Douglas D. O'Shaughnessy:
Speech Enhancement - A Review of Modern Methods. IEEE Trans. Hum. Mach. Syst. 54(1): 110-120 (2024) - 2023
- [j53]Douglas D. O'Shaughnessy:
Review of methods for coding of speech signals. EURASIP J. Audio Speech Music. Process. 2023(1): 8 (2023) - [j52]Douglas D. O'Shaughnessy:
Review of analysis methods for speech applications. Speech Commun. 151: 64-75 (2023) - 2022
- [c202]Anwar Tantawy, Douglas D. O'Shaughnessy:
The Effects of Model Capacity in Modelling Variability between Training and Testing Environments for Automatic Speech Recognition. AIKE 2022: 61-64 - 2021
- [j51]Anderson R. Avila, Jahangir Alam, Fabiano O. Costa Prado, Douglas D. O'Shaughnessy, Tiago H. Falk:
On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems. Comput. Speech Lang. 66: 101163 (2021) - [j50]Anderson R. Avila, Douglas D. O'Shaughnessy, Tiago H. Falk:
Automatic speaker verification from affective speech using Gaussian mixture model based estimation of neutral speech characteristics. Speech Commun. 132: 21-31 (2021) - [j49]Anderson R. Avila, Zahid Akhtar, João Felipe Santos, Douglas D. O'Shaughnessy, Tiago H. Falk:
Feature Pooling of Modulation Spectrum Features for Improved Speech Emotion Recognition in the Wild. IEEE Trans. Affect. Comput. 12(1): 177-188 (2021) - 2020
- [j48]Juan Ignacio Godino-Llorente, Douglas D. O'Shaughnessy, Tan Lee, Najim Dehak, Claudia Manfredi:
Introduction to the Issue on Automatic Assessment of Health Disorders Based on Voice, Speech, and Language Processing. IEEE J. Sel. Top. Signal Process. 14(2): 234-239 (2020) - [j47]Anderson R. Avila, Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
On the use of the i-vector speech representation for instrumental quality measurement. Qual. User Exp. 5(1) (2020) - [j46]Anderson R. Avila, Douglas D. O'Shaughnessy, Tiago H. Falk:
Non-intrusive speech quality prediction based on the blind estimation of clean speech and the i-vector framework. Qual. User Exp. 5(1) (2020)
2010 – 2019
- 2019
- [j45]Douglas D. O'Shaughnessy:
Recognition and Processing of Speech Signals Using Neural Networks. Circuits Syst. Signal Process. 38(8): 3454-3481 (2019) - [c201]Anderson R. Avila, Shruti Rajendra Kshirsagar, Abhishek Tiwari, Daniel Lafond, Douglas D. O'Shaughnessy, Tiago H. Falk:
Speech-Based Stress Classification based on Modulation Spectral Features and Convolutional Neural Networks. EUSIPCO 2019: 1-5 - [c200]Anderson R. Avila, Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Blind Channel Response Estimation for Replay Attack Detection. INTERSPEECH 2019: 2893-2897 - [c199]Anderson R. Avila, Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity. QoMEX 2019: 1-5 - 2018
- [c198]Anderson R. Avila, Md. Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition. INTERSPEECH 2018: 3663-3667 - 2017
- [c197]Anderson R. Avila, João Monteiro, Douglas D. O'Shaughnessy, Tiago H. Falk:
Speech emotion recognition on mobile devices based on modulation spectral feature pooling and deep neural networks. ISSPIT 2017: 360-365 - 2016
- [c196]Yacine Benahmed, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Evaluation of graph metrics for optimizing bin-based ontologically smoothed language models. EUSIPCO 2016: 1906-1910 - [c195]Milton Orlando Sarria-Paja, Mohammed Senoussaoui, Douglas D. O'Shaughnessy, Tiago H. Falk:
Feature mapping, score-, and feature-level fusion for improved normal and whispered speech speaker verification. ICASSP 2016: 5480-5484 - 2015
- [j44]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Unsupervised language model adaptation using LDA-based mixture models and latent semantic marginals. Comput. Speech Lang. 29(1): 20-31 (2015) - [j43]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Regularized minimum variance distortionless response-based cepstral features for robust continuous speech recognition. Speech Commun. 73: 28-46 (2015) - [c194]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Document-specific context plsa language model for speech recognition. ICASSP 2015: 5326-5330 - [c193]Rafik Djemili, Mohamed Cherif Amara Korba, Hocine Bourouba, Douglas D. O'Shaughnessy:
Boosting speaker identification performance using a frame level based algorithm. ICCSPA 2015: 1-6 - 2014
- [j42]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique. Digit. Signal Process. 29: 147-157 (2014) - [j41]Habiba Dahmani, Sid-Ahmed Selouani, Noureddine Doghmane, Douglas D. O'Shaughnessy, Mohamed Chetouani:
On the relevance of using rhythmic metrics and SVM to assess dysarthric severity. Int. J. Biom. 6(3): 248-271 (2014) - [c192]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Interpolated Dirichlet Class Language Model for Speech Recognition Incorporating Long-distance N-grams. COLING 2014: 1793-1802 - [c191]Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Robust feature extractors for continuous speech recognition. EUSIPCO 2014: 944-948 - [c190]Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Robust speech recognition using warped DFT-based cepstral features in clean and multistyle training. EUSIPCO 2014: 1791-1795 - [c189]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Novel topic n-gram count LM incorporating document-based topic distributions and n-gram counts. EUSIPCO 2014: 2310-2314 - [c188]Yacine Benahmed, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Abin-based ontological framework for low-resourcen-gram smoothing in language modelling. ICASSP 2014: 4918-4922 - [c187]Anderson R. Avila, Milton Orlando Sarria-Paja, Francisco J. Fraga, Douglas D. O'Shaughnessy, Tiago H. Falk:
Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems. INTERSPEECH 2014: 1096-1100 - [c186]Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Noise spectrum estimation using Gaussian mixture model-based speech presence probability for robust speech recognition. INTERSPEECH 2014: 2759-2763 - [c185]Md. Jahangir Alam, Yazid Attabi, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Automatic Emotion Recognition from Cochlear Implant-Like Spectrally Reduced Speech. IWAAL 2014: 332-340 - [c184]Douglas D. O'Shaughnessy, Géza Kolumbán, Roger Lecomte:
Keynote speakers: The challenges of pattern recognition for speech signals. NEWCAS 2014: 1-6 - [c183]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Document-based Dirichlet class language model for speech recognition using document-based n-gram events. SLT 2014: 42-47 - 2013
- [j40]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems. Cogn. Comput. 5(4): 533-544 (2013) - [j39]Habiba Dahmani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Mohamed Chetouani, Noureddine Doghmane:
Assessment of dysarthric speech through rhythm metrics. J. King Saud Univ. Comput. Inf. Sci. 25(1): 43-49 (2013) - [j38]Douglas D. O'Shaughnessy, Li Deng, Haizhou Li:
Speech Information Processing: Theory and Applications [Scanning the Issue]. Proc. IEEE 101(5): 1034-1037 (2013) - [j37]Douglas D. O'Shaughnessy:
Acoustic Analysis for Automatic Speech Recognition. Proc. IEEE 101(5): 1038-1053 (2013) - [j36]Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:
Multitaper MFCC and PLP features for speaker verification using i-vectors. Speech Commun. 55(2): 237-251 (2013) - [c182]Ali Jannatpour, Adam Krzyzak, Douglas D. O'Shaughnessy:
A new approach to short-time harmonic analysis of tonal audio signals using harmonic sinusoidals. CCECE 2013: 1-6 - [c181]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
PLSA enhanced with a long-distance bigram language model for speech recognition. EUSIPCO 2013: 1-5 - [c180]Jan-Niklas Antons, Khalil ur Rehman Laghari, Sebastian Arndt, Robert Schleicher, Sebastian Möller, Douglas D. O'Shaughnessy, Tiago H. Falk:
Cognitive, affective, and experience correlates of speech quality perception in complex listening conditions. ICASSP 2013: 3672-3676 - [c179]Milton Orlando Sarria-Paja, Tiago H. Falk, Douglas D. O'Shaughnessy:
Whispered speaker verification and gender detection using weighted instantaneous frequencies. ICASSP 2013: 7209-7213 - [c178]Yazid Attabi, Md. Jahangir Alam, Pierre Dumouchel, Patrick Kenny, Douglas D. O'Shaughnessy:
Multiple windowed spectral features for emotion recognition. ICASSP 2013: 7527-7531 - [c177]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Speech recognition using regularized minimum variance distortionless response spectrum estimation-based cepstral features. ICASSP 2013: 8071-8075 - [c176]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Comparison of a bigram PLSA and a novel context-based PLSA language model for speech recognition. ICASSP 2013: 8440-8444 - [c175]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition. INTERSPEECH 2013: 891-895 - [c174]Md. Jahangir Alam, Yazid Attabi, Pierre Dumouchel, Patrick Kenny, Douglas D. O'Shaughnessy:
Amplitude modulation features for emotion recognition from speech. INTERSPEECH 2013: 2420-2424 - [c173]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Fitting long-range information using interpolated distanced n-grams and cache models into a latent dirichlet language model for speech recognition. INTERSPEECH 2013: 2678-2682 - [c172]Tomi Kinnunen, Md. Jahangir Alam, Pavel Matejka, Patrick Kenny, Jan Cernocký, Douglas D. O'Shaughnessy:
Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations. INTERSPEECH 2013: 3122-3126 - [c171]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Smoothed Nonlinear Energy Operator-Based Amplitude Modulation Features for Robust Speech Recognition. NOLISP 2013: 168-175 - 2012
- [j35]Ladan Golipour, Douglas D. O'Shaughnessy:
A segmental non-parametric-based phoneme recognition approach at the acoustical level. Comput. Speech Lang. 26(4): 244-259 (2012) - [j34]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR. Int. J. Speech Technol. 15(1): 5-23 (2012) - [j33]Mouloud Djamah, Douglas D. O'Shaughnessy:
Fine granularity scalable speech coding using embedded tree-structured vector quantization. Speech Commun. 54(1): 23-39 (2012) - [c170]Yacine Benahmed, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Ontology-based pattern generator and root semantic analyser for spoken dialogue systems. CCECE 2012: 1-4 - [c169]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Robust speech recognition under noisy environments using asymmetric tapers. EUSIPCO 2012: 1638-1642 - [c168]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
LDA-based LM adaptation using latent semantic marginals and minimum discriminant information. EUSIPCO 2012: 2040-2044 - [c167]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
A highly non-stationary noise tracking and compensation algorithm, with applications to speech enhancement and on-line ASR. ICASSP 2012: 4337-4340 - [c166]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Robust Feature Extraction for Speech Recognition by Enhancing Auditory Spectrum. INTERSPEECH 2012: 1360-1363 - [c165]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
A soft computing approach to improve the robustness of on-line ASR in previously unseen highly non-stationary acoustic environments. ISSPA 2012: 522-527 - [c164]Yacine Benahmed, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Effects of discriminative training on the RACAD corpus of the French language spoken in the Canadian province of New-Brunswick. ISSPA 2012: 695-699 - [c163]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
On the use of asymmetric-shaped tapers for speaker verification using i-vectors. Odyssey 2012: 256-262 - [c162]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Topic n-gram count language model adaptation for speech recognition. SLT 2012: 165-169 - [c161]Mouloud Djamah, Douglas D. O'Shaughnessy:
Codage échelonnable à granularité fine de la parole : Application au codeur G.729 (Fine granularity scalable speech coding: Application to the G.729 coder) [in French]. JEP-TALN-RECITAL 2012 2012: 505-512 - 2011
- [j32]Md. Jahangir Alam, Douglas D. O'Shaughnessy:
Perceptual improvement of Wiener filtering employing a post-filter. Digit. Signal Process. 21(1): 54-65 (2011) - [c160]Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:
Multi-taper MFCC features for speaker verification using I-vectors. ASRU 2011: 547-552 - [c159]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Unsupervised language model adaptation using n-gram weighting. CCECE 2011: 857-860 - [c158]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Unsupervised language model adaptation using latent Dirichlet allocation and dynamic marginals. EUSIPCO 2011: 1480-1484 - [c157]Yacine Benahmed, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Amin Haji Abolhassani:
Real-life speech-enabled system to enhance interaction with rfid networks in noisy environments. ICASSP 2011: 1781-1784 - [c156]Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Blind Speech Separation in Multiple Environments Using a Frequency Oriented PCA Method for Convolutive Mixtures. INTERSPEECH 2011: 557-560 - [c155]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
A Rapid Adaptation Algorithm for Tracking Highly Non-Stationary Noises based on Bayesian Inference for On-Line Spectral Change Point Detection. INTERSPEECH 2011: 1205-1208 - [c154]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
A Study of Low-variance Multi-taper Features for Distributed Speech Recognition. NOLISP 2011: 239-245 - [c153]Md. Jahangir Alam, Pierre Ouellet, Patrick Kenny, Douglas D. O'Shaughnessy:
Comparative Evaluation of Feature Normalization Techniques for Speaker Verification. NOLISP 2011: 246-253 - [c152]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Real-Time Bayesian Inference: A Soft Computing Approach to Environmental Learning for On-Line Robust Automatic Speech Recognition. SOCO 2011: 445-452 - 2010
- [c151]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Frame recursive dynamic mean bias removal technique for robust environment-aware speech recognition in real world applications. CCECE 2010: 1-5 - [c150]Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Blind speech separation for convolutive mixtures using an oriented principal components analysis method. EUSIPCO 2010: 1553-1557 - [c149]Mouloud Djamah, Douglas D. O'Shaughnessy:
An efficient tree-structured codebook design for embedded vector quantization. ICASSP 2010: 4686-4689 - [c148]Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Oriented PCA method for blind speech separation of convolutive mixtures. INTERSPEECH 2010: 390-393 - [c147]Ladan Golipour, Douglas D. O'Shaughnessy:
Phoneme classification and lattice rescoring based on a k-NN approach. INTERSPEECH 2010: 1954-1957 - [c146]Ladan Golipour, Douglas D. O'Shaughnessy:
A segment-based non-parametric approach for monophone recognition. INTERSPEECH 2010: 2334-2337 - [c145]Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Novel weighting scheme for unsupervised language model adaptation using latent dirichlet allocation. INTERSPEECH 2010: 2438-2441 - [c144]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Text-independent distributed speaker identification and verification using GMM-UBM speaker models for mobile communications. ISSPA 2010: 57-60
2000 – 2009
- 2009
- [j31]Sid-Ahmed Selouani, Mohammed Sidi Yakoub, Douglas D. O'Shaughnessy:
Alternative Speech Communication System for Persons with Severe Speech Disorders. EURASIP J. Adv. Signal Process. 2009 (2009) - [j30]Janet M. Baker, Li Deng, James R. Glass, Sanjeev Khudanpur, Chin-Hui Lee, Nelson Morgan, Douglas D. O'Shaughnessy:
Developments and directions in speech recognition and understanding, Part 1 [DSP Education]. IEEE Signal Process. Mag. 26(3): 75-80 (2009) - [j29]Janet M. Baker, Li Deng, Sanjeev Khudanpur, Chin-Hui Lee, James R. Glass, Nelson Morgan, Douglas D. O'Shaughnessy:
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education]. IEEE Signal Process. Mag. 26(4): 78-85 (2009) - [c143]Negar Ghourchian, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Robust distributed speech recognition using two-stage Filtered Minima Controlled Recursive Averaging. ASRU 2009: 249-254 - [c142]Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
An improved perceptual speech enhancement technique employing a psychoacoustically motivated weighting factor. ASRU 2009: 266-270 - [c141]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Distributed automatic text-independent speaker identification using GMM-UBM speaker models. CCECE 2009: 372-375 - [c140]Mouloud Djamah, Douglas D. O'Shaughnessy:
Low-complexity encoding of speech lsf parameters using multistage tree-structured vector quantization: Application to the MELP coder. CCECE 2009: 376-380 - [c139]Md Foezur Rahman Chowdhury, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
A study on bias-based speech signal conditioning techniques for improving the robustness of automatic speech recognition. CCECE 2009: 664-669 - [c138]Yasmina Benabderrahmane, Abderraouf Ben Salem, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Blind speech separation using high order statistics. CCECE 2009: 670-673 - [c137]Iman Haji Abolhassani, Douglas D. O'Shaughnessy, Sid-Ahmed Selouani:
A method utilizing window function frequency characteristics for noise-robust spectral pitch estimation. EUSIPCO 2009: 2544-2548 - [c136]Negar Ghourchian, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Robust Speech Enhancement Using Two-Stage Filtered Minima Controlled Recursive Averaging. FGIT-SIP 2009: 72-81 - [c135]Yasmina Benabderrahmane, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Habib Hamam:
A Comparative Study of Blind Speech Separation Using Subspace Methods and Higher Order Statistics. FGIT-SIP 2009: 117-124 - [c134]Ladan Golipour, Douglas D. O'Shaughnessy:
Context-independent phoneme recognition using a K-Nearest Neighbour classification approach. ICASSP 2009: 1341-1344 - [c133]Iman Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
STFT-based speech enhancement by reconstructing the harmonics. INTERSPEECH 2009: 1371-1374 - [c132]Mouloud Djamah, Douglas D. O'Shaughnessy:
Fine-granular scalable MELP coder based on embedded vector quantization. INTERSPEECH 2009: 2603-2606 - [c131]Lakshmish Kaushik, Douglas D. O'Shaughnessy:
A novel method for epoch extraction from speech signals. INTERSPEECH 2009: 2883-2886 - 2008
- [j28]Yousef Ajami Alotaibi, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Experiments on Automatic Recognition of Nonnative Arabic Speech. EURASIP J. Audio Speech Music. Process. 2008 (2008) - [j27]Sid-Ahmed Selouani, Tang-Ho Lê, Yacine Benahmed, Douglas D. O'Shaughnessy:
Speech-Enabled Tools for Augmented Interaction in E-Learning Applications. Int. J. Distance Educ. Technol. 6(2): 1-20 (2008) - [j26]Douglas D. O'Shaughnessy:
Invited paper: Automatic speech recognition: History, methods and challenges. Pattern Recognit. 41(10): 2965-2979 (2008) - [c130]Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Subspace-based speech enhancement by updating noise characteristics in the presence of speech. EUSIPCO 2008: 1-5 - [c129]Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Speech enhancement based on a hybrid a priori signal-to-noise ratio (SNR) estimator and a self-adaptive Lagrange multiplier. EUSIPCO 2008: 1-5 - [c128]Raphael Steinberg, Douglas D. O'Shaughnessy:
Segmentation of a speech spectrogram using mathematical morphology. ICASSP 2008: 1637-1640 - [c127]Xiao-Bing Li, Douglas D. O'Shaughnessy:
Likelihood-based non-uniform allocation of Gaussian kernels in scalar dimension for HMM compression. ICME 2008: 753-756 - [c126]Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Sofia Ben Jebara:
Speech enhancement using a wiener denoising technique and musical noise reduction. INTERSPEECH 2008: 407-410 - [c125]Md. Jahangir Alam, Douglas D. O'Shaughnessy, Sid-Ahmed Selouani:
Speech enhancement based on novel two-step a priori SNR estimators. INTERSPEECH 2008: 565-568 - [c124]Ladan Golipour, Douglas D. O'Shaughnessy:
An intuitive class discriminability measure for feature selection in a speech recognition system. INTERSPEECH 2008: 1345-1348 - [c123]Lakshmish Kaushik, Douglas D. O'Shaughnessy:
Voice activity detection using modified Wigner-ville distribution. INTERSPEECH 2008: 2574-2577 - [c122]Xufang Zhao, Douglas D. O'Shaughnessy:
Seed models combination and state level mappings of cross-lingual transfer for rapid HMM development: from English to Mandarin. INTERSPEECH 2008: 2699-2702 - 2007
- [j25]Rangarao Muralishankar, Abhijeet Sangwan, Douglas D. O'Shaughnessy:
Theoretical Complex Cepstrum of DCT and Warped DCT Filters. IEEE Signal Process. Lett. 14(5): 367-370 (2007) - [j24]Xuechuan Wang, Douglas D. O'Shaughnessy:
Environmental Independent ASR Model Adaptation/Compensation by Bayesian Parametric Representation. IEEE Trans. Speech Audio Process. 15(4): 1204-1217 (2007) - [j23]P. Vijayalakshmi, M. Ramasubba Reddy, Douglas D. O'Shaughnessy:
Acoustic Analysis and Detection of Hypernasality Using a Group Delay Function. IEEE Trans. Biomed. Eng. 54(4): 621-629 (2007) - [c121]Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Speech enhancement using PCA and variance of the reconstruction error in distributed speech recognition. ASRU 2007: 19-23 - [c120]Huiqun Deng, Douglas D. O'Shaughnessy, Jean-Guy Dahan, William F. Ganong III:
Interpolative variable frame rate transmission of speech features for distributed speech recognition. ASRU 2007: 591-595 - [c119]T. Nagarajan, Douglas D. O'Shaughnessy:
Bias Estimation and Correction in a Classifier using Product of Likelihood-Gaussians. ICASSP (3) 2007: 1061-1064 - [c118]R. Muralishankar, H. N. Shankar, Douglas D. O'Shaughnessy:
A Performance Analysis of Features from Complex Cepstra of Warped DST, DCT and DHT Filters for Phoneme Recognition. DSP 2007: 591-594 - [c117]Huiqun Deng, Douglas D. O'Shaughnessy:
Voiced-Unvoiced-Silence Speech Sound Classification Based on Unsupervised Learning. ICME 2007: 176-179 - [c116]Hao-Zheng Li, Douglas D. O'Shaughnessy:
Frame margin probability discriminative training algorithm for noisy speech recognition. INTERSPEECH 2007: 38-41 - [c115]Huiqun Deng, Douglas D. O'Shaughnessy:
Effect of incomplete glottal closures on estimates of glottal waves via inverse filtering of vowel sounds. INTERSPEECH 2007: 546-549 - [c114]Amin Haji Abolhassani, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Mohamed Faouzi Harkat:
Speech enhancement using PCA and variance of the reconstruction error model identification. INTERSPEECH 2007: 974-977 - [c113]Xiao-Bing Li, Douglas D. O'Shaughnessy:
Clustering-based two-dimensional linear discriminant analysis for speech recognition. INTERSPEECH 2007: 1126-1129 - [c112]Xufang Zhao, Douglas D. O'Shaughnessy:
An evaluation of cross-language adaptation and native speech training for rapid HMM construction based on very limited training data. INTERSPEECH 2007: 1433-1436 - [c111]Ladan Golipour, Douglas D. O'Shaughnessy:
A new approach for phoneme segmentation of speech signals. INTERSPEECH 2007: 1933-1936 - [c110]Sid-Ahmed Selouani, Habib Hamam, Douglas D. O'Shaughnessy:
A Hybrid Genetic-Neural Front-End Extension for Robust Speech Recognition over Telephone Lines. NOLISP 2007: 169-178 - 2006
- [c109]Huiqun Deng, Rabab K. Ward, Michael P. Beddoes, Douglas D. O'Shaughnessy:
Obtaining LIP and Glottal Reflection Coefficients from Vowel Sounds. ICASSP (1) 2006: 373-376 - [c108]Gang Chen, Hesham Tolba, Douglas D. O'Shaughnessy:
Noise-robust speech recognition of conversational telephone speech. INTERSPEECH 2006 - [c107]Hao-Zheng Li, Douglas D. O'Shaughnessy:
State-level variable modeling for phoneme classification. INTERSPEECH 2006 - [c106]T. Nagarajan, Douglas D. O'Shaughnessy:
Discriminative MLE training using a product of Gaussian likelihoods. INTERSPEECH 2006 - [c105]T. Nagarajan, P. Vijayalakshmi, Douglas D. O'Shaughnessy:
Combining multiple-sized sub-word units in a speech recognition system using baseform selection. INTERSPEECH 2006 - [c104]Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Speaker adaptation using evolutionary-based linear transform. INTERSPEECH 2006 - [c103]P. Vijayalakshmi, M. Ramasubba Reddy, Douglas D. O'Shaughnessy:
Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system. INTERSPEECH 2006 - 2005
- [c102]Rangarao Muralishankar, Abhijeet Sangwan, Douglas D. O'Shaughnessy:
Warped discrete cosine transform cepstrum: A new feature for speech processing. EUSIPCO 2005: 1-4 - [c101]Weizhong Zhu, Douglas D. O'Shaughnessy:
Log-Energy Dynamic Range Normalizaton for Robust Speech Recognition. ICASSP (1) 2005: 245-248 - [c100]R. Muralishankar, Douglas D. O'Shaughnessy:
Subspace-based Speaker-independent Vowel Recognition. ICASSP (1) 2005: 549-552 - [c99]R. Muralishankar, Abhijeet Sangwan, Douglas D. O'Shaughnessy:
Statistical properties of the warped discrete cosine transform cepstrum compared with MFCC. INTERSPEECH 2005: 341-344 - [c98]T. Nagarajan, Douglas D. O'Shaughnessy:
Explicit segmentation of speech based on frequency-domain AR modeling. INTERSPEECH 2005: 653-656 - [c97]Hesham Tolba, Zili Li, Douglas D. O'Shaughnessy:
Robust automatic speech recognition using a perceptually-based optimal spectral amplitude estimator speech enhancement algorithm in various low-SNR environments. INTERSPEECH 2005: 937-940 - [c96]Vincent Barreaud, Douglas D. O'Shaughnessy, Jean-Guy Dahan:
Experiments on speaker profile portability. INTERSPEECH 2005: 997-1000 - [c95]Xuechuan Wang, Douglas D. O'Shaughnessy:
Environmental compensation using ASR model adaptation by a Bayesian parametric representation method. INTERSPEECH 2005: 1801-1804 - [c94]Gang Chen, Douglas D. O'Shaughnessy, Hesham Tolba:
A performance investigation of noisy voice recognition over IP telephony networks. INTERSPEECH 2005: 2681-2684 - [c93]Mohamed Mihoubi, Douglas D. O'Shaughnessy, Pierre Dumouchel:
Relevant information extraction for discriminative training applied to speaker identification. INTERSPEECH 2005: 3097-3100 - 2004
- [j22]Douglas D. O'Shaughnessy:
ICASSP 2004 in Montreal. IEEE Signal Process. Mag. 21(2): 120-122 (2004) - [c92]Amr H. Nour-Eldin, Hesham Tolba, Douglas D. O'Shaughnessy:
Loss recovery through spectral interpolation for robust speech recognition over packet voice communications. EUSIPCO 2004: 1465-1468 - [c91]Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Robustness of speech recognition using genetic algorithms and a Mel-cepstral subspace approach. ICASSP (1) 2004: 201-204 - [c90]Amr H. Nour-Eldin, Hesham Tolba, Douglas D. O'Shaughnessy:
Automatic recognition of Bluetooth speech in 802.11 interference and the effectiveness of insertion-based compensation techniques. ICASSP (1) 2004: 1033-1036 - [c89]Xuechuan Wang, Douglas D. O'Shaughnessy:
Noise adaptation for robust AURORA 2 noisy digit recognition using statistical data mapping. INTERSPEECH 2004: 125-128 - [c88]Zili Li, Hesham Tolba, Douglas D. O'Shaughnessy:
Robust automatic speech recognition using an optimal spectral amplitude estimator algorithm in low-SNR car environments. INTERSPEECH 2004: 2041-2044 - [c87]Mohamed Mihoubi, Douglas D. O'Shaughnessy, Pierre Dumouchel:
The use of typical sequences for robust speaker identification. INTERSPEECH 2004: 2349-2352 - [c86]Xuechuan Wang, Douglas D. O'Shaughnessy:
Robust ASR model adaptation by feature-based statistical data mapping. INTERSPEECH 2004: 2905-2908 - 2003
- [j21]Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions. EURASIP J. Adv. Signal Process. 2003(8): 814-823 (2003) - [j20]Douglas D. O'Shaughnessy:
Interacting with computers by voice: automatic speech recognition and synthesis. Proc. IEEE 91(9): 1272-1305 (2003) - [c85]Xuechuan Wang, Douglas D. O'Shaughnessy:
Improving the efficiency of automatic speech recognition by feature transformation and dimensionality reduction. INTERSPEECH 2003: 1025-1028 - [c84]Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for robust automatic speech recognition in low-SNR car environments. INTERSPEECH 2003: 3085-3088 - [c83]Sid-Ahmed Selouani, Hesham Tolba, Douglas D. O'Shaughnessy:
Auditory-based Acoustic Distinctive Features and Spectral Cues for Robust Automatic Speech Recognition in Low-SNR Car Environments. HLT-NAACL 2003 - 2002
- [c82]Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
A hybrid HMM/autoregressive Time-Delay Neural Network Automatic Speech Recognition system. EUSIPCO 2002: 1-4 - [c81]Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Auditory-based acoustic distinctive features and spectral cues for automatic speech recognition using a multi-stream paradigm. ICASSP 2002: 837-840 - [c80]Omar Halmi, Hesham Tolba, Driss Guerchi, Douglas D. O'Shaughnessy:
On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal. INTERSPEECH 2002: 1857-1860 - [c79]Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm. INTERSPEECH 2002: 2113-2116 - [c78]Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach. INTERSPEECH 2002: 2173-2176 - 2001
- [c77]Sid-Ahmed Selouani, Hesham Tolba, Douglas D. O'Shaughnessy:
Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficients. INTERSPEECH 2001: 1577-1580 - [c76]Hassan Ezzaidi, Jean Rouat, Douglas D. O'Shaughnessy:
Towards combining pitch and MFCC for speaker recognition systems. INTERSPEECH 2001: 2825-2828 - [c75]Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Hybrid architectures for complex phonetic features classification: a unified approach. ISSPA 2001: 719-722 - [c74]Hassan Ezzaidi, Jean Rouat, Douglas D. O'Shaughnessy:
Combining pitch and MFCC for speaker identification systems. Odyssey 2001: 207-212 - 2000
- [b2]Douglas D. O'Shaughnessy:
Speech communications - human and machine, 2nd Edition. IEEE 2000, ISBN 978-0-7803-3449-6, pp. I-XXV, 1-547 - [c73]Marcel Gabrea, Douglas D. O'Shaughnessy:
Speech signal recovery in white noise using an adaptive Kalman filter. EUSIPCO 2000: 1-4 - [c72]Hesham Tolba, Douglas D. O'Shaughnessy:
Towards a large-vocabulary French vocal dictation based on a size-independent language-model search using the INRS recognizer. ICASSP 2000: 1651-1654 - [c71]Xiaohu Liu, Douglas D. O'Shaughnessy:
Practical language modeling: an interpolating method. INTERSPEECH 2000: 354-357 - [c70]Douglas D. O'Shaughnessy, Marcel Gabrea:
Recognition of digit strings in noisy speech with limited resources. INTERSPEECH 2000: 554-557 - [c69]Qingsheng Zeng, Douglas D. O'Shaughnessy:
Microphone array within a handset or face mask for speech enhancement. INTERSPEECH 2000: 602-605 - [c68]Marcel Gabrea, Douglas D. O'Shaughnessy:
Detection of filled pauses in spontaneous conversational speech. INTERSPEECH 2000: 678-681 - [c67]Rachida El Méliani, Douglas D. O'Shaughnessy:
Speech recognition using error spotting. INTERSPEECH 2000: 1057-1060
1990 – 1999
- 1999
- [j19]Rivarol Vergin, Douglas D. O'Shaughnessy, Azarshid Farhat:
Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition. IEEE Trans. Speech Audio Process. 7(5): 525-532 (1999) - [c66]Rivarol Vergin, Douglas D. O'Shaughnessy:
On the use of some divergence measures in speaker recognition. ICASSP 1999: 309-312 - [c65]Douglas D. O'Shaughnessy, Hesham Tolba:
Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision. ICASSP 1999: 413-416 - [c64]Rachida El Méliani, Douglas D. O'Shaughnessy:
Error spotting using syllabic fillers in spontaneous conversational speech recognition. EUROSPEECH 1999: 279-282 - [c63]Hesham Tolba, Douglas D. O'Shaughnessy:
Towards recognizing "non-lexical" words in spontaneous conversational speech. EUROSPEECH 1999: 723-726 - [c62]Rivarol Vergin, Douglas D. O'Shaughnessy, Pierre Dumouchel:
Toward parametric representation of speech for speaker recognition systems. EUROSPEECH 1999: 795-798 - [c61]Hesham Tolba, Douglas D. O'Shaughnessy:
Combating nonlinear telephone channel-noise using the multiband AM-FM model. NSIP 1999: 168-171 - 1998
- [j18]François Yvon, Philippe Boula de Mareüil, Christophe d'Alessandro, Véronique Aubergé, Michel Bagein, Gérard Bailly, Frédéric Béchet, S. Foukia, J.-F. Goldman, Eric Keller, Douglas D. O'Shaughnessy, Vincent Pagel, Fred Sannier, Jean Véronis, Brigitte Zellner:
Objective evaluation of grapheme to phoneme conversion for text-to-speech synthesis in French. Comput. Speech Lang. 12(4): 393-410 (1998) - [c60]Rachida El Méliani, Douglas D. O'Shaughnessy:
Specific language modelling for new-word detection in continuous-speech recognition. ICASSP 1998: 321-324 - [c59]Hesham Tolba, Douglas D. O'Shaughnessy:
Automatic speech recognition based on cepstral coefficients and a mel-based discrete energy operator. ICASSP 1998: 973-976 - [c58]Michel Héon, Hesham Tolba, Douglas D. O'Shaughnessy:
Robust automatic speech recognition by the application of a temporal-correlation-based recurrent multilayer neural network to the mel-based cepstral coefficients. ICSLP 1998 - [c57]Clark Z. Lee, Douglas D. O'Shaughnessy:
A new method to achieve fast acoustic matching for speech recognition. ICSLP 1998 - [c56]Rachida El Méliani, Douglas D. O'Shaughnessy:
Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognition. ICSLP 1998 - [c55]Hesham Tolba, Douglas D. O'Shaughnessy:
Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environments. ICSLP 1998 - [c54]Hesham Tolba, Douglas D. O'Shaughnessy:
On the application of the AM-FM model for the recovery of missing frequency bands of telephone speech. ICSLP 1998 - [c53]Hesham Tolba, Douglas D. O'Shaughnessy:
Robust automatic continuous-speech recognition based on a voiced-unvoiced decision. ICSLP 1998 - [c52]Philippe Boula de Mareüil, François Yvon, Christophe d'Alessandro, V. Auberg, Michel Bagein, Gérard Bailly, Frédéric Béchet, S. Fonkia, Jean-Philippe Goldman, Eric Keller, Douglas D. O'Shaughnessy, Steve Pagel, F. Sannier, Jean Véronis, Brigitte Zellner Keller:
Evaluation of grapheme-to phoneme conversion for text-to-speech synthesis in French. LREC 1998: 641-646 - 1997
- [c51]Rachida El Méliani, Douglas D. O'Shaughnessy:
Accurate keyword spotting using strictly lexical fillers. ICASSP 1997: 907-910 - [c50]Rivarol Vergin, Douglas D. O'Shaughnessy, Azarshid Farhat:
Time domain technique for pitch modification and robust voice transformation. ICASSP 1997: 947-950 - [c49]Clark Z. Lee, Douglas D. O'Shaughnessy:
Clustering beyond phoneme contexts for speech recognition. EUROSPEECH 1997: 19-22 - [c48]Wei-Ying Li, Douglas D. O'Shaughnessy:
Hybrid networks based on RBFN and GMM for speaker recognition. EUROSPEECH 1997: 955-958 - [c47]Rivarol Vergin, Douglas D. O'Shaughnessy:
A double Gaussian mixture modeling approach to speaker recognition. EUROSPEECH 1997: 2287-2290 - [c46]Hesham Tolba, Douglas D. O'Shaughnessy:
Speech enhancement via energy separation. EUROSPEECH 1997: 2583-2586 - 1996
- [c45]Zhishun Li, Douglas D. O'Shaughnessy:
Using a transcription graph for large vocabulary continuous speech recognition. ICASSP 1996: 121-124 - [c44]Azarshid Farhat, Jean-Francois Isabelle, Douglas D. O'Shaughnessy:
Clustering words for statistical language models based on contextual word similarity. ICASSP 1996: 180-183 - [c43]Rivarol Vergin, Douglas D. O'Shaughnessy, Vishwa Gupta:
Compensated mel frequency cepstrum coefficients. ICASSP 1996: 323-326 - [c42]Zhishun Li, Michel Héon, Douglas D. O'Shaughnessy:
New developments in the INRS continuous speech recognition system. ICSLP 1996: 2-5 - [c41]Rachida El Méliani, Douglas D. O'Shaughnessy:
New efficient fillers for unlimited word recognition and keyword spotting. ICSLP 1996: 590-593 - [c40]Rivarol Vergin, Azarshid Farhat, Douglas D. O'Shaughnessy:
Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male/female classification. ICSLP 1996: 1081-1084 - 1995
- [c39]Zhishun Li, Patrick Kenny, Douglas D. O'Shaughnessy:
Searching with a transcription graph. ICASSP 1995: 564-567 - [c38]Douglas D. O'Shaughnessy:
Timing patterns in fluent and disfluent spontaneous speech. ICASSP 1995: 600-603 - [c37]Zhishun Li, Patrick Kenny, Douglas D. O'Shaughnessy:
Hybrid hidden Markov models in speech recognition. EUROSPEECH 1995: 795-798 - [c36]Pierre Dumouchel, Douglas D. O'Shaughnessy:
Segmental duration and HMM modeling. EUROSPEECH 1995: 803-806 - [c35]Azarshid Farhat, Douglas D. O'Shaughnessy:
A shared-distribution approach in a hidden Markov model-based continuous speech recognition system. EUROSPEECH 1995: 1503-1506 - [c34]Rachida El Méliani, Douglas D. O'Shaughnessy:
Lexical fillers for task-independent-training based keyword spotting and detection of new words. EUROSPEECH 1995: 2129-2132 - 1994
- [j17]Patrick Kenny, Gilles Boulianne, Harinath Garudadri, S. Trudelle, Rene Hollan, Matthew Lennig, Douglas D. O'Shaughnessy:
Experiments in continuous speech recognition using books on tape. Speech Commun. 14(1): 49-60 (1994) - [j16]Gilles Boulianne, Patrick Kenny, Matthew Lennig, Douglas D. O'Shaughnessy, Paul Mermelstein:
Books on tape as training data for continuous speech recognition. Speech Commun. 14(1): 61-70 (1994) - [j15]Changxue Ma, Douglas D. O'Shaughnessy:
The masking of narrowband noise by broadband harmonic complex sounds and implications for the processing of speech sounds. Speech Commun. 14(2): 103-118 (1994) - [j14]Yan Ming Cheng, Douglas D. O'Shaughnessy, Paul Mermelstein:
Statistical recovery of wideband speech from narrowband speech. IEEE Trans. Speech Audio Process. 2(4): 544-548 (1994) - [c33]Douglas D. O'Shaughnessy:
Correcting complex false starts in spontaneous speech. ICASSP (1) 1994: 349-352 - [c32]Patrick Kenny, Paul Labute, Zhishun Li, Douglas D. O'Shaughnessy:
New graph search techniques for speech recognition. ICASSP (1) 1994: 553-556 - 1993
- [j13]Fang-Ming Wang, Peter Kabal, Ravi Prakash Ramachandran, Douglas D. O'Shaughnessy:
Frequency domain adaptive postfiltering for enhancement of noisy speech. Speech Commun. 12(1): 41-56 (1993) - [j12]Patrick Kenny, Rene Hollan, Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Douglas D. O'Shaughnessy:
A*-admissible heuristics for rapid lexical access. IEEE Trans. Speech Audio Process. 1(1): 49-58 (1993) - [j11]Yan Ming Cheng, Douglas D. O'Shaughnessy:
On 450-600 b/s natural sounding speech coding. IEEE Trans. Speech Audio Process. 1(2): 207-220 (1993) - [c31]Patrick Kenny, Paul Labute, Zhishun Li, Rene Hollan, Matthew Lennig, Douglas D. O'Shaughnessy:
A new fast match for very large vocabulary continuous speech recognition. ICASSP (2) 1993: 656-659 - [c30]Douglas D. O'Shaughnessy:
Analysis and automatic recognition of false starts in spontaneous speech. ICASSP (2) 1993: 724-727 - [c29]Changxue Ma, Douglas D. O'Shaughnessy:
A psychophysical study of fourier phase and amplitude coding of speech. EUROSPEECH 1993: 753-756 - [c28]R. Zhao, Patrick Kenny, Paul Labute, Douglas D. O'Shaughnessy:
Issues in large scale statistical language modeling. EUROSPEECH 1993: 965-968 - [c27]Changwen Yang, Douglas D. O'Shaughnessy:
The inks ATIS system and its n-best interface. EUROSPEECH 1993: 2051-2054 - [c26]Patrick Kenny, Paul Labute, Zhishun Li, Rene Hollan, Matthew Lennig, Douglas D. O'Shaughnessy:
A very fast method for scoring phonetic transcriptions. EUROSPEECH 1993: 2117-2120 - [c25]Douglas D. O'Shaughnessy:
Locating disfluencies in spontaneous speech: an acoustical analysis. EUROSPEECH 1993: 2187-2190 - [c24]Pierre Dumouchel, Douglas D. O'Shaughnessy:
Prosody and continuous speech recognition. EUROSPEECH 1993: 2195-2198 - [c23]Changwen Yang, Douglas D. O'Shaughnessy:
Development of the INRS ATIS system. IUI 1993: 133-140 - 1992
- [c22]Douglas D. O'Shaughnessy:
Recognition of hesitations in spontaneous speech. ICASSP 1992: 521-524 - [c21]Yan Ming Cheng, Douglas D. O'Shaughnessy, Vishwa Gupta, Patrick Kenny, Matthew Lennig, Paul Mermelstein, Sarangarajan Parthasarathy:
Hybrid segmental-LVQ/HMM for large vocabulary speech recognition. ICASSP 1992: 593-596 - [c20]Patrick Kenny, Rene Hollan, Gilles Boulianne, Harinath Garudadri, Yan Ming Cheng, Matthew Lennig, Douglas D. O'Shaughnessy:
Experiments in continuous speech recognition with a 60, 000 word vocabulary. ICSLP 1992: 225-228 - [c19]Gilles Boulianne, Patrick Kenny, Matthew Lennig, Douglas D. O'Shaughnessy, Paul Mermelstein:
HMM training on unconstrained speech for large vocabulary, continuous speech recognition. ICSLP 1992: 229-232 - [c18]Yan Ming Cheng, Douglas D. O'Shaughnessy, Peter Kabal:
Speech enhancement using a statistically derived filter mapping. ICSLP 1992: 515-518 - [c17]Douglas D. O'Shaughnessy:
Analysis of false starts in spontaneous speech. ICSLP 1992: 931-934 - [c16]Yan Ming Cheng, Douglas D. O'Shaughnessy, Paul Mermelstein:
Statistical recovery of wideband speech from narrowband speech. ICSLP 1992: 1577-1580 - [c15]Patrick Kenny, Rene Hollan, Gilles Boulianne, Harinath Garudadri, Matthew Lennig, Douglas D. O'Shaughnessy:
An A* algorithm for very large vocabulary continuous speech recognition. HLT 1992 - 1991
- [j10]Yan Ming Cheng, Douglas D. O'Shaughnessy:
Short-term temporal decomposition and its properties for speech compression. IEEE Trans. Signal Process. 39(6): 1282-1290 (1991) - [j9]Yan Ming Cheng, Douglas D. O'Shaughnessy:
Speech enhancement based conceptually on auditory evidence. IEEE Trans. Signal Process. 39(9): 1943-1954 (1991) - [c14]Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Patrick Kenny, Franz Seitz, Douglas D. O'Shaughnessy:
Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition. ICASSP 1991: 341-344 - [c13]Patrick Kenny, Rene Hollan, Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Douglas D. O'Shaughnessy:
A*-admissible heuristics for rapid lexical access. ICASSP 1991: 689-692 - [c12]Yan Ming Cheng, Douglas D. O'Shaughnessy:
Speech enhancement based conceptually on auditory evidence. ICASSP 1991: 961-964 - [c11]Patrick Kenny, Sarangarajan Parthasarathy, Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Douglas D. O'Shaughnessy:
Energy, duration and Markov models. EUROSPEECH 1991: 655-658 - [c10]Douglas D. O'Shaughnessy:
A Textual processor to handle ATIS queries. HLT 1991 - 1990
- [c9]Yan Ming Cheng, Douglas D. O'Shaughnessy:
A 450 b.p.s. vocoder with natural-sounding speech. ICASSP 1990: 649-652 - [c8]Matthew Lennig, Vishwa Gupta, Patrick Kenny, Paul Mermelstein, Douglas D. O'Shaughnessy:
An 86, 000-Word Recognizer Based on Phonemic Models. HLT 1990 - [c7]Douglas D. O'Shaughnessy:
Spectral transitions in rule-based and diphone synthesis. SSW 1990: 21-24
1980 – 1989
- 1989
- [j8]Douglas D. O'Shaughnessy:
Enhancing speech degraded by additive noise or interfering speakers. IEEE Commun. Mag. 27(2): 46-52 (1989) - [j7]Douglas D. O'Shaughnessy:
Parsing with a Small Dictionary for Applications such as Text to Speech. Comput. Linguistics 15(2): 97-108 (1989) - [j6]Douglas D. O'Shaughnessy:
Specifying Accent Marks in French Text for Teletext and Speech Synthesis. Int. J. Man Mach. Stud. 31(4): 405-414 (1989) - [j5]Bernard Kiriakos, Douglas D. O'Shaughnessy:
Lexical stress detection in isolated English words. Speech Commun. 8(2): 113-124 (1989) - [j4]Yan Ming Cheng, Douglas D. O'Shaughnessy:
Automatic and reliable estimation of glottal closure instant and period. IEEE Trans. Acoust. Speech Signal Process. 37(12): 1805-1815 (1989) - [c6]Yan Ming Cheng, Douglas D. O'Shaughnessy:
Parameter sensitivity and robust estimation in an ARX model with glottal excitation. ICASSP 1989: 564-567 - [c5]Douglas D. O'Shaughnessy:
Using syntactic information to improve large-vocabulary word recognition. ICASSP 1989: 715-718 - 1988
- [j3]Douglas D. O'Shaughnessy, Louis Barbeau, David Bernardi, Danièle Archambault:
Diphone speech synthesis. Speech Commun. 7(1): 55-65 (1988) - [c4]Douglas D. O'Shaughnessy:
Speech enhancement using vector quantization and a formant distance measure. ICASSP 1988: 549-552 - 1987
- [c3]Douglas D. O'Shaughnessy:
Specifying intonation in a text-to-speech system using only a small dictionary. ICASSP 1987: 1430-1433 - 1986
- [c2]Douglas D. O'Shaughnessy:
The effects of speaking rate on formant transitions in French synthesis-by-rule. ICASSP 1986: 2027-2030 - 1984
- [j2]Douglas D. O'Shaughnessy:
Design of a real-time French text-to-speech system. Speech Commun. 3(3): 233-243 (1984) - 1983
- [j1]Douglas D. O'Shaughnessy:
Automatic speech synthesis. IEEE Commun. Mag. 21(9): 26-34 (1983)
1970 – 1979
- 1976
- [b1]Douglas D. O'Shaughnessy:
Modelling fundamental frequency, and its relationship to syntax, semantics, and phonetics. Massachusetts Institute of Technology, Cambridge, MA, USA, 1976 - [c1]Jonathan Allen, Douglas D. O'Shaughnessy:
A comprehensive model for fundamental frequency generation. ICASSP 1976: 701-704
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:14 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint