Philip C. Woodland
2020 – today
- 2025
- [j44]Guangzhi Sun, Chao Zhang, Ivan Vulic, Pawel Budzianowski, Philip C. Woodland:
Knowledge-aware audio-grounded generative slot filling for limited annotated data. Comput. Speech Lang. 89: 101707 (2025) - 2024
- [j43]Keqi Deng, Philip C. Woodland:
Decoupled structure for improved adaptability of end-to-end models. Speech Commun. 163: 103109 (2024) - [j42]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Graph Neural Networks for Contextual ASR With the Tree-Constrained Pointer Generator. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2407-2417 (2024) - [j41]Keqi Deng, Philip C. Woodland:
Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3507-3516 (2024) - [c192]Wen Wu, Wenlin Chen, Chao Zhang, Philip C. Woodland:
Modelling Variability in Human Annotator Simulation. ACL (Findings) 2024: 1139-1157 - [c191]Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. ACL (1) 2024: 2078-2093 - [c190]Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Philip C. Woodland:
Speech-based Slot Filling using Large Language Models. ACL (Findings) 2024: 6351-6362 - [c189]Keqi Deng, Philip C. Woodland:
Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation. ACL (1) 2024: 8235-8251 - [c188]Nineli Lashkarashvili, Wen Wu, Guangzhi Sun, Philip C. Woodland:
Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation. ICASSP 2024: 10986-10990 - [c187]Keqi Deng, Philip C. Woodland:
FastInject: Injecting Unpaired Text Data into CTC-Based ASR Training. ICASSP 2024: 11836-11840 - [c186]Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. Odyssey 2024: 260-265 - [i55]Nineli Lashkarashvili, Wen Wu, Guangzhi Sun, Philip C. Woodland:
Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation. CoRR abs/2402.11747 (2024) - [i54]Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. CoRR abs/2402.12862 (2024) - [i53]Guangzhi Sun, Potsawee Manakul, Adian Liusie, Kunat Pipatanakul, Chao Zhang, Philip C. Woodland, Mark J. F. Gales:
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models. CoRR abs/2405.13684 (2024) - [i52]Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. CoRR abs/2405.20064 (2024) - [i51]Keqi Deng, Guangzhi Sun, Philip C. Woodland:
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning. CoRR abs/2406.00522 (2024) - [i50]Keqi Deng, Philip C. Woodland:
Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation. CoRR abs/2406.04541 (2024) - [i49]Xiaodong Wu, Wenyi Yu, Chao Zhang, Philip C. Woodland:
An Improved Empirical Fisher Approximation for Natural Gradient Descent. CoRR abs/2406.06420 (2024) - [i48]Wen Wu, Chao Zhang, Philip C. Woodland:
Confidence Estimation for Automatic Detection of Depression and Alzheimer's Disease Based on Clinical Interviews. CoRR abs/2407.19984 (2024) - [i47]Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland:
MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events. CoRR abs/2409.17010 (2024) - [i46]Guangzhi Sun, Anmol Kagrecha, Potsawee Manakul, Philip C. Woodland, Mark J. F. Gales:
SkillAggregation: Reference-free LLM-Dependent Aggregation. CoRR abs/2410.10215 (2024) - [i45]Keqi Deng, Jinxi Guo, Yingyi Ma, Niko Moritz, Philip C. Woodland, Ozlem Kalinli, Mike Seltzer:
Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition. CoRR abs/2412.16464 (2024) - 2023
- [j40]Qiujia Li, Chao Zhang, Philip C. Woodland:
Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring. Speech Commun. 147: 12-21 (2023) - [j39]Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland:
Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors. IEEE Trans. Affect. Comput. 14(4): 2810-2822 (2023) - [j38]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Minimising Biasing Word Errors for Contextual ASR With the Tree-Constrained Pointer Generator. IEEE ACM Trans. Audio Speech Lang. Process. 31: 345-354 (2023) - [c185]Wen Wu, Chao Zhang, Philip C. Woodland:
Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression. ACL (1) 2023: 15681-15695 - [c184]Keqi Deng, Philip C. Woodland:
Adaptable End-to-End ASR Models Using Replaceable Internal LMs and Residual Softmax. ICASSP 2023: 1-5 - [c183]Evonne P. C. Lee, Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Spectral Clustering-Aware Learning of Embeddings for Speaker Diarisation. ICASSP 2023: 1-5 - [c182]Yuang Li, Xianrui Zheng, Philip C. Woodland:
Self-Supervised Learning-Based Source Separation for Meeting Data. ICASSP 2023: 1-5 - [c181]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
End-to-End Spoken Language Understanding with Tree-Constrained Pointer Generator. ICASSP 2023: 1-5 - [c180]Wen Wu, Chao Zhang, Philip C. Woodland:
Self-Supervised Representations in Speech-Based Depression Detection. ICASSP 2023: 1-5 - [c179]Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Can Contextual Biasing Remain Effective with Whisper and GPT-2? INTERSPEECH 2023: 1289-1293 - [c178]Dongcheng Jiang, Chao Zhang, Philip C. Woodland:
A Neural Time Alignment Module for End-to-End Automatic Speech Recognition. INTERSPEECH 2023: 1374-1378 - [c177]Wen Wu, Chao Zhang, Philip C. Woodland:
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations. INTERSPEECH 2023: 3607-3611 - [c176]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdel-rahman Mohamed, Philip C. Woodland:
Biased Self-supervised Learning for ASR. INTERSPEECH 2023: 4948-4952 - [i44]Keqi Deng, Philip C. Woodland:
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax. CoRR abs/2302.08579 (2023) - [i43]Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland:
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition. CoRR abs/2303.10917 (2023) - [i42]Wen Wu, Chao Zhang, Philip C. Woodland:
Self-supervised representations in speech-based depression detection. CoRR abs/2305.12263 (2023) - [i41]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator. CoRR abs/2305.18824 (2023) - [i40]Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Can Contextual Biasing Remain Effective with Whisper and GPT-2? CoRR abs/2306.01942 (2023) - [i39]Wen Wu, Chao Zhang, Philip C. Woodland:
Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression. CoRR abs/2306.06760 (2023) - [i38]Guangzhi Sun, Chao Zhang, Ivan Vulic, Pawel Budzianowski, Philip C. Woodland:
Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data. CoRR abs/2307.01764 (2023) - [i37]Wen Wu, Chao Zhang, Philip C. Woodland:
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations. CoRR abs/2308.07145 (2023) - [i36]Keqi Deng, Philip C. Woodland:
Decoupled Structure for Improved Adaptability of End-to-End Models. CoRR abs/2308.13345 (2023) - [i35]Wen Wu, Wenlin Chen, Chao Zhang, Philip C. Woodland:
It HAS to be Subjective: Human Annotator Simulation via Zero-shot Density Estimation. CoRR abs/2310.00486 (2023) - [i34]Theodor Nguyen, Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Conditional Diffusion Model for Target Speaker Extraction. CoRR abs/2310.04791 (2023) - [i33]Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Philip C. Woodland:
Speech-based Slot Filling using Large Language Models. CoRR abs/2311.07418 (2023) - [i32]Keqi Deng, Philip C. Woodland:
FastInject: Injecting Unpaired Text Data into CTC-based ASR training. CoRR abs/2312.09100 (2023) - 2022
- [j37]Cai Wingfield, Chao Zhang, Barry Devereux, Elisabeth Fonteneau, Andrew Thwaites, Xunying Liu, Philip C. Woodland, William D. Marslen-Wilson, Li Su:
On the similarities of representations in artificial and brain neural networks for speech recognition. Frontiers Comput. Neurosci. 16 (2022) - [c175]Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland:
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition. ICASSP 2022: 6537-6541 - [c174]Xiaoyu Yang, Qiujia Li, Philip C. Woodland:
Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-Trained Models. ICASSP 2022: 8527-8531 - [c173]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. INTERSPEECH 2022: 2043-2047 - [c172]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription. INTERSPEECH 2022: 3844-3848 - [c171]Wen Wu, Chao Zhang, Philip C. Woodland:
Distribution-Based Emotion Recognition in Conversation. SLT 2022: 860-867 - [i31]Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland:
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors. CoRR abs/2203.04443 (2022) - [i30]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator. CoRR abs/2205.09058 (2022) - [i29]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. CoRR abs/2207.00857 (2022) - [i28]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription. CoRR abs/2207.03852 (2022) - [i27]Evonne P. C. Lee, Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation. CoRR abs/2210.13576 (2022) - [i26]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator. CoRR abs/2210.16554 (2022) - [i25]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdelrahman Mohamed, Philip C. Woodland:
Biased Self-supervised learning for ASR. CoRR abs/2211.02536 (2022) - [i24]Wen Wu, Chao Zhang, Philip C. Woodland:
Distribution-based Emotion Recognition in Conversation. CoRR abs/2211.04834 (2022) - 2021
- [j36]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Combination of deep speaker embeddings for diarisation. Neural Networks 141: 372-384 (2021) - [j35]Adnan Haider, Chao Zhang, Florian L. Kreyssig, Philip C. Woodland:
A distributed optimisation framework combining natural gradient with Hessian-free for discriminative sequence training. Neural Networks 143: 537-549 (2021) - [c170]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition. ASRU 2021: 162-168 - [c169]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-Constrained Pointer Generator for End-to-End Contextual Speech Recognition. ASRU 2021: 780-787 - [c168]Wen Wu, Chao Zhang, Philip C. Woodland:
Emotion Recognition by Fusing Time Synchronous and Time Asynchronous Representations. ICASSP 2021: 6269-6273 - [c167]Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition. ICASSP 2021: 6388-6392 - [c166]Guangzhi Sun, D. Liu, Chao Zhang, Philip C. Woodland:
Content-Aware Speaker Embeddings for Speaker Diarisation. ICASSP 2021: 7168-7172 - [c165]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Transformer Language Models with LSTM-Based Cross-Utterance Information Representation. ICASSP 2021: 7363-7367 - [c164]Dongcheng Jiang, Chao Zhang, Philip C. Woodland:
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning. Interspeech 2021: 2601-2605 - [c163]Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland:
Residual Energy-Based Models for End-to-End Speech Recognition. Interspeech 2021: 4069-4073 - [c162]Qiujia Li, Florian L. Kreyssig, Chao Zhang, Philip C. Woodland:
Discriminative Neural Clustering for Speaker Diarisation. SLT 2021: 574-581 - [i23]Guangzhi Sun, D. Liu, Chao Zhang, Philip C. Woodland:
Content-Aware Speaker Embeddings for Speaker Diarisation. CoRR abs/2102.06467 (2021) - [i22]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Transformer Language Models with LSTM-based Cross-utterance Information Representation. CoRR abs/2102.06474 (2021) - [i21]Adnan Haider, Chao Zhang, Florian L. Kreyssig, Philip C. Woodland:
A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training. CoRR abs/2103.07554 (2021) - [i20]Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland:
Residual Energy-Based Models for End-to-End Speech Recognition. CoRR abs/2103.14152 (2021) - [i19]Xianrui Zheng, Chao Zhang, Philip C. Woodland:
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition. CoRR abs/2108.07789 (2021) - [i18]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition. CoRR abs/2109.00627 (2021) - [i17]Qiujia Li, Yu Zhang, David Qiu, Yanzhang He, Liangliang Cao, Philip C. Woodland:
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition. CoRR abs/2110.03327 (2021) - 2020
- [c161]Yassir Fathullah, Chao Zhang, Philip C. Woodland:
Improved Large-Margin Softmax Loss for Speaker Diarisation. ICASSP 2020: 7104-7108 - [c160]Florian L. Kreyssig, Philip C. Woodland:
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings. INTERSPEECH 2020: 3241-3245 - [i16]Florian L. Kreyssig, Philip C. Woodland:
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings. CoRR abs/2008.03756 (2020) - [i15]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Cross-Utterance Language Models with Acoustic Error Sampling. CoRR abs/2009.01008 (2020) - [i14]Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition. CoRR abs/2010.11428 (2020) - [i13]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Combination of Deep Speaker Embeddings for Diarisation. CoRR abs/2010.12025 (2020) - [i12]Wen Wu, Chao Zhang, Philip C. Woodland:
Emotion recognition by fusing time synchronous and time asynchronous representations. CoRR abs/2010.14102 (2020)
2010 – 2019
- 2019
- [c159]Qiujia Li, Chao Zhang, Philip C. Woodland:
Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition. ASRU 2019: 39-46 - [c158]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Speaker Diarisation Using 2D Self-attentive Combination of Embeddings. ICASSP 2019: 5801-5805 - [c157]Chao Zhang, Florian L. Kreyssig, Qiujia Li, Philip C. Woodland:
PyHTK: Python Library and ASR Pipelines for HTK. ICASSP 2019: 6470-6474 - [c156]Patrick von Platen, Chao Zhang, Philip C. Woodland:
Multi-Span Acoustic Modelling Using Raw Waveform Signals. INTERSPEECH 2019: 1393-1397 - [i11]Guangzhi Sun, Chao Zhang, Philip C. Woodland:
Speaker diarisation using 2D self-attentive combination of embeddings. CoRR abs/1902.03190 (2019) - [i10]Patrick von Platen, Chao Zhang, Philip C. Woodland:
Multi-Span Acoustic Modelling using Raw Waveform Signals. CoRR abs/1906.11047 (2019) - [i9]Qiujia Li, Chao Zhang, Philip C. Woodland:
Integrating Source-channel and Attention-based Sequence-to-sequence Models for Speech Recognition. CoRR abs/1909.06614 (2019) - [i8]Qiujia Li, Florian L. Kreyssig, Chao Zhang, Philip C. Woodland:
Discriminative Neural Clustering for Speaker Diarisation. CoRR abs/1910.09703 (2019) - [i7]Yassir Fathullah, Chao Zhang, Philip C. Woodland:
Improved Large-margin Softmax Loss for Speaker Diarisation. CoRR abs/1911.03970 (2019) - 2018
- [c155]Florian L. Kreyssig, Chao Zhang, Philip C. Woodland:
Improved TDNNs Using Deep Kernels and Frequency Dependent Grid-RNNs. ICASSP 2018: 4864-4868 - [c154]Chao Zhang, Philip C. Woodland:
High Order Recurrent Neural Networks for Acoustic Modelling. ICASSP 2018: 5849-5853 - [c153]Yu Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland:
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems. INTERSPEECH 2018: 872-876 - [c152]Chao Zhang, Philip C. Woodland:
Semi-tied Units for Efficient Gating in LSTM and Highway Networks. INTERSPEECH 2018: 1773-1777 - [c151]Adnan Haider, Philip C. Woodland:
Combining Natural Gradient with Hessian Free Methods for Sequence Training. INTERSPEECH 2018: 2918-2922 - [i6]Florian Kreyssig, Chao Zhang, Philip C. Woodland:
Improved TDNNs using Deep Kernels and Frequency Dependent Grid-RNNs. CoRR abs/1802.06412 (2018) - [i5]Chao Zhang, Philip C. Woodland:
High Order Recurrent Neural Networks for Acoustic Modelling. CoRR abs/1802.08314 (2018) - [i4]Adnan Haider, Philip C. Woodland:
Sequence Training of DNN Acoustic Models With Natural Gradient. CoRR abs/1804.02204 (2018) - [i3]Chao Zhang, Philip C. Woodland:
Semi-tied Units for Efficient Gating in LSTM and Highway Networks. CoRR abs/1806.06513 (2018) - [i2]Adnan Haider, Philip C. Woodland:
Combining Natural Gradient with Hessian Free Methods for Sequence Training. CoRR abs/1810.01873 (2018) - 2017
- [j34]Cai Wingfield, Li Su, Xunying Liu, Chao Zhang, Philip C. Woodland, Andrew Thwaites, Elisabeth Fonteneau, William D. Marslen-Wilson:
Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem. PLoS Comput. Biol. 13(9) (2017) - [j33]Penny Karanasou, Chunyang Wu, Mark J. F. Gales, Philip C. Woodland:
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 818-828 (2017) - [c150]Adnan Haider, Philip C. Woodland:
Sequence training of DNN acoustic models with natural gradient. ASRU 2017: 178-184 - [c149]Chao Zhang, Philip C. Woodland:
Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training. ICASSP 2017: 5015-5019 - 2016
- [j32]Xunying Liu, Xie Chen, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1438-1449 (2016) - [j31]Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2146-2157 (2016) - [c148]Chao Zhang, Philip C. Woodland:
DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions. ICASSP 2016: 5300-5304 - [c147]J. Yang, Chao Zhang, Anton Ragni, Mark J. F. Gales, Philip C. Woodland:
System combination with log-linear models. ICASSP 2016: 5675-5679 - [c146]Linlin Wang, Chao Zhang, Philip C. Woodland, Mark J. F. Gales, Panagiota Karanasou, Pierre Lanchantin, Xunying Liu, Yanmin Qian:
Improved DNN-based segmentation for multi-genre broadcast audio. ICASSP 2016: 5700-5704 - [c145]Xie Chen, Xunying Liu, Y. Qian, Mark J. F. Gales, Philip C. Woodland:
CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models. ICASSP 2016: 6000-6004 - [c144]Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanman Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems. INTERSPEECH 2016: 3057-3061 - [c143]Yanmin Qian, Philip C. Woodland:
Very deep convolutional neural networks for robust speech recognition. SLT 2016: 481-488 - [i1]Yanmin Qian, Philip C. Woodland:
Very Deep Convolutional Neural Networks for Robust Speech Recognition. CoRR abs/1610.00277 (2016) - 2015
- [c142]Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Investigation of back-off based interpolation between recurrent neural network and n-gram language models. ASRU 2015: 181-186 - [c141]Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland:
Multilingual representations for low resource speech recognition and keyword search. ASRU 2015: 259-266 - [c140]Philip C. Woodland, Xunying Liu, Yanmin Qian, Chao Zhang, Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, Linlin Wang:
Cambridge University transcription systems for the multi-genre broadcast challenge. ASRU 2015: 639-646 - [c139]Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
The development of the Cambridge University alignment systems for the multi-genre broadcast challenge. ASRU 2015: 647-653 - [c138]Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
Speaker diarisation and longitudinal linking in multi-genre broadcast data. ASRU 2015: 660-666 - [c137]Peter Bell, Mark J. F. Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, Philip C. Woodland:
The MGB challenge: Evaluating multi-genre broadcast media recognition. ASRU 2015: 687-693 - [c136]Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improving the training and evaluation efficiency of recurrent neural network language models. ICASSP 2015: 5401-5405 - [c135]Xunying Liu, Xie Chen, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic recurrent neural network language models. ICASSP 2015: 5406-5410 - [c134]Xie Chen, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Recurrent neural network language model training with noise contrastive estimation for speech recognition. ICASSP 2015: 5411-5415 - [c133]Penny Karanasou, Mark J. F. Gales, Philip C. Woodland:
I-vector estimation using informative priors for adaptation of deep neural networks. INTERSPEECH 2015: 2872-2876 - [c132]Xunying Liu, Federico Flego, Linlin Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland:
The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation. INTERSPEECH 2015: 3145-3149 - [c131]Chao Zhang, Philip C. Woodland:
Parameterised sigmoid and ReLU hidden activation functions for DNN acoustic modelling. INTERSPEECH 2015: 3224-3228 - [c130]Xie Chen, T. Tan, Xunying Liu, Pierre Lanchantin, M. Wan, Mark J. F. Gales, Philip C. Woodland:
Recurrent neural network language model adaptation for multi-genre broadcast speech recognition. INTERSPEECH 2015: 3511-3515 - [c129]Chao Zhang, Philip C. Woodland:
A general artificial neural network extension for HTK. INTERSPEECH 2015: 3581-3585 - [c128]Haipeng Wang, Anton Ragni, Mark J. F. Gales, Kate M. Knill, Philip C. Woodland, Chao Zhang:
Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages. INTERSPEECH 2015: 3660-3664 - 2014
- [j30]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic language models. Comput. Speech Lang. 28(6): 1298-1316 (2014) - [c127]Matthew Stephen Seigel, Philip C. Woodland:
Detecting deletions in ASR output. ICASSP 2014: 2302-2306 - [c126]Matthew Stephen Seigel, Philip C. Woodland:
Direct sub-word confidence estimation with hidden-state conditional random fields. ICASSP 2014: 2307-2311 - [c125]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic neural network language models. ICASSP 2014: 4903-4907 - [c124]Xunying Liu, Yongqiang Wang, Xie Chen, Mark J. F. Gales, Philip C. Woodland:
Efficient lattice rescoring using recurrent neural network language models. ICASSP 2014: 4908-4912 - [c123]Chao Zhang, Philip C. Woodland:
Standalone training of context-dependent deep neural network acoustic models. ICASSP 2014: 5597-5601 - [c122]Xie Chen, Yongqiang Wang, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch. INTERSPEECH 2014: 641-645 - [c121]Penny Karanasou, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland:
Adaptation of deep neural network acoustic models using factorised i-vectors. INTERSPEECH 2014: 2180-2184 - 2013
- [j29]Xunying Liu, Mark John Francis Gales, Philip C. Woodland:
Use of contexts in language model interpolation and adaptation. Comput. Speech Lang. 27(1): 301-321 (2013) - [j28]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Language model cross adaptation for LVCSR system combination. Comput. Speech Lang. 27(4): 928-942 (2013) - [c120]Kate M. Knill, Mark J. F. Gales, Shakti P. Rath, Philip C. Woodland, Chao Zhang, Shi-Xiong Zhang:
Investigation of multilingual deep neural networks for spoken term detection. ASRU 2013: 138-143 - [c119]Jonathan Mamou, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Brian Kingsbury, Kate M. Knill, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
System combination and score normalization for spoken term detection. ICASSP 2013: 8272-8276 - [c118]Brian Kingsbury, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Kate M. Knill, Jonathan Mamou, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
A high-performance Cantonese keyword search system. ICASSP 2013: 8277-8281 - [c117]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic language models and combination with neural network language models. ICASSP 2013: 8421-8425 - [c116]Matthew Stephen Seigel, Philip C. Woodland, Mark J. F. Gales:
A confidence-based approach for improving keyword hypothesis scores. ICASSP 2013: 8565-8569 - [c115]Pierre Lanchantin, Peter Bell, Mark J. F. Gales, Thomas Hain, Xunying Liu, Yanhua Long, Jennifer Quinnell, Steve Renals, Oscar Saz, Matthew Stephen Seigel, Pawel Swietojanski, Philip C. Woodland:
Automatic Transcription of Multi-genre Media Archives. SLAM@INTERSPEECH 2013: 26-31 - [c114]Yanhua Long, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Matthew Stephen Seigel, Philip C. Woodland:
Improving lightly supervised training for broadcast transcription. INTERSPEECH 2013: 2187-2191 - [c113]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Cross-domain paraphrasing for improving language modelling using out-of-domain data. INTERSPEECH 2013: 3424-3428 - 2012
- [j27]Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Morphological decomposition in Arabic ASR systems. Comput. Speech Lang. 26(4): 229-243 (2012) - [c112]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Paraphrastic Language Models. INTERSPEECH 2012: 1656-1659 - [c111]Matthew Stephen Seigel, Philip C. Woodland:
Using Sub-word-level Information for Confidence Estimation with Conditional Random Field Models. INTERSPEECH 2012: 2338-2341 - [c110]Frank Diehl, Philip C. Woodland:
Complementary Phone Error Training. INTERSPEECH 2012: 2610-2613 - [c109]Peter Bell, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanhua Long, Steve Renals, Pawel Swietojanski, Philip C. Woodland:
Transcription of multi-genre media archives using out-of-domain data. SLT 2012: 324-329 - 2011
- [j26]Junho Park, Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
The efficient incorporation of MLP features into automatic speech recognition systems. Comput. Speech Lang. 25(3): 519-534 (2011) - [c108]Xunying Liu, Mark John Francis Gales, Jim L. Hieronymus, Philip C. Woodland:
Investigation of acoustic units for LVCSR systems. ICASSP 2011: 4872-4875 - [c107]Frank Diehl, Mark John Francis Gales, Xunying Liu, Marcus Tomalin, Philip C. Woodland:
Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems. INTERSPEECH 2011: 777-780 - [c106]Matthew Stephen Seigel, Philip C. Woodland:
Combining Information Sources for Confidence Estimation with CRF Models. INTERSPEECH 2011: 905-908 - [c105]T. Li, Philip C. Woodland, Frank Diehl, Mark J. F. Gales:
Graphone Model Interpolation and Arabic Pronunciation Generation. INTERSPEECH 2011: 2309-2312 - [c104]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation. INTERSPEECH 2011: 2857-2860 - 2010
- [j25]Kai Yu, Mark J. F. Gales, Lan Wang, Philip C. Woodland:
Unsupervised training and directed manual transcription for LVCSR. Speech Commun. 52(7-8): 652-663 (2010) - [c103]Marcus Tomalin, Frank Diehl, Mark J. F. Gales, Junho Park, Philip C. Woodland:
Recent improvements to the Cambridge Arabic Speech-to-Text systems. ICASSP 2010: 4382-4385 - [c102]Xunying Liu, Mark J. F. Gales, Jim L. Hieronymus, Philip C. Woodland:
Language model combination and adaptation using weighted finite state transducers. ICASSP 2010: 5390-5393 - [c101]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Language model cross adaptation for LVCSR system combination. INTERSPEECH 2010: 342-345 - [c100]Junho Park, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Improved neural network based language modelling and adaptation. INTERSPEECH 2010: 1041-1044
2000 – 2009
- 2009
- [j24]Kai Yu, Mark J. F. Gales, Philip C. Woodland:
Unsupervised Adaptation With Discriminative Mapping Transforms. IEEE Trans. Speech Audio Process. 17(4): 714-723 (2009) - [c99]Junho Park, Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Training and adapting MLP features for Arabic speech recognition. ICASSP 2009: 4461-4464 - [c98]Junho Park, Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Efficient generation and use of MLP features for Arabic speech recognition. INTERSPEECH 2009: 236-239 - [c97]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Use of contexts in language model interpolation and adaptation. INTERSPEECH 2009: 360-363 - [c96]Jim L. Hieronymus, Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Exploiting Chinese character models to improve speech recognition performance. INTERSPEECH 2009: 364-367 - [c95]Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Morphological analysis and decomposition for Arabic speech-to-text systems. INTERSPEECH 2009: 2675-2678 - 2008
- [j23]Lan Wang, Philip C. Woodland:
MPE-based discriminative linear transforms for speaker adaptation. Comput. Speech Lang. 22(3): 256-272 (2008) - [c94]Frank Diehl, Mark J. F. Gales, Marcus Tomalin, Philip C. Woodland:
Phonetic pronunciations for Arabic speech-to-text systems. ICASSP 2008: 1573-1576 - [c93]Kai Yu, Mark J. F. Gales, Philip C. Woodland:
Unsupervised discriminative adaptation using discriminative mapping transforms. ICASSP 2008: 4273-4276 - [c92]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Context dependent language model adaptation. INTERSPEECH 2008: 837-840 - 2007
- [c91]Mark J. F. Gales, Frank Diehl, Chandra Kant Raut, Marcus Tomalin, Philip C. Woodland, Kai Yu:
Development of a phonetic system for large vocabulary Arabic speech recognition. ASRU 2007: 24-29 - [c90]Xunying Liu, William J. Byrne, Mark J. F. Gales, Adrià de Gispert, Marcus Tomalin, Philip C. Woodland, Kai Yu:
Discriminative language model adaptation for Mandarin broadcast speech transcription and translation. ASRU 2007: 153-158 - [c89]Marcus Tomalin, Mark J. F. Gales, X. Andrew Liu, Khe Chai Sim, Rohit Sinha, Lan Wang, Philip C. Woodland, Kai Yu:
Improving Speech Transcription for Mandarin-English Translation. ICASSP (4) 2007: 97-100 - [c88]Khe Chai Sim, William J. Byrne, Mark J. F. Gales, Hichem Sahbi, Philip C. Woodland:
Consensus Network Decoding for Statistical Machine Translation System Combination. ICASSP (4) 2007: 105-108 - [c87]Lan Wang, Mark J. F. Gales, Philip C. Woodland:
Unsupervised Training for Mandarin Broadcast News and Conversation Transcription. ICASSP (4) 2007: 353-356 - [c86]Mark J. F. Gales, Xunying Liu, Rohit Sinha, Philip C. Woodland, Kai Yu, Spyros Matsoukas, Tim Ng, Kham Nguyen, Long Nguyen, Jean-Luc Gauvain, Lori Lamel, Abdelkhalek Messaoudi:
Speech Recognition System Combination for Machine Translation. ICASSP (4) 2007: 1277-1280 - [c85]Kai Yu, Mark J. F. Gales, Philip C. Woodland:
Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio. INTERSPEECH 2007: 1709-1712 - 2006
- [j22]Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Corrections to "Automatic Transcription of Conversational Telephone Speech". IEEE Trans. Speech Audio Process. 14(2): 727-727 (2006) - [j21]Mark J. F. Gales, Do Yeong Kim, Philip C. Woodland, Ho Yin Chan, David Mrva, Rohit Sinha, S. E. Tranter:
Progress in the CU-HTK broadcast news transcription system. IEEE Trans. Speech Audio Process. 14(5): 1513-1525 (2006) - [c84]Marcus Tomalin, Philip C. Woodland:
Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection. ICASSP (1) 2006: 549-552 - [c83]Rohit Sinha, Mark J. F. Gales, Do Yeong Kim, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland:
The CU-HTK Mandarin Broadcast News Transcription System. ICASSP (1) 2006: 1077-1080 - [c82]David Mrva, Philip C. Woodland:
Unsupervised language model adaptation for Mandarin broadcast conversation transcription. INTERSPEECH 2006 - 2005
- [j20]Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu, Gareth L. Moore, Daniel Povey, Lan Wang:
Automatic transcription of conversational telephone speech. IEEE Trans. Speech Audio Process. 13(6): 1173-1185 (2005) - [c81]Gunnar Evermann, Ho Yin Chan, Mark J. F. Gales, Bin Jia, David Mrva, Philip C. Woodland, Kai Yu:
Training LVCSR Systems on Thousands of Hours of Data. ICASSP (1) 2005: 209-212 - [c80]Mark J. F. Gales, Bin Jia, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland, Kai Yu:
Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System. ICASSP (1) 2005: 841-844 - [c79]Do Yeong Kim, Ho Yin Chan, Gunnar Evermann, Mark J. F. Gales, David Mrva, Khe Chai Sim, Philip C. Woodland:
Development of the CU-HTK 2004 Broadcast News Transcription Systems. ICASSP (1) 2005: 861-864 - [c78]Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Barbara Peskin, Jeremy Ang, Dustin Hillard, Mari Ostendorf, Marcus Tomalin, Philip C. Woodland, Mary P. Harper:
Structural metadata research in the EARS program. ICASSP (5) 2005: 957-960 - [c77]Rohit Sinha, S. E. Tranter, Mark J. F. Gales, Philip C. Woodland:
The Cambridge University March 2005 speaker diarisation system. INTERSPEECH 2005: 2437-2440 - 2004
- [j19]Ji-Hwan Kim, Philip C. Woodland:
Automatic capitalisation generation for speech input. Comput. Speech Lang. 18(1): 67-90 (2004) - [c76]Gunnar Evermann, Ho Yin Chan, Mark J. F. Gales, Thomas Hain, Xunying Liu, David Mrva, Lan Wang, Philip C. Woodland:
Development of the 2003 CU-HTK conversational telephone speech transcription system. ICASSP (1) 2004: 249-252 - [c75]Lan Wang, Philip C. Woodland:
MPE-based discriminative linear transform for speaker adaptation. ICASSP (1) 2004: 321-324 - [c74]Ho Yin Chan, Philip C. Woodland:
Improving broadcast news transcription by lightly supervised discriminative training. ICASSP (1) 2004: 737-740 - [c73]Sue Tranter, Kai Yu, Gunnar Evermann, Philip C. Woodland:
Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech. ICASSP (1) 2004: 753-756 - [c72]Do Yeong Kim, Srinivasan Umesh, Mark J. F. Gales, Thomas Hain, Philip C. Woodland:
Using VTLN for broadcast news transcription. INTERSPEECH 2004: 1953-1956 - [c71]David Mrva, Philip C. Woodland:
A PLSA-based language model for conversational telephone speech. INTERSPEECH 2004: 2257-2260 - 2003
- [j18]Edward W. D. Whittaker, Philip C. Woodland:
Language modelling for Russian and English using words and classes. Comput. Speech Lang. 17(1): 87-104 (2003) - [j17]Edward W. D. Whittaker, Philip C. Woodland:
Erratum: Language modelling for Russian and English using words and classes [Computer Speech and Language 17 (2003) 87-104]. Comput. Speech Lang. 17(4): 415 (2003) - [j16]Ji-Hwan Kim, Philip C. Woodland:
A combined punctuation generation and speech recognition system and its performance enhancement using prosody. Speech Commun. 41(4): 563-577 (2003) - [c70]Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Automatic complexity control for HLDA systems. ICASSP (1) 2003: 132-135 - [c69]Daniel Povey, Philip C. Woodland, Mark J. F. Gales:
Discriminative map for acoustic model adaptation. ICASSP (1) 2003: 312-315 - [c68]Mark J. F. Gales, Yuan Dong, Daniel Povey, Philip C. Woodland:
Porting: SwitchBoard to the VoiceMail task. ICASSP (1) 2003: 536-539 - [c67]Daniel Povey, Mark J. F. Gales, Do Yeong Kim, Philip C. Woodland:
MMI-MAP and MPE-MAP for acoustic model adaptation. INTERSPEECH 2003: 1981-1984 - 2002
- [j15]Philip C. Woodland, Daniel Povey:
Large scale discriminative training of hidden Markov models for speech recognition. Comput. Speech Lang. 16(1): 25-47 (2002) - [j14]Philip C. Woodland:
The development of the HTK Broadcast News transcription system: An overview. Speech Commun. 37(1-2): 47-67 (2002) - [c66]Ricardo de Córdoba, Philip C. Woodland, Mark J. F. Gales:
Improved cross-task recognition using MMIE training. ICASSP 2002: 85-88 - [c65]Daniel Povey, Philip C. Woodland:
Minimum Phone Error and I-smoothing for improved discriminative training. ICASSP 2002: 105-108 - [c64]Ji-Hwan Kim, Philip C. Woodland:
Implementation of automatic capitalisation generation systems for speech input. ICASSP 2002: 857-860 - [c63]K. K. Chin, Philip C. Woodland:
Maximum mutual information training of hidden Markov models with vector linear predictors. INTERSPEECH 2002: 997-1000 - [c62]J. T. Wickramaratna, Philip C. Woodland:
Cluster identification for speaker-environment tracking. INTERSPEECH 2002: 2001-2004 - 2001
- [j13]Andreas Tuerk, Sue E. Johnson, Pierre Jourlin, Karen Spärck Jones, Philip C. Woodland:
The Cambridge University Multimedia Document Retrieval Demo System. Int. J. Speech Technol. 4(3-4): 241-250 (2001) - [j12]Sue E. Johnson, Pierre Jourlin, Karen Spärck Jones, Philip C. Woodland:
Information Retrieval from Unsegmented Broadcast News Audio. Int. J. Speech Technol. 4(3-4): 251-268 (2001) - [c61]Daniel Povey, Philip C. Woodland:
Improved discriminative training techniques for large vocabulary continuous speech recognition. ICASSP 2001: 45-48 - [c60]Luís Felipe Uebel, Philip C. Woodland:
Improvements in linear transform based speaker adaptation. ICASSP 2001: 49-52 - [c59]Thomas Hain, Philip C. Woodland, Gunnar Evermann, Daniel Povey:
New features in the CU-HTK system for transcription of conversational telephone speech. ICASSP 2001: 57-60 - [c58]Edward W. D. Whittaker, Philip C. Woodland:
Efficient class-based language modelling for very large vocabularies. ICASSP 2001: 545-548 - [c57]Ji-Hwan Kim, Philip C. Woodland:
The use of prosody in a combined system for punctuation generation and speech recognition. INTERSPEECH 2001: 2757-2760 - 2000
- [j11]Pierre Jourlin, Sue E. Johnson, Karen Spärck Jones, Philip C. Woodland:
Spoken document representations for probabilistic retrieval. Speech Commun. 32(1-2): 21-36 (2000) - [c56]Sue E. Johnson, Philip C. Woodland:
A method for direct audio search with applications to indexing and retrieval. ICASSP 2000: 1427-1430 - [c55]Gunnar Evermann, Philip C. Woodland:
Large vocabulary decoding and confidence estimation using word posterior probabilities. ICASSP 2000: 1655-1658 - [c54]Edward W. D. Whittaker, Philip C. Woodland:
Particle-based language modelling. INTERSPEECH 2000: 170-173 - [c53]Thomas Hain, Philip C. Woodland:
Modelling sub-phone insertions and deletions in continuous speech recognition. INTERSPEECH 2000: 172-175 - [c52]Ji-Hwan Kim, Philip C. Woodland:
A rule-based named entity recognition system for speech input. INTERSPEECH 2000: 528-531 - [c51]Sue E. Johnson, Pierre Jourlin, Karen Sparck Jones, Philip C. Woodland:
Audio Indexing and Retrieval of Complete Broadcast News Shows. RIAO 2000: 1163-1177 - [c50]Philip C. Woodland, Sue E. Johnson, Pierre Jourlin, Karen Sparck Jones:
Effects of out of vocabulary words in spoken document retrieval. SIGIR 2000: 372-374 - [c49]Andreas Tuerk, Sue E. Johnson, Pierre Jourlin, Karen Sparck Jones, Philip C. Woodland:
The Cambridge University multimedia document retrieval demo system. SIGIR 2000: 394 - [c48]Sue E. Johnson, Pierre Jourlin, Karen Sparck Jones, Philip C. Woodland:
Spoken Document Retrieval for TREC-9 at Cambridge University. TREC 2000
1990 – 1999
- 1999
- [j10]Thomas Niesler, Philip C. Woodland:
Variable-length category n-gram language models. Comput. Speech Lang. 13(1): 99-124 (1999) - [j9]Robert E. Donovan, Philip C. Woodland:
A hidden Markov-model-based trainable speech synthesizer. Comput. Speech Lang. 13(3): 223-241 (1999) - [c47]Sue E. Johnson, Pierre Jourlin, Gareth L. Moore, Karen Spärck Jones, Philip C. Woodland:
The Cambridge University spoken document retrieval system. ICASSP 1999: 49-52 - [c46]Thomas Hain, Philip C. Woodland, Thomas Niesler, Edward W. D. Whittaker:
The 1998 HTK system for transcription of conversational telephone speech. ICASSP 1999: 57-60 - [c45]Daniel Povey, Philip C. Woodland:
Frame discrimination training for HMMs for large vocabulary speech recognition. ICASSP 1999: 333-336 - [c44]Thomas Hain, Philip C. Woodland:
Dynamic HMM selection for continuous speech recognition. EUROSPEECH 1999 - [c43]Philip C. Woodland, J. J. Odell, Thomas Hain, Gareth L. Moore, Thomas Niesler, Andreas Tuerk, Edward W. D. Whittaker:
Improvements in accuracy and speed in the HTK broadcast news transcription system. EUROSPEECH 1999: 1043-1046 - [c42]Luís Felipe Uebel, Philip C. Woodland:
An investigation into vocal tract length normalisation. EUROSPEECH 1999: 2527-2530 - [c41]Pierre Jourlin, Sue E. Johnson, Karen Sparck Jones, Philip C. Woodland:
Improving Retrieval on Imperfect Speech Transcriptions (poster abstract). SIGIR 1999: 283-284 - [c40]Sue E. Johnson, Philip C. Woodland, Karen Sparck Jones, Pierre Jourlin:
Spoken Document Retrieval for TREC-8 at Cambridge University. TREC 1999 - 1998
- [c39]Thomas Niesler, Edward W. D. Whittaker, Philip C. Woodland:
Comparison of part-of-speech and automatically derived category-based language models for speech recognition. ICASSP 1998: 177-180 - [c38]Jason J. Humphries, Philip C. Woodland:
The use of accent-specific pronunciation dictionaries in acoustic model training. ICASSP 1998: 317-320 - [c37]Philip C. Woodland, Thomas Hain, Sue E. Johnson, Thomas Niesler, Andreas Tuerk, Steve J. Young:
Experiments in broadcast news transcription. ICASSP 1998: 909-912 - [c36]Thomas Hain, Philip C. Woodland:
Segmentation and classification of broadcast news audio. ICSLP 1998 - [c35]Sue E. Johnson, Philip C. Woodland:
Speaker clustering using direct maximisation of the MLLR-adapted likelihood. ICSLP 1998 - [c34]Edward W. D. Whittaker, Philip C. Woodland:
Comparison of language modelling techniques for Russian and English. ICSLP 1998 - [c33]Sue E. Johnson, Pierre Jourlin, Gareth L. Moore, Karen Sparck Jones, Philip C. Woodland:
Spoken Document Retrieval For TREC-7 At Cambridge University. TREC 1998: 138-147 - 1997
- [j8]Steve J. Young, Martine Adda-Decker, Xavier L. Aubert, Christian Dugast, Jean-Luc Gauvain, Dan J. Kershaw, Lori Lamel, David A. van Leeuwen, David Pye, Anthony J. Robinson, Herman J. M. Steeneken, Philip C. Woodland:
Multilingual large vocabulary speech recognition: the European SQALE project. Comput. Speech Lang. 11(1): 73-89 (1997) - [j7]S. M. Ahadi, Philip C. Woodland:
Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 11(3): 187-206 (1997) - [j6]V. Valtchev, J. J. Odell, Philip C. Woodland, Steve J. Young:
MMIE training of large vocabulary recognition systems. Speech Commun. 22(4): 303-314 (1997) - [c32]Philip C. Woodland, Mark J. F. Gales, David Pye, Steve J. Young:
Broadcast news transcription using HTK. ICASSP 1997: 719-722 - [c31]Thomas Niesler, Philip C. Woodland:
Modelling word-pair relations in a category-based language model. ICASSP 1997: 795-798 - [c30]David Pye, Philip C. Woodland:
Experiments in speaker normalisation and adaptation for large vocabulary speech recognition. ICASSP 1997: 1047-1050 - [c29]Jason J. Humphries, Philip C. Woodland:
Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition. EUROSPEECH 1997: 2367-2370 - 1996
- [j5]Mark J. F. Gales, Philip C. Woodland:
Mean and variance adaptation within the MLLR framework. Comput. Speech Lang. 10(4): 249-264 (1996) - [c28]Philip C. Woodland, Mark John Francis Gales, David Pye:
Improving environmental robustness in large vocabulary speech recognition. ICASSP 1996: 65-68 - [c27]Thomas Niesler, Philip C. Woodland:
A variable-length category-based n-gram language model. ICASSP 1996: 164-167 - [c26]Valtcho Valtchev, Julian Odell, Philip C. Woodland, Steve J. Young:
Lattice-based discriminative training for large vocabulary speech recognition. ICASSP 1996: 605-608 - [c25]V. Valtchev, Philip C. Woodland, Steve J. Young:
Discriminative optimisation of large vocabulary recognition systems. ICSLP 1996: 18-21 - [c24]Thomas Niesler, Philip C. Woodland:
Combination of word-based and category-based language models. ICSLP 1996: 220-223 - [c23]Philip C. Woodland, David Pye, Mark J. F. Gales:
Iterative unsupervised adaptation using maximum likelihood linear regression. ICSLP 1996: 1133-1136 - [c22]Mark J. F. Gales, David Pye, Philip C. Woodland:
Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation. ICSLP 1996: 1832-1835 - [c21]Jason J. Humphries, Philip C. Woodland, David J. B. Pearce:
Using accent-specific pronunciation modelling for robust speech recognition. ICSLP 1996: 2324-2327 - 1995
- [j4]C. J. Leggetter, Philip C. Woodland:
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9(2): 171-185 (1995) - [c20]Philip C. Woodland, C. J. Leggetter, Julian Odell, Valtcho Valtchev, Steve J. Young:
The 1994 HTK large vocabulary speech recognition system. ICASSP 1995: 73-76 - [c19]Robert E. Donovan, Philip C. Woodland:
Automatic speech synthesiser parameter estimation using HMMs. ICASSP 1995: 640-643 - [c18]S. M. Ahadi, Philip C. Woodland:
Rapid speaker adaptation using model prediction. ICASSP 1995: 684-687 - [c17]David Pye, Philip C. Woodland, Steve J. Young:
Large vocabulary multilingual speech recognition using HTK. EUROSPEECH 1995: 181-184 - [c16]Robert E. Donovan, Philip C. Woodland:
Improvements in an HMM-based speech synthesiser. EUROSPEECH 1995: 573-576 - [c15]C. J. Leggetter, Philip C. Woodland:
Flexible speaker adaptation for large vocabulary speech recognition. EUROSPEECH 1995: 1155-1158 - 1994
- [j3]Steve J. Young, Philip C. Woodland:
State clustering in hidden Markov model-based continuous speech recognition. Comput. Speech Lang. 8(4): 369-383 (1994) - [j2]Steve J. Young, Philip C. Woodland, William J. Byrne:
Spontaneous speech recognition for the credit card corpus using the HTK toolkit. IEEE Trans. Speech Audio Process. 2(4): 615-621 (1994) - [c14]Philip C. Woodland, J. J. Odell, V. Valtchev, Steve J. Young:
Large vocabulary continuous speech recognition using HTK. ICASSP (2) 1994: 125-128 - [c13]C. J. Leggetter, Philip C. Woodland:
Speaker adaptation of continuous density HMMs using multivariate linear regression. ICSLP 1994: 451-454 - [c12]V. Valtchev, J. J. Odell, Philip C. Woodland, Steve J. Young:
A dynamic network decoder design for large vocabulary speech recognition. ICSLP 1994: 1351-1354 - [c11]M. Jones, Philip C. Woodland:
Modelling syllable characteristics to improve a large vocabulary continuous speech recogniser. ICSLP 1994: 2171-2174 - [c10]J. J. Odell, V. Valtchev, Philip C. Woodland, Steve J. Young:
A One Pass Decoder Design For Large Vocabulary Recognition. HLT 1994 - [c9]Steve J. Young, J. J. Odell, Philip C. Woodland:
Tree-Based State Tying for High Accuracy Modelling. HLT 1994 - 1993
- [c8]M. Jones, Philip C. Woodland:
Exploiting variable-width features in large vocabulary speech recognition. ICASSP (2) 1993: 323-326 - [c7]Christian Giguère, Philip C. Woodland:
A wave digital filter model of the entire auditory periphery. ICASSP (2) 1993: 708-711 - [c6]M. Jones, Philip C. Woodland:
Using relative duration in large vocabulary speech recognition. EUROSPEECH 1993: 311-314 - [c5]B. A. Maxwell, Philip C. Woodland:
Hidden Markov models using shared vector linear predictors. EUROSPEECH 1993: 819-822 - [c4]Steve J. Young, Philip C. Woodland:
The use of state tying in continuous speech recognition. EUROSPEECH 1993: 2203-2206 - [c3]Philip C. Woodland, Steve J. Young:
The HTK tied-state continuous speech recogniser. EUROSPEECH 1993: 2207-2210 - 1992
- [c2]Philip C. Woodland:
Hidden Markov models using vector linear prediction and discriminative output distributions. ICASSP 1992: 509-512 - 1991
- [c1]Philip C. Woodland, David R. Cole:
Optimising hidden Markov models using discriminative output distributions. ICASSP 1991: 545-548 - 1990
- [j1]Philip C. Woodland, S. G. Smyth:
An experimental comparison of connectionist and conventional classification systems on natural data. Speech Commun. 9(1): 73-82 (1990)
last updated on 2025-01-24 18:07 CET by the dblp team
all metadata released as open data under CC0 1.0 license