Simon King 0001
Person information
- affiliation: University of Edinburgh, Centre for Speech Technology Research, Scotland, UK
Other persons with the same name
- Simon King — disambiguation page
- Simon King 0002 — Technische Universität Darmstadt, Germany
- Simon King 0003 — University of Twente, The Netherlands
- Simon King 0004 — Yahoo! Research Berkeley, CA, USA (and 1 more)
2020 – today
- 2025
- [j46]Olivier Perrotin, Brooke Stephenson, Silvain Gerber, Gérard Bailly, Simon King:
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023. Comput. Speech Lang. 90: 101747 (2025) - 2024
- [j45]Sébastien Le Maguer, Simon King, Naomi Harte:
The limits of the Mean Opinion Score for speech synthesis evaluation. Comput. Speech Lang. 84: 101577 (2024) - [c219]Atli Sigurgeirsson, Simon King:
Controllable Speaking Styles Using A Large Language Model. ICASSP 2024: 10851-10855 - [i19]Daniel Lyth, Simon King:
Natural language guidance of high-fidelity text-to-speech with synthetic annotations. CoRR abs/2402.01912 (2024) - [i18]Jacob J. Webber, Oliver Watts, Gustav Eje Henter, Jennifer Williams, Simon King:
Voice Conversion-based Privacy through Adversarial Information Hiding. CoRR abs/2409.14919 (2024) - 2023
- [c218]Atli Þór Sigurgeirsson, Simon King:
Do Prosody Transfer Models Transfer Prosody? ICASSP 2023: 1-5 - [c217]Tian Huey Teh, Vivian Hu, Devang S. Ram Mohan, Zack Hodari, Christopher G. R. Wallis, Tomás Gómez Ibarrondo, Alexandra Torresquintero, James Leoni, Mark J. F. Gales, Simon King:
Ensemble Prosody Prediction For Expressive Speech Synthesis. ICASSP 2023: 1-5 - [c216]Jacob J. Webber, Cassia Valentini-Botinhao, Evelyn Williams, Gustav Eje Henter, Simon King:
Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal Processing. ICASSP 2023: 1-5 - [c215]Avashna Govender, Simon King:
Cognitive Load of Modern TTS Systems Under Noisy Conditions. COGAI@IJCLR 2023 - [c214]Niamh Corkey, Johannah O'Mahony, Simon King:
Intonation Control for Neural Text-to-Speech Synthesis with Polynomial Models of F0. INTERSPEECH 2023: 2014-2015 - [c213]Jason Fong, Hao Tang, Simon King:
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations. SSW 2023: 8-13 - [c212]Johannah O'Mahony, Catherine Lai, Simon King:
Synthesising turn-taking cues using natural conversational data. SSW 2023: 75-80 - [c211]Atli Sigurgeirsson, Simon King:
Using a Large Language Model to Control Speaking Style for Expressive TTS. SSW 2023: 246-247 - [e2]Olivier Perrotin, Gérard Bailly, Simon King:
18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023. ISCA 2023 - [i17]Atli Þór Sigurgeirsson, Simon King:
Do Prosody Transfer Models Transfer Prosody? CoRR abs/2303.04289 (2023) - [i16]Atli Þór Sigurgeirsson, Simon King:
Using a Large Language Model to Control Speaking Style for Expressive TTS. CoRR abs/2305.10321 (2023) - [i15]Alistair Carson, Cassia Valentini-Botinhao, Simon King, Stefan Bilbao:
Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing. CoRR abs/2306.01332 (2023) - 2022
- [c210]Jason Fong, Daniel Lyth, Gustav Eje Henter, Hao Tang, Simon King:
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech. INTERSPEECH 2022: 1213-1217 - [c209]Sébastien Le Maguer, Simon King, Naomi Harte:
Back to the Future: Extending the Blizzard Challenge 2013. INTERSPEECH 2022: 2378-2382 - [c208]Johannah O'Mahony, Catherine Lai, Simon King:
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis. INTERSPEECH 2022: 3388-3392 - [i14]Jacob J. Webber, Cassia Valentini-Botinhao, Evelyn Williams, Gustav Eje Henter, Simon King:
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing. CoRR abs/2211.06989 (2022) - 2021
- [j44]Berrak Sisman, Junichi Yamagishi, Simon King, Haizhou Li:
An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 29: 132-157 (2021) - [c207]Zhen-Hua Ling, Xiao Zhou, Simon King:
The Blizzard Challenge 2021. Blizzard Challenge 2021 - [c206]Dan Wells, Pilar Oplustil Gallegos, Simon King:
The CSTR entry to the Blizzard Challenge 2021. Blizzard Challenge 2021 - [c205]Cassia Valentini-Botinhao, Simon King:
Detection and Analysis of Attention Errors in Sequence-to-Sequence Text-to-Speech. Interspeech 2021: 2746-2750 - [c204]Devang S. Ram Mohan, Qinmin Vivian Hu, Tian Huey Teh, Alexandra Torresquintero, Christopher G. R. Wallis, Marlene Staib, Lorenzo Foglianti, Jiameng Gao, Simon King:
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis. Interspeech 2021: 3875-3879 - [c203]Alexandra Torresquintero, Tian Huey Teh, Christopher G. R. Wallis, Marlene Staib, Devang S. Ram Mohan, Vivian Hu, Lorenzo Foglianti, Jiameng Gao, Simon King:
ADEPT: A Dataset for Evaluating Prosody Transfer. Interspeech 2021: 3880-3884 - [c202]Johannah O'Mahony, Pilar Oplustil Gallegos, Catherine Lai, Simon King:
Factors Affecting the Evaluation of Synthetic Speech in Context. SSW 2021: 148-153 - [c201]Pilar Oplustil Gallegos, Johannah O'Mahony, Simon King:
Comparing acoustic and textual representations of previous linguistic context for improving Text-to-Speech. SSW 2021: 205-210 - [c200]Jason Fong, Jennifer Williams, Simon King:
Analysing Temporal Sensitivity of VQ-VAE Sub-Phone Codebooks. SSW 2021: 227-231 - [i13]Devang S. Ram Mohan, Qinmin Vivian Hu, Tian Huey Teh, Alexandra Torresquintero, Christopher G. R. Wallis, Marlene Staib, Lorenzo Foglianti, Jiameng Gao, Simon King:
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis. CoRR abs/2106.08352 (2021) - 2020
- [j43]Xin Wang, Shinji Takaki, Junichi Yamagishi, Simon King, Keiichi Tokuda:
A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 28: 157-170 (2020) - [c199]Xiao Zhou, Zhen-Hua Ling, Simon King:
The Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c198]Mateusz Dubiel, Martin Halvey, Pilar Oplustil Gallegos, Simon King:
Persuasive Synthetic Speech: Voice Perception and User Behaviour. CIU 2020: 6:1-6:9 - [c197]Ivan Himawan, Sandesh Aryal, Iris Ouyang, Sam Kang, Pierre Lanchantin, Simon King:
Speaker Adaptation of a Multilingual Acoustic Model for Cross-Language Synthesis. ICASSP 2020: 7629-7633 - [c196]Carol Chermaz, Simon King:
A Sound Engineering Approach to Near End Listening Enhancement. INTERSPEECH 2020: 1356-1360 - [c195]Pilar Oplustil Gallegos, Jennifer Williams, Joanna Rownicka, Simon King:
An Unsupervised Method to Select a Speaker Subset from Large Multi-Speaker Speech Synthesis Datasets. INTERSPEECH 2020: 1758-1762 - [c194]Jacob J. Webber, Olivier Perrotin, Simon King:
Hider-Finder-Combiner: An Adversarial Architecture for General Speech Signal Modification. INTERSPEECH 2020: 3206-3210 - [c193]Jason Fong, Jason Taylor, Simon King:
Testing the Limits of Representation Mixing for Pronunciation Correction in End-to-End Speech Synthesis. INTERSPEECH 2020: 4019-4023 - [c192]Jennifer Williams, Joanna Rownicka, Pilar Oplustil, Simon King:
Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis. Odyssey 2020: 222-229 - [e1]Junichi Yamagishi, Zhenhua Ling, Rohan Kumar Das, Simon King, Tomi Kinnunen, Tomoki Toda, Wen-Chin Huang, Xiao Zhou, Xiaohai Tian, Yi Zhao:
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, Shanghai, China, October 30, 2020. ISCA 2020 - [i12]Jennifer Williams, Joanna Rownicka, Pilar Oplustil, Simon King:
Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis. CoRR abs/2002.12645 (2020) - [i11]Zack Hodari, Catherine Lai, Simon King:
Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0. CoRR abs/2003.06686 (2020) - [i10]Berrak Sisman, Junichi Yamagishi, Simon King, Haizhou Li:
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning. CoRR abs/2008.03648 (2020) - [i9]Pilar Oplustil Gallegos, Simon King:
Using previous acoustic context to improve Text-to-Speech synthesis. CoRR abs/2012.03763 (2020)
2010 – 2019
- 2019
- [j42]Martin Cooke, Simon King, Valérie Hazan, Yannis Stylianou, Esther Janse, Deniz Baskent, Volker Hohmann, Axel H. Winneke, Inma Hernáez:
Enriched communication across the lifespan. Proces. del Leng. Natural 63: 175-178 (2019) - [c191]Zhizheng Wu, Zhihang Xie, Simon King:
The Blizzard Challenge 2019. Blizzard Challenge 2019 - [c190]Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. ICASSP 2019: 6316-6320 - [c189]Oliver Watts, Cassia Valentini-Botinhao, Simon King:
Speech Waveform Reconstruction Using Convolutional Neural Networks with Noise and Periodic Inputs. ICASSP 2019: 7045-7049 - [c188]Carol Chermaz, Cassia Valentini-Botinhao, Henning F. Schepker, Simon King:
Evaluating Near End Listening Enhancement Algorithms in Realistic Environments. INTERSPEECH 2019: 1373-1377 - [c187]Jason Fong, Pilar Oplustil Gallegos, Zack Hodari, Simon King:
Investigating the Robustness of Sequence-to-Sequence Text-to-Speech Models to Imperfectly-Transcribed Training Data. INTERSPEECH 2019: 1546-1550 - [c186]Avashna Govender, Anita E. Wagner, Simon King:
Using Pupil Dilation to Measure Cognitive Load When Listening to Text-to-Speech in Quiet and in Noise. INTERSPEECH 2019: 1551-1555 - [c185]Jennifer Williams, Simon King:
Disentangling Style Factors from Speaker Representations. INTERSPEECH 2019: 3945-3949 - [c184]Adèle Aubin, Alessandra Cervone, Oliver Watts, Simon King:
Improving Speech Synthesis with Discourse Relations. INTERSPEECH 2019: 4470-4474 - [c183]Avashna Govender, Cassia Valentini-Botinhao, Simon King:
Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis. SSW 2019: 121-126 - [c182]Jason Fong, Jason Taylor, Korin Richmond, Simon King:
A Comparison of Letters and Phones as Input to Sequence-to-Sequence Models for Speech Synthesis. SSW 2019: 223-227 - [c181]Zack Hodari, Oliver Watts, Simon King:
Using generative modelling to produce varied intonation for speech synthesis. SSW 2019: 239-244 - [i8]Zack Hodari, Oliver Watts, Simon King:
Using generative modelling to produce varied intonation for speech synthesis. CoRR abs/1906.04233 (2019) - 2018
- [c180]Simon King, Jane Crumlish, Amy Martin, Lovisa Wihlborg:
The Blizzard Challenge 2018. Blizzard Challenge 2018 - [c179]Zack Hodari, Oliver Watts, Srikanth Ronanki, Simon King:
Learning Interpretable Control Dimensions for Speech Synthesis by Using External Data. INTERSPEECH 2018: 32-36 - [c178]Oliver Watts, Cassia Valentini-Botinhao, Felipe Espic, Simon King:
Exemplar-based Speech Waveform Generation. INTERSPEECH 2018: 2022-2026 - [c177]Olympia Simantiraki, Martin Cooke, Simon King:
Impact of Different Speech Types on Listening Effort. INTERSPEECH 2018: 2267-2271 - [c176]Avashna Govender, Simon King:
Using Pupillometry to Measure the Cognitive Load of Synthetic Speech. INTERSPEECH 2018: 2838-2842 - [c175]Avashna Govender, Simon King:
Measuring the Cognitive Load of Synthetic Speech Using a Dual Task Paradigm. INTERSPEECH 2018: 2843-2847 - [c174]Cassia Valentini-Botinhao, Oliver Watts, Felipe Espic, Simon King:
Examplar-Based Speechwaveform Generation for Text-To-Speech. SLT 2018: 332-338 - [i7]José Novoa, Juan Pablo Escudero, Jorge Wuth, Víctor Poblete, Simon King, Richard M. Stern, Néstor Becerra Yoma:
Exploring the robustness of features and enhancement on speech recognition systems in highly-reverberant real environments. CoRR abs/1803.09013 (2018) - [i6]Gustav Eje Henter, Simon King, Thomas Merritt, Gilles Degottex:
Analysing Shortcomings of Statistical Parametric Speech Synthesis. CoRR abs/1807.10941 (2018) - [i5]Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. CoRR abs/1810.13048 (2018) - 2017
- [j41]Josué Fredes, José Novoa, Simon King, Richard M. Stern, Néstor Becerra Yoma:
Locally Normalized Filter Banks Applied to Deep Neural-Network-Based Robust Speech Recognition. IEEE Signal Process. Lett. 24(4): 377-381 (2017) - [j40]Seyyed Saeed Sarfjoo, Cenk Demiroglu, Simon King:
Using Eigenvoices and Nearest-Neighbors in HMM-Based Cross-Lingual Speaker Adaptation With Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 839-851 (2017) - [c173]Kei Sawada, Keiichi Tokuda, Simon King, Alan W. Black:
The blizzard machine learning challenge 2017. ASRU 2017: 331-337 - [c172]Simon King, Lovisa Wihlborg, Wei Guo:
The Blizzard Challenge 2017. Blizzard Challenge 2017 - [c171]Srikanth Ronanki, Oliver Watts, Simon King:
A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis. INTERSPEECH 2017: 1133-1137 - [c170]Felipe Espic, Cassia Valentini-Botinhao, Simon King:
Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis. INTERSPEECH 2017: 1383-1387 - [c169]Joseph Mendelson, Pilar Oplustil, Oliver Watts, Simon King:
Nativization of Foreign Names in TTS for Automatic Reading of World News in Swahili. INTERSPEECH 2017: 2188-2192 - 2016
- [j39]Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts, Robert A. J. Clark, Simon King:
ALISA: An automatic lightly supervised speech segmentation and alignment tool. Comput. Speech Lang. 35: 116-133 (2016) - [j38]Zhizheng Wu, Phillip L. De Leon, Cenk Demiroglu, Ali Khodabakhsh, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda, Mirjam Wester, Junichi Yamagishi:
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 768-783 (2016) - [j37]Zhizheng Wu, Simon King:
Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1255-1265 (2016) - [c168]Simon King, Vasilis Karaiskos:
The Blizzard Challenge 2016. Blizzard Challenge 2016 - [c167]Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu, Simon King:
Robust TTS duration modelling using DNNS. ICASSP 2016: 5130-5134 - [c166]Zhizheng Wu, Simon King:
Investigating gated recurrent networks for speech synthesis. ICASSP 2016: 5140-5144 - [c165]Thomas Merritt, Robert A. J. Clark, Zhizheng Wu, Junichi Yamagishi, Simon King:
Deep neural network-guided unit selection synthesis. ICASSP 2016: 5145-5149 - [c164]Korin Richmond, Simon King:
Smooth talking: Articulatory join costs for unit selection. ICASSP 2016: 5150-5154 - [c163]Rasmus Dall, Sandrine Brognaux, Korin Richmond, Cassia Valentini-Botinhao, Gustav Eje Henter, Julia Hirschberg, Junichi Yamagishi, Simon King:
Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis. ICASSP 2016: 5155-5159 - [c162]Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, Simon King:
From HMMS to DNNS: Where do the improvements come from? ICASSP 2016: 5505-5509 - [c161]Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, Simon King:
Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2263-2267 - [c160]Víctor Poblete, Juan Pablo Escudero, Josué Fredes, José Novoa, Richard M. Stern, Simon King, Néstor Becerra Yoma:
The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms. INTERSPEECH 2016: 2373-2377 - [c159]Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King:
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs. INTERSPEECH 2016: 2463-2467 - [c158]Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku:
GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2473-2477 - [c157]Srikanth Ronanki, Oliver Watts, Simon King, Gustav Eje Henter:
Median-based generation of synthetic speech durations using a non-parametric approach. SLT 2016: 686-692 - [c156]Srikanth Ronanki, Siva Reddy Gangireddy, Bajibabu Bollepalli, Simon King:
DNN-based Speech Synthesis for Indian Languages from ASCII text. SSW 2016: 70-75 - [c155]Srikanth Ronanki, Zhizheng Wu, Oliver Watts, Simon King:
A Demonstration of the Merlin Open Source Neural Network Speech Synthesis System. SSW 2016: 124 - [c154]Zhizheng Wu, Oliver Watts, Simon King:
Merlin: An Open Source Neural Network Speech Synthesis System. SSW 2016: 202-207 - [i4]Zhizheng Wu, Simon King:
Investigating gated recurrent neural networks for speech synthesis. CoRR abs/1601.02539 (2016) - [i3]Zhizheng Wu, Simon King:
Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training. CoRR abs/1602.06727 (2016) - [i2]Srikanth Ronanki, Siva Reddy Gangireddy, Bajibabu Bollepalli, Simon King:
DNN-based Speech Synthesis for Indian Languages from ASCII text. CoRR abs/1608.05374 (2016) - [i1]Srikanth Ronanki, Oliver Watts, Simon King, Gustav Eje Henter:
Median-Based Generation of Synthetic Speech Durations using a Non-Parametric Approach. CoRR abs/1608.06134 (2016) - 2015
- [j36]Víctor Poblete, Felipe Espic, Simon King, Richard M. Stern, Fernando Huenupán, Josué Fredes, Néstor Becerra Yoma:
A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification. Comput. Speech Lang. 31(1): 1-27 (2015) - [j35]Soheil Khorram, Hossein Sameti, Simon King:
Soft context clustering for F0 modeling in HMM-based speech synthesis. EURASIP J. Adv. Signal Process. 2015: 2 (2015) - [c153]Kishore Prahallad, Anandaswarup Vadapalli, Sai Krishna Rallabandi, Santosh Kesiraju, Hema A. Murthy, T. Nagarajan, Bira Chandra Singh, T. Sajani, K. Sreenivasa Rao, Suryakanth V. Gangashetty, Simon King, Keiichi Tokuda, Alan W. Black:
The Blizzard Challenge 2015. Blizzard Challenge 2015 - [c152]Thomas Merritt, Javier Latorre, Simon King:
Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech. ICASSP 2015: 4220-4224 - [c151]Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito, Tomoki Toda, Simon King:
SAS: A speaker verification spoofing database containing diverse attacks. ICASSP 2015: 4440-4444 - [c150]Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, Simon King:
Deep neural networks employing Multi-Task Learning and stacked bottleneck features for speech synthesis. ICASSP 2015: 4460-4464 - [c149]Simon King:
What speech synthesis can do for you (and what you can do for speech synthesis). ICPhS 2015 - [c148]Zhizheng Wu, Simon King:
Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features. INTERSPEECH 2015: 309-313 - [c147]Cassia Valentini-Botinhao, Zhizheng Wu, Simon King:
Towards minimum perceptual error training for DNN-based speech synthesis. INTERSPEECH 2015: 869-873 - [c146]Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King:
A study of speaker adaptation for DNN-based speech synthesis. INTERSPEECH 2015: 879-883 - [c145]Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, Simon King:
Deep neural network context embeddings for model selection in rich-context HMM synthesis. INTERSPEECH 2015: 2207-2211 - [c144]Oliver Watts, Zhizheng Wu, Simon King:
Sentence-level control vectors for deep neural network speech synthesis. INTERSPEECH 2015: 2217-2221 - [c143]Pierre Lanchantin, Christophe Veaux, Mark J. F. Gales, Simon King, Junichi Yamagishi:
Reconstructing voices within the multiple-average-voice-model framework. INTERSPEECH 2015: 2232-2236 - [c142]Josué Fredes, José Novoa, Víctor Poblete, Simon King, Richard M. Stern, Néstor Becerra Yoma:
Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification. INTERSPEECH 2015: 3011-3015 - [c141]Christophe Veaux, Junichi Yamagishi, Simon King:
A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities. SLPAT@Interspeech 2015: 130-133 - [c140]Adriana Stan, Cassia Valentini-Botinhao, Mircea Giurgiu, Simon King:
Phonetic segmentation of speech using STEP and t-SNE. SpeD 2015: 1-6 - 2014
- [j34]Martin Cooke, Simon King, W. Bastiaan Kleijn, Yannis Stylianou:
Introduction to the Special Issue on The listening talker: context-dependent speech production and perception. Comput. Speech Lang. 28(2): 540-542 (2014) - [j33]Martin Cooke, Simon King, Maeva Garnier, Vincent Aubanel:
The listening talker: A review of human and algorithmic context-induced modifications of speech. Comput. Speech Lang. 28(2): 543-571 (2014) - [j32]Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King, Ranniery Maia:
Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion. Comput. Speech Lang. 28(2): 665-686 (2014) - [j31]Javier Tejedor, Doroteo T. Toledano, Dong Wang, Simon King, José Colás:
Feature analysis for discriminative confidence estimation in spoken term detection. Comput. Speech Lang. 28(5): 1083-1114 (2014) - [j30]Soheil Khorram, Hossein Sameti, Fahimeh Bahmaninezhad, Simon King, Thomas Drugman:
Context-dependent acoustic modeling based on hidden maximum entropy model for statistical parametric speech synthesis. EURASIP J. Audio Speech Music. Process. 2014: 12 (2014) - [j29]Jianhua Tao, Keikichi Hirose, Keiichi Tokuda, Alan W. Black, Simon King:
Introduction to the Issue on Statistical Parametric Speech Synthesis. IEEE J. Sel. Top. Signal Process. 8(2): 170-172 (2014) - [j28]Moses Ekpenyong, Eno-Abasi Urua, Oliver Watts, Simon King, Junichi Yamagishi:
Statistical parametric speech synthesis for Ibibio. Speech Commun. 56: 243-251 (2014) - [c139]Kishore Prahallad, Anandaswarup Vadapalli, Santosh Kesiraju, Hema A. Murthy, Swaran Lata, T. Nagarajan, S. R. Mahadeva Prasanna, Hemant A. Patil, Anil Kumar Sao, Simon King, Alan W. Black, Keiichi Tokuda:
The Blizzard Challenge 2014. Blizzard Challenge 2014 - [c138]Tuomo Raitio, Heng Lu, John Kane, Antti Suni, Martti Vainio, Simon King, Paavo Alku:
Voice source modelling using deep neural networks for statistical parametric speech synthesis. EUSIPCO 2014: 2290-2294 - [c137]Pierre Lanchantin, Mark J. F. Gales, Simon King, Junichi Yamagishi:
Multiple-average-voice-based speech synthesis. ICASSP 2014: 285-289 - [c136]Oliver Watts, Siva Reddy Gangireddy, Junichi Yamagishi, Simon King, Steve Renals, Adriana Stan, Mircea Giurgiu:
Neural net word representations for phrase-break prediction without a part of speech tagger. ICASSP 2014: 2599-2603 - [c135]Jaime Lorenzo-Trueba, Julián D. Echeverry-Correa, Roberto Barra-Chicote, Rubén San-Segundo-Hernández, Javier Ferreiros, Ascensión Gallardo-Antolín, Junichi Yamagishi, Simon King, Juan Manuel Montero-Martínez:
Development of a genre-dependent TTS system with cross-speaker speaking-style transplantation. SLAM@INTERSPEECH 2014: 39-42 - [c134]Rasmus Dall, Marcus Tomalin, Mirjam Wester, William J. Byrne, Simon King:
Investigating automatic & human filled pause insertion for speech synthesis. INTERSPEECH 2014: 51-55 - [c133]Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, Simon King:
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech. INTERSPEECH 2014: 1504-1508 - [c132]Thomas Merritt, Tuomo Raitio, Simon King:
Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis. INTERSPEECH 2014: 1509-1513 - [c131]Ascensión Gallardo-Antolín, Juan Manuel Montero, Simon King:
A comparison of open-source segmentation architectures for dealing with imperfect data from the media in speech synthesis. INTERSPEECH 2014: 2370-2374 - [c130]Herman Kamper, Aren Jansen, Simon King, Sharon Goldwater:
Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings. SLT 2014: 100-105 - 2013
- [j27]John Dines, Hui Liang, Lakshmi Babu Saheer, Matthew Gibson, William Byrne, Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo:
Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis. Comput. Speech Lang. 27(2): 420-437 (2013) - [j26]Christian Geng, Alice Turk, James M. Scobbie, Cedric Macmartin, Philip Hoole, Korin Richmond, Alan Wrench, Marianne Pouplier, Ellen Gurman Bard, Ziggy Campbell, Catherine Dickie, Eddie Dubourg, William J. Hardcastle, Evia Kainada, Simon King, Robin J. Lickley, Satsuki Nakai, Steve Renals, Kevin White, Ronny Wiegand:
Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup. J. Phonetics 41(6): 421-431 (2013) - [j25]Partha Lal, Simon King:
Cross-Lingual Automatic Speech Recognition Using Tandem Features. IEEE ACM Trans. Audio Speech Lang. Process. 21(12): 2506-2515 (2013) - [c129]Simon King, Vasilis Karaiskos:
The Blizzard Challenge 2013. Blizzard Challenge 2013 - [c128]Kishore Prahallad, Anandaswarup Vadapalli, Naresh Kumar Elluru, Gautam Mantena, Bhargav Pulugundla, Peri Bhaskararao, Hema A. Murthy, Simon King, Vasilis Karaiskos, Alan W. Black:
The Blizzard Challenge 2013 - Indian Language Task. Blizzard Challenge 2013 - [c127]Chee-Ming Ting, Simon King, Sh-Hussain Salleh, Ahmad Kamaru Ariff:
Discriminative tandem features for HMM-based EEG classification. EMBC 2013: 3957-3960 - [c126]Mark Sinclair, Simon King:
Where are the challenges in speaker diarization? ICASSP 2013: 7741-7745 - [c125]Heng Lu, Simon King:
Factorized context modelling for Text-to-Speech synthesis. ICASSP 2013: 7849-7853 - [c124]Cassia Valentini-Botinhao, Elizabeth Godoy, Yannis Stylianou, Bastian Sauert, Simon King, Junichi Yamagishi:
Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods. ICASSP 2013: 7854-7858 - [c123]Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A. J. Clark, Simon King, Adriana Stan:
Lightly supervised GMM VAD to use audiobook for speech synthesiser. ICASSP 2013: 7987-7991 - [c122]James M. Scobbie, Alice Turk, Christian Geng, Simon King, Robin J. Lickley, Korin Richmond:
The edinburgh speech production facility doubletalk corpus. INTERSPEECH 2013: 764-766 - [c121]Adriana Stan, Peter Bell, Junichi Yamagishi, Simon King:
Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data. INTERSPEECH 2013: 1525-1529 - [c120]Maria Astrinaki, Junichi Yamagishi, Simon King, Nicolas D'Alessandro, Thierry Dutoit:
Reactive accent interpolation through an interactive map application. INTERSPEECH 2013: 1877-1878 - [c119]Adriana Stan, Oliver Watts, Yoshitaka Mamiya, Mircea Giurgiu, Robert A. J. Clark, Junichi Yamagishi, Simon King:
TUNDRA: a multilingual corpus of found data for TTS research created with light supervision. INTERSPEECH 2013: 2331-2335 - [c118]Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King, Yannis Stylianou:
Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise. INTERSPEECH 2013: 3567-3571 - [c117]Heidi Christensen, Magda B. Aniol, Peter Bell, Phil D. Green, Thomas Hain, Simon King, Pawel Swietojanski:
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. INTERSPEECH 2013: 3642-3645 - [c116]Christophe Veaux, Junichi Yamagishi, Simon King:
The voice bank corpus: Design, collection and data analysis of a large regional accent speech database. O-COCOSDA/CASLRE 2013: 1-4 - [c115]Christophe Veaux, Junichi Yamagishi, Simon King:
Towards Personalised Synthesised Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction. SLPAT 2013: 107-111 - [c114]Yoshitaka Mamiya, Adriana Stan, Junichi Yamagishi, Peter Bell, Oliver Watts, Robert A. J. Clark, Simon King:
Using adaptation to improve speech transcription alignment in noisy and reverberant environments. SSW 2013: 41-46 - [c113]Rubén San-Segundo-Hernández, Juan Manuel Montero, Mircea Giurgiu, Ioana Muresan, Simon King:
Multilingual number transcription for text-to-speech conversion. SSW 2013: 65-69 - [c112]Oliver Watts, Adriana Stan, Robert A. J. Clark, Yoshitaka Mamiya, Mircea Giurgiu, Junichi Yamagishi, Simon King:
Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis. SSW 2013: 101-106 - [c111]Cassia Valentini-Botinhao, Mirjam Wester, Junichi Yamagishi, Simon King:
Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise. SSW 2013: 113-118 - [c110]Kayoko Yanagisawa, Javier Latorre, Vincent Wan, Mark J. F. Gales, Simon King:
Noise robustness in HMM-TTS speaker adaptation. SSW 2013: 119-124 - [c109]Thomas Merritt, Simon King:
Investigating the shortcomings of HMM synthesis. SSW 2013: 165-170 - [c108]Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit:
Mage - reactive articulatory feature control of HMM-based parametric speech synthesis. SSW 2013: 207-211 - [c107]Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit:
Mage - HMM-based speech synthesis reactively controlled by the articulators. SSW 2013: 243 - [c106]Maria Astrinaki, Junichi Yamagishi, Simon King, Nicolas D'Alessandro, Thierry Dutoit:
Reactive accent interpolation through an interactive map application. SSW 2013: 245 - [c105]Heng Lu, Simon King, Oliver Watts:
Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis. SSW 2013: 261-265 - 2012
- [j24]Dong Wang, Javier Tejedor, Simon King, Joe Frankel:
Term-Dependent Confidence Normalisation for Out-of-Vocabulary Spoken Term Detection. J. Comput. Sci. Technol. 27(2): 358-375 (2012) - [j23]Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, Keiichi Tokuda:
Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping. Speech Commun. 54(6): 703-714 (2012) - [j22]Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, Keiichi Tokuda:
Impacts of machine translation and speech synthesis on speech-to-speech translation. Speech Commun. 54(7): 857-866 (2012) - [j21]Dong Wang, Simon King, Joe Frankel, Ravichander Vipperla, Nicholas W. D. Evans, Raphaël Troncy:
Direct posterior confidence for out-of-vocabulary spoken term detection. ACM Trans. Inf. Syst. 30(3): 16:1-16:34 (2012) - [c104]Simon King, Vasilis Karaiskos:
The Blizzard Challenge 2012. Blizzard Challenge 2012 - [c103]Cassia Valentini-Botinhao, Ranniery Maia, Junichi Yamagishi, Simon King, Heiga Zen:
Cepstral analysis based on the glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise. ICASSP 2012: 3997-4000 - [c102]Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King:
Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise. SAPA@INTERSPEECH 2012: 22-27 - [c101]Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King:
Mel cepstral coefficient modification based on the Glimpse Proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise. INTERSPEECH 2012: 631-634 - [c100]Christophe Veaux, Junichi Yamagishi, Simon King:
Using HMM-based Speech Synthesis to Reconstruct the Voice of Individuals with Degenerative Speech Disorders. INTERSPEECH 2012: 967-970 - [c99]Rasmus Dall, Christophe Veaux, Junichi Yamagishi, Simon King:
Analysis of speaker clustering strategies for HMM-based speech synthesis. INTERSPEECH 2012: 995-998 - [c98]Heng Lu, Simon King:
Using Bayesian Networks to find relevant context features for HMM-based speech synthesis. INTERSPEECH 2012: 1143-1146 - [c97]Rubén San Segundo, Juan Manuel Montero, Verónica López-Ludeña, Simon King:
Detecting Acronyms from Capital Letter Sequences in Spanish. INTERSPEECH 2012: 2550-2553 - [c96]Chen-Yu Yang, Georgina Brown, Liang Lu, Junichi Yamagishi, Simon King:
Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation. ISCSLP 2012: 220-223 - [c95]Adriana Stan, Peter Bell, Simon King:
A grapheme-based method for automatic alignment of speech and text data. SLT 2012: 286-290 - 2011
- [j20]Catherine Mayo, Robert A. J. Clark, Simon King:
Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis. Speech Commun. 53(3): 311-326 (2011) - [j19]Adriana Stan, Junichi Yamagishi, Simon King, Matthew P. Aylett:
The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate. Speech Commun. 53(3): 442-450 (2011) - [j18]Dong Wang, Simon King:
Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields. IEEE Signal Process. Lett. 18(2): 122-125 (2011) - [j17]Dong Wang, Simon King, Joe Frankel:
Stochastic Pronunciation Modeling for Out-of-Vocabulary Spoken Term Detection. IEEE Trans. Speech Audio Process. 19(4): 688-698 (2011) - [c94]Christophe Veaux, Junichi Yamagishi, Simon King:
Voice banking and voice reconstruction for MND patients. ASSETS 2011: 305-306 - [c93]Simon King, Vasilis Karaiskos:
The Blizzard Challenge 2011. Blizzard Challenge 2011 - [c92]Kei Hashimoto, Junichi Yamagishi, William J. Byrne, Simon King, Keiichi Tokuda:
An analysis of machine translation and speech synthesis in speech-to-speech translation system. ICASSP 2011: 5108-5111 - [c91]Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King:
Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise. ICASSP 2011: 5112-5115 - [c90]Sandra Andraszewicz, Junichi Yamagishi, Simon King:
Vocal attractiveness of statistical speech synthesisers. ICASSP 2011: 5368-5371 - [c89]Dong Wang, Nicholas W. D. Evans, Raphaël Troncy, Simon King:
Handling overlaps in spoken term detection. ICASSP 2011: 5656-5659 - [c88]Korin Richmond, Phil Hoole, Simon King:
Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus. INTERSPEECH 2011: 1505-1508 - [c87]Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King:
Can Objective Measures Predict the Intelligibility of Modified HMM-Based Synthetic Speech in Noise? INTERSPEECH 2011: 1837-1840 - [c86]Oliver Watts, Junichi Yamagishi, Simon King:
Unsupervised Continuous-Valued Word Features for Phrase-Break Prediction without a Part-of-Speech Tagger. INTERSPEECH 2011: 2157-2160 - [c85]Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Li-Rong Dai:
Formant-Controlled HMM-Based Speech Synthesis. INTERSPEECH 2011: 2777-2780 - 2010
- [j16]John Dines, Junichi Yamagishi, Simon King:
Measuring the Gap Between HMM-Based ASR and TTS. IEEE J. Sel. Top. Signal Process. 4(6): 1046-1058 (2010) - [j15]Roberto Barra-Chicote, Junichi Yamagishi, Simon King, Juan Manuel Montero, Javier Macías Guarasa:
Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech. Speech Commun. 52(5): 394-404 (2010) - [j14]Junichi Yamagishi, Bela Usabaev, Simon King, Oliver Watts, John Dines, Jilei Tian, Yong Guan, Rile Hu, Keiichiro Oura, Yi-Jian Wu, Keiichi Tokuda, Reima Karhila, Mikko Kurimo:
Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora. IEEE Trans. Speech Audio Process. 18(5): 984-1004 (2010) - [j13]Oliver Watts, Junichi Yamagishi, Simon King, Kay Berkling:
Synthesis of Child Speech With HMM Adaptation and Voice Conversion. IEEE Trans. Speech Audio Process. 18(5): 1005-1016 (2010) - [c84]Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Babu Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian:
Personalising Speech-To-Speech Translation in the EMIME Project. ACL (System Demonstrations) 2010: 48-53 - [c83]Simon King, Vasilis Karaiskos:
The Blizzard Challenge 2010. Blizzard Challenge 2010 - [c82]Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester:
Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis. ICASSP 2010: 4594-4597 - [c81]Junichi Yamagishi, Simon King:
Simple methods for improving speaker-similarity of HMM-based speech synthesis. ICASSP 2010: 4610-4613 - [c80]Dong Wang, Simon King, Joe Frankel, Peter Bell:
Stochastic pronunciation modelling and soft match for out-of-vocabulary spoken term detection. ICASSP 2010: 5294-5297 - [c79]Volker Strom, Simon King:
A classifier-based target cost for unit selection speech synthesis trained on perceptual data. INTERSPEECH 2010: 150-153 - [c78]Junichi Yamagishi, Oliver Watts, Simon King, Bela Usabaev:
Roles of the average voice in speaker-adaptive HMM-based speech synthesis. INTERSPEECH 2010: 418-421 - [c77]Javier Tejedor, Doroteo T. Toledano, Miguel Bautista, Simon King, Dong Wang, José Colás:
Augmented set of features for confidence estimation in spoken term detection. INTERSPEECH 2010: 701-704 - [c76]Oliver Watts, Junichi Yamagishi, Simon King:
The role of higher-level linguistic features in HMM-based speech synthesis. INTERSPEECH 2010: 841-844 - [c75]Dong Wang, Simon King, Nicholas W. D. Evans, Raphaël Troncy:
CRF-based stochastic pronunciation modeling for out-of-vocabulary spoken term detection. INTERSPEECH 2010: 1668-1671 - [c74]Dong Wang, Simon King, Nicholas W. D. Evans, Joe Frankel, Raphaël Troncy:
Direct posterior confidence for out-of-vocabulary spoken term detection. SSCS@MM 2010: 21-26 - [c73]Simon King:
Speech synthesis without the right data. SSW 2010: 38 - [c72]Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Babu Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Junichi Yamagishi:
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project. SSW 2010: 192-197 - [c71]Oliver Watts, Junichi Yamagishi, Simon King:
Letter-based speech synthesis. SSW 2010: 317-322
2000 – 2009
- 2009
- [j12]Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhen-Hua Ling, Tomoki Toda, Keiichi Tokuda, Simon King, Steve Renals:
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis. IEEE Trans. Speech Audio Process. 17(6): 1208-1230 (2009) - [c70]Peter Bell, Simon King:
Diagonal priors for full covariance speech recognition. ASRU 2009: 113-117 - [c69]Simon King, Vasilis Karaiskos:
The Blizzard Challenge 2009. Blizzard Challenge 2009 - [c68]Junichi Yamagishi, Mike Lincoln, Simon King, John Dines, Matthew Gibson, Jilei Tian, Yong Guan:
Analysis of Unsupervised and Noise-Robust Speaker-Adaptive HMM-Based Speech Synthesis Systems toward a Unified ASR and TTS Framework. Blizzard Challenge 2009 - [c67]Dong Wang, Javier Tejedor, Joe Frankel, Simon King, José Colás:
Posterior-based confidence measures for spoken term detection. ICASSP 2009: 4889-4892 - [c66]Junichi Yamagishi, Bela Usabaev, Simon King, Oliver Watts, John Dines, Jilei Tian, Rile Hu, Yong Guan, Keiichiro Oura, Keiichi Tokuda, Reima Karhila, Mikko Kurimo:
Thousands of voices for HMM-based speech synthesis. INTERSPEECH 2009: 420-423 - [c65]John Dines, Junichi Yamagishi, Simon King:
Measuring the gap between HMM-based ASR and TTS. INTERSPEECH 2009: 1391-1394 - [c64]Matthew P. Aylett, Simon King, Junichi Yamagishi:
Speech synthesis without a phone inventory. INTERSPEECH 2009: 2087-2090 - [c63]Javier Tejedor, Dong Wang, Simon King, Joe Frankel, José Colás:
A posterior probability-based system hybridisation and combination for spoken term detection. INTERSPEECH 2009: 2131-2134 - [c62]Dong Wang, Simon King, Joe Frankel:
Stochastic pronunciation modelling for spoken term detection. INTERSPEECH 2009: 2135-2138 - [c61]Dong Wang, Simon King, Joe Frankel, Peter Bell:
Term-dependent confidence for out-of-vocabulary term detection. INTERSPEECH 2009: 2139-2142 - [c60]Oliver Watts, Junichi Yamagishi, Simon King, Kay Berkling:
HMM adaptation and voice conversion for the synthesis of child speech: a comparison. INTERSPEECH 2009: 2627-2630 - 2008
- [j11]Olga Goubanova, Simon King:
Bayesian networks for phone duration prediction. Speech Commun. 50(4): 301-311 (2008) - [j10]Javier Tejedor, Dong Wang, Joe Frankel, Simon King, José Colás:
A comparison of grapheme and phoneme-based units for Spanish spoken term detection. Speech Commun. 50(11-12): 980-991 (2008) - [c59]Tiago H. Falk, Sebastian Möller, Vasilis Karaiskos, Simon King:
Improving Instrumental Quality Prediction Performance for the Blizzard Challenge. Blizzard Challenge 2008 - [c58]Vasilis Karaiskos, Simon King, Robert A. J. Clark, Catherine Mayo:
The Blizzard Challenge 2008. Blizzard Challenge 2008 - [c57]Dong Wang, Joe Frankel, Javier Tejedor, Simon King:
A comparison of phone and grapheme-based spoken term detection. ICASSP 2008: 4969-4972 - [c56]Junichi Yamagishi, Zhen-Hua Ling, Simon King:
Robustness of HMM-based speech synthesis. INTERSPEECH 2008: 581-584 - [c55]Peter Bell, Simon King:
A shrinkage estimator for speech recognition with full covariance HMMs. INTERSPEECH 2008: 910-913 - [c54]Peter Bell, Simon King:
Covariance updates for discriminative training by constrained line search. INTERSPEECH 2008: 914 - [c53]Dong Wang, Ivan Himawan, Joe Frankel, Simon King:
A posterior approach for microphone array based speech recognition. INTERSPEECH 2008: 996-999 - [c52]Joe Frankel, Dong Wang, Simon King:
Growing bottleneck features for tandem ASR. INTERSPEECH 2008: 1549 - [c51]Simon King, Keiichi Tokuda, Heiga Zen, Junichi Yamagishi:
Unsupervised adaptation for HMM-based speech synthesis. INTERSPEECH 2008: 1869-1872 - [c50]Volker Strom, Simon King:
Investigating Festival's target cost function using perceptual experiments. INTERSPEECH 2008: 1873-1876 - [c49]László Tóth, Joe Frankel, Gábor Gosztolya, Simon King:
Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian. INTERSPEECH 2008: 2695-2698 - [c48]Yi-Jian Wu, Simon King, Keiichi Tokuda:
Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis. ISCSLP 2008: 9-12 - [c47]Oliver Watts, Junichi Yamagishi, Kay Berkling, Simon King:
HMM-based synthesis of child speech. WOCCI 2008: 19 - 2007
- [j9]Joe Frankel, Mirjam Wester, Simon King:
Articulatory feature recognition using dynamic Bayesian networks. Comput. Speech Lang. 21(4): 620-640 (2007) - [j8]Joe Frankel, Simon King:
Factoring Gaussian precision matrices for linear dynamic models. Pattern Recognit. Lett. 28(16): 2264-2272 (2007) - [j7]Robert A. J. Clark, Korin Richmond, Simon King:
Multisyn: Open-domain unit selection for the Festival speech synthesis system. Speech Commun. 49(4): 317-330 (2007) - [j6]Joe Frankel, Simon King:
Speech Recognition Using Linear Dynamic Models. IEEE Trans. Speech Audio Process. 15(1): 246-256 (2007) - [c46]Özgür Çetin, Mathew Magimai-Doss, Karen Livescu, Arthur Kantor, Simon King, Chris D. Bartels, Joe Frankel:
Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS. ASRU 2007: 36-41 - [c45]Robert A. J. Clark, Monika Podsiadlo, Mark E. Fraser, Catherine Mayo, Simon King:
Statistical analysis of the Blizzard Challenge 2007 listening test results. Blizzard Challenge 2007 - [c44]Mark E. Fraser, Simon King:
The Blizzard Challenge 2007. Blizzard Challenge 2007 - [c43]Karen Livescu, Özgür Çetin, Mark Hasegawa-Johnson, Simon King, Chris D. Bartels, Nash M. Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari Bezman, Stephen Dawson-Haggerty, Bronwyn Woods, Joe Frankel, Mathew Magimai-Doss, Kate Saenko:
Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop. ICASSP (4) 2007: 621-624 - [c42]Özgür Çetin, Arthur Kantor, Simon King, Chris D. Bartels, Mathew Magimai-Doss, Joe Frankel, Karen Livescu:
An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling. ICASSP (4) 2007: 645-648 - [c41]Karen Livescu, Ari Bezman, Nash M. Borges, Lisa Yung, Özgür Çetin, Joe Frankel, Simon King, Mathew Magimai-Doss, Xuemin Chi, Lisa Lavoie:
Manual Transcription of Conversational Speech at the Articulatory Feature Level. ICASSP (4) 2007: 953-956 - [c40]Volker Strom, Ani Nenkova, Robert A. J. Clark, Yolanda Vazquez-Alvarez, Jason M. Brenier, Simon King, Dan Jurafsky:
Modelling prominence and emphasis improves unit-selection synthesis. INTERSPEECH 2007: 1282-1285 - [c39]Peter Bell, Simon King:
Sparse Gaussian graphical models for speech recognition. INTERSPEECH 2007: 2113-2116 - [c38]Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, Özgür Çetin:
Articulatory feature classifiers trained on 2000 hours of telephone speech. INTERSPEECH 2007: 2485-2488 - [c37]Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki Toda, Keiichi Tokuda:
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV. SSW 2007: 125-130 - [c36]Matthew P. Aylett, Simon King:
Single speaker segmentation and inventory selection using dynamic time warping self organization and joint multigram mapping. SSW 2007: 258-263 - 2006
- [j5]Joe Frankel, Simon King:
Observation process adaptation for linear dynamic models. Speech Commun. 48(9): 1192-1199 (2006) - [j4]Jithendra Vepa, Simon King:
Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis. IEEE Trans. Speech Audio Process. 14(5): 1763-1771 (2006) - [c35]Robert A. J. Clark, Korin Richmond, Volker Strom, Simon King:
Multisyn Voice for the Blizzard Challenge 2006. Blizzard Challenge 2006 - [c34]Robert A. J. Clark, Simon King:
Joint prosodic and segmental unit selection speech synthesis. INTERSPEECH 2006 - [c33]Volker Strom, Robert A. J. Clark, Simon King:
Expressive prosody for unit-selection speech synthesis. INTERSPEECH 2006 - 2005
- [c32]Alexander Gutkin, Simon King:
Detection of Symbolic Gestural Events in Articulatory Data for Use in Structural Representations of Continuous Speech. ICASSP (1) 2005: 885-888 - [c31]Robert A. J. Clark, Korin Richmond, Simon King:
Multisyn voices from ARCTIC data for the Blizzard Challenge. INTERSPEECH 2005: 101-104 - [c30]Catherine Mayo, Robert A. J. Clark, Simon King:
Multidimensional scaling of listener responses to synthetic speech. INTERSPEECH 2005: 1725-1728 - [c29]Olga Goubanova, Simon King:
Predicting consonant duration with Bayesian belief networks. INTERSPEECH 2005: 1941-1944 - [c28]Joe Frankel, Simon King:
A hybrid ANN/DBN approach to articulatory feature recognition. INTERSPEECH 2005: 3045-3048 - [c27]Chris D. Bartels, Kevin Duh, Jeff A. Bilmes, Katrin Kirchhoff, Simon King:
Genetic triangulation of graphical models for speech and language processing. INTERSPEECH 2005: 3329-3332 - [c26]Simon King, Chris D. Bartels, Jeff A. Bilmes:
SVitchboard 1: small vocabulary tasks from Switchboard. INTERSPEECH 2005: 3385-3388 - [c25]Alexander Gutkin, Simon King:
Inductive String Template-Based Learning of Spoken Language. PRIS 2005: 43-51 - 2004
- [c24]Alexander Gutkin, Simon King:
Structural Representation of Speech for Phonetic Classification. ICPR (3) 2004: 438-441 - [c23]Jithendra Vepa, Simon King:
Subjective evaluation of join cost functions used in unit selection speech synthesis. INTERSPEECH 2004: 1181-1184 - [c22]Alexander Gutkin, Simon King:
Phone classification in pseudo-euclidean vector spaces. INTERSPEECH 2004: 1453-1456 - [c21]Joe Frankel, Mirjam Wester, Simon King:
Articulatory feature recognition using dynamic Bayesian networks. INTERSPEECH 2004: 1477-1480 - [c20]Yoshinori Shiga, Simon King:
Source-filter separation for articulation-to-speech synthesis. INTERSPEECH 2004: 1913-1916 - [c19]Yoshinori Shiga, Simon King:
Estimating detailed spectral envelopes using articulatory clustering. INTERSPEECH 2004: 2485-2488 - [c18]Jithendra Vepa, Simon King:
Subjective evaluation of join cost & smoothing methods. SSW 2004: 7-12 - [c17]Yoshinori Shiga, Simon King:
Accurate spectral envelope estimation for articulation-to-speech synthesis. SSW 2004: 19-24 - [c16]Robert A. J. Clark, Korin Richmond, Simon King:
Festival 2 - build your own general purpose unit selection speech synthesiser. SSW 2004: 173-178 - 2003
- [j3]Korin Richmond, Simon King, Paul Taylor:
Modelling the uncertainty in recovering articulation from acoustics. Comput. Speech Lang. 17(2-3): 153-172 (2003) - [j2]Simon King:
Dependence and independence in automatic speech recognition and synthesis. J. Phonetics 31(3-4): 407-411 (2003) - [c15]Ben Gillett, Simon King:
Transforming F0 contours. INTERSPEECH 2003: 101-104 - [c14]Jithendra Vepa, Simon King:
Kalman-filter based join cost for unit-selection speech synthesis. INTERSPEECH 2003: 293-296 - [c13]James Horlock, Simon King:
Named entity extraction from word lattices. INTERSPEECH 2003: 1265-1268 - [c12]Ben Gillett, Simon King:
Transforming voice quality. INTERSPEECH 2003: 1713-1716 - [c11]Yoshinori Shiga, Simon King:
Estimating the spectral envelope of voiced speech using multi-frame analysis. INTERSPEECH 2003: 1737-1740 - [c10]Yoshinori Shiga, Simon King:
Estimation of voice source and vocal tract characteristics based on multi-frame analysis. INTERSPEECH 2003: 1749-1752 - [c9]James Horlock, Simon King:
Discriminative methods for improving named entity extraction on speech data. INTERSPEECH 2003: 2765-2768 - 2002
- [c8]Jithendra Vepa, Simon King, Paul Taylor:
Objective distance measures for spectral discontinuities in concatenative speech synthesis. INTERSPEECH 2002: 2605-2608 - [c7]Jesper Salomon, Simon King, Miles Osborne:
Framewise phone classification using support vector machines. INTERSPEECH 2002: 2645-2648 - 2001
- [c6]Joe Frankel, Simon King:
ASR - articulatory speech recognition. INTERSPEECH 2001: 599-602 - 2000
- [j1]Simon King, Paul Taylor:
Detection of phonological features in continuous speech using neural networks. Comput. Speech Lang. 14(4): 333-353 (2000) - [c5]Joe Frankel, Korin Richmond, Simon King, Paul Taylor:
An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces. INTERSPEECH 2000: 254-257
1990 – 1999
- 1998
- [c4]Simon King, Todd A. Stephenson, Stephen Isard, Paul Taylor, Alex Strachan:
Speech recognition via phonetically featured syllables. ICSLP 1998 - 1997
- [c3]Simon King, Thomas Portele, Florian Höfer:
Speech synthesis using non-uniform units in the Verbmobil project. EUROSPEECH 1997: 569-572 - [c2]Paul Taylor, Simon King, Stephen Isard, Helen Wright, Jacqueline C. Kowtko:
Using intonation to constrain language models in speech recognition. EUROSPEECH 1997: 2763-2766 - 1996
- [c1]Paul Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, Jacqueline C. Kowtko:
Using prosodic information to constrain language models for spoken dialogue. ICSLP 1996: 216-219
last updated on 2025-01-20 23:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license