Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2021
Noise Robust Singing Voice Synthesis Using Gaussian Mixture Variational Autoencoder
ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal InteractionPages 131–136https://doi.org/10.1145/3461615.3491115Generating high-quality singing voice usually depends on a sizable studio-level singing corpus which is difficult and expensive to collect. In contrast, there is plenty of singing voice data that can be found on the Internet. However, the found singing ...
- articleJune 2019
The Spoken Wikipedia Corpus collection: Harvesting, alignment and an application to hyperlistening
Language Resources and Evaluation (SPLRE), Volume 53, Issue 2Pages 303–329https://doi.org/10.1007/s10579-017-9410-ySpoken corpora are important for speech research, but are expensive to create and do not necessarily reflect (read or spontaneous) speech `in the wild'. We report on our conversion of the preexisting and freely available Spoken Wikipedia into a speech ...
- articleDecember 2016
Developing a unit selection voice given audio without corresponding text
EURASIP Journal on Audio, Speech, and Music Processing (EJASMP), Volume 2016, Issue 1Article No.: 84, Pages 1–11https://doi.org/10.1186/s13636-016-0084-yToday, a large amount of audio data is available on the web in the form of audiobooks, podcasts, video lectures, video blogs, news bulletins, etc. In addition, we can effortlessly record and store audio data such as a read, lecture, or impromptu speech ...