End-to-End Speech Processing Toolkit (Python; updated Dec 23, 2024)
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Open singing synthesis platform / Open source UTAU successor
Official PyTorch implementation of BigVGAN (ICLR 2023)
Neural network-based singing voice synthesis library for research
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
A paper and project list covering cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related work (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, and SSL-based ASR).
PyTorch Implementation of StyleSinger (AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
An open-source music processing toolkit
PyTorch Implementation of TCSinger (EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
Dataset and code of GTSinger (NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Singing Voice Synthesis based on VITS, different from VISinger
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
A tool that makes NNSVS models usable in UTAU (UTAU plugin software powered by NNSVS)
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io