
Lei Xie

Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, New York, USA; Ph.D. Program in Biology and Biochemistry, The Graduate Center, The City University of New York, New York, New York, USA; Department of Computer Science, Hunter College, The City University of New York, New York, New York, USA; Helen and Robert Appel Alzheimer's Disease Research Institute, Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, Cornell University, New York, New York, USA

MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement

Jul 16, 2024
Via arXiv

Learning Multi-view Anomaly Detection

Jul 16, 2024
Via arXiv

Whisper-SV: Adapting Whisper for Low-data-resource Speaker Verification

Jul 14, 2024
Via arXiv

AI-driven multi-omics integration for multi-scale predictive modeling of causal genotype-environment-phenotype relationships

Jul 08, 2024
Via arXiv

Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study

Jun 27, 2024
Via arXiv

Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy

Jun 14, 2024
Via arXiv

SCDNet: Self-supervised Learning Feature-based Speaker Change Detection

Jun 12, 2024
Via arXiv

DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion

Jun 12, 2024
Via arXiv

Text-aware and Context-aware Expressive Audiobook Speech Synthesis

Jun 12, 2024
Via arXiv

FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Jun 12, 2024
Via arXiv