default search action

combined dblp search
author search
venue search
publication search

ask others

Yixuan Zhou 0002

> Home > Persons

Person information

affiliation: Tsinghua University, Shenzhen International Graduate School, Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems, Shenzhen, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouZLWW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouZLWW24
Yixuan Zhou, Shuoyi Zhou, Shun Lei, Zhiyong Wu, Menglin Wu:
The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge. ICASSP Workshops 2024: 71-72
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Lei0CLWWKJZ0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Lei0CLWWKJZ0M24
Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu, Xixin Wu, Shiyin Kang, Tao Jiang, Yahui Zhou, Yuxing Han, Helen Meng:
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts. ICASSP 2024: 12662-12666
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/0002QJZLZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/0002QJZLZ0024
Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. ACM Multimedia 2024: 554-563
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/mrac/CaiYX0X024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mrac/CaiYX0X024
Yunrui Cai, Runchuan Ye, Jingran Xie, Yixuan Zhou, Yaoxun Xu, Zhiyong Wu:
Robust Representation Learning for Multimodal Emotion Recognition with Contrastive Learning and Mixup. MRAC@MM 2024: 93-97
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mrac/XuZCXY024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mrac/XuZCXY024
Yaoxun Xu, Yixuan Zhou, Yunrui Cai, Jingran Xie, Runchuan Ye, Zhiyong Wu:
Multimodal Emotion Captioning Using Large Language Model with Prompt Engineering. MRAC@MM 2024: 104-109
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-16619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-16619
Yixuan Zhou, Shuoyi Zhou, Shun Lei, Zhiyong Wu, Menglin Wu:
The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge. CoRR abs/2404.16619 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-13509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-13509
Weiqin Li, Peiji Yang, Yicheng Zhong, Yixuan Zhou, Zhisheng Wang, Zhiyong Wu, Xixin Wu, Helen Meng:
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models. CoRR abs/2407.13509 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-15676
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-15676
Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. CoRR abs/2408.15676 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-06029
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-06029
Shun Lei, Yixuan Zhou, Boshi Tang, Max W. Y. Lam, Feng Liu, Hangyu Liu, Jingcheng Wu, Shiyin Kang, Zhiyong Wu, Helen Meng:
SongCreator: Lyrics-based Universal Song Generation. CoRR abs/2409.06029 (2024)
2023
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LeiZCWWKM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LeiZCWWKM23
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Xixin Wu, Shiyin Kang, Helen Meng:
MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3290-3303 (2023)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeiZCWKM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeiZCWKM23
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Shiyin Kang, Helen Meng:
Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis. ICASSP 2023: 1-5
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLH00KM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLH00KM23
Weiqin Li, Shun Lei, Qiaochu Huang, Yixuan Zhou, Zhiyong Wu, Shiyin Kang, Helen Meng:
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis. INTERSPEECH 2023: 3377-3381
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-06359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-06359
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Shiyin Kang, Helen Meng:
Context-aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis. CoRR abs/2304.06359 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-16012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-16012
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Xixin Wu, Shiyin Kang, Helen Meng:
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis. CoRR abs/2307.16012 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-16593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-16593
Weiqin Li, Shun Lei, Qiaochu Huang, Yixuan Zhou, Zhiyong Wu, Shiyin Kang, Helen Meng:
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis. CoRR abs/2308.16593 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11977
Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu, Xixin Wu, Shiyin Kang, Tao Jiang, Yahui Zhou, Yuxing Han, Helen Meng:
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts. CoRR abs/2309.11977 (2023)
2022
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenSZWCWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenSZWCWM22
Xueyuan Chen, Changhe Song, Yixuan Zhou, Zhiyong Wu, Changbin Chen, Zhongqin Wu, Helen Meng:
A Character-Level Span-Based Model for Mandarin Prosodic Structure Prediction. ICASSP 2022: 7602-7606
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeiZCWKM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeiZCWKM22
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Shiyin Kang, Helen Meng:
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. ICASSP 2022: 7922-7926
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouSLZ0B0M22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouSLZ0B0M22
Yixuan Zhou, Changhe Song, Xiang Li, Luwen Zhang, Zhiyong Wu, Yanyao Bian, Dan Su, Helen Meng:
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis. INTERSPEECH 2022: 2573-2577
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouSL0B0M22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouSL0B0M22
Yixuan Zhou, Changhe Song, Jingbei Li, Zhiyong Wu, Yanyao Bian, Dan Su, Helen Meng:
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis. INTERSPEECH 2022: 5518-5522
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeiZCH0KM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeiZCH0KM22
Shun Lei, Yixuan Zhou, Liyang Chen, Jiankun Hu, Zhiyong Wu, Shiyin Kang, Helen Meng:
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. INTERSPEECH 2022: 5523-5527
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-12201
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-12201
Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu, Shiyin Kang, Helen Meng:
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. CoRR abs/2203.12201 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16922
Xueyuan Chen, Changhe Song, Yixuan Zhou, Zhiyong Wu, Changbin Chen, Zhongqin Wu, Helen Meng:
A Character-level Span-based Model for Mandarin Prosodic Structure Prediction. CoRR abs/2203.16922 (2022)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-00990
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-00990
Yixuan Zhou, Changhe Song, Xiang Li, Luwen Zhang, Zhiyong Wu, Yanyao Bian, Dan Su, Helen Meng:
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis. CoRR abs/2204.00990 (2022)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-02743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-02743
Shun Lei, Yixuan Zhou, Liyang Chen, Jiankun Hu, Zhiyong Wu, Shiyin Kang, Helen Meng:
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. CoRR abs/2204.02743 (2022)
2021
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SongLZ0M21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SongLZ0M21
Changhe Song, Jingbei Li, Yixuan Zhou, Zhiyong Wu, Helen M. Meng:
Syntactic Representation Learning For Neural Network Based TTS with Syntactic Parse Tree Traversal. ICASSP 2021: 6064-6068
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-06835
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06835
Yixuan Zhou, Changhe Song, Jingbei Li, Zhiyong Wu, Helen Meng:
Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech. CoRR abs/2104.06835 (2021)
2020
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-06971
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-06971
Changhe Song, Jingbei Li, Yixuan Zhou, Zhiyong Wu, Helen M. Meng:
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal. CoRR abs/2012.06971 (2020)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.