default search action

combined dblp search
author search
venue search
publication search

ask others

Jian Cong

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/TanCLCZLWLYHZQSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/TanCLCZLWLYHZQSL24
Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Sheng Zhao, Tao Qin, Frank K. Soong, Tie-Yan Liu:
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality. IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4234-4245 (2024)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiWZCTWX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiWZCTWX24
Tao Li, Zhichao Wang, Xinfa Zhu, Jian Cong, Qiao Tian, Yuping Wang, Lei Xie:
U-Style: Cascading U-Nets With Multi-Level Speaker and Style Modeling for Zero-Shot Voice Cloning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4026-4035 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02430
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02430
Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, Xudong Liu, Yuchen Liu, Zhengxi Liu, Lu Lu, Junjie Pan, Xin Wang, Yuping Wang, Yuxuan Wang, Zhen Wei, Jian Wu, Chao Yao, Yifeng Yang, Yuanhao Yi, Junteng Zhang, Qidi Zhang, Shuo Zhang, Wenjie Zhang, Yang Zhang, Zilin Zhao, Dejian Zhong, Xiaobin Zhuang:
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models. CoRR abs/2406.02430 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-02622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-02622
Ziyang Ma, Yakun Song, Chenpeng Du, Jian Cong, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen:
Language Model Can Listen While Speaking. CoRR abs/2408.02622 (2024)
2023
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiHCZLTWX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiHCZLTWX23
Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie:
DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3418-3430 (2023)
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SongZLCLXHB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SongZLCLXHB23
Kun Song, Yongmao Zhang, Yi Lei, Jian Cong, Hanzhao Li, Lei Xie, Gang He, Jinfeng Bai:
DSPGAN: A Gan-Based Universal Vocoder for High-Fidelity TTS by Time-Frequency Domain Supervision from DSP. ICASSP 2023: 1-5
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-00883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-00883
Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie:
DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech - A Study between English and Mandarin. CoRR abs/2309.00883 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-04004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04004
Tao Li, Zhichao Wang, Xinfa Zhu, Jian Cong, Qiao Tian, Yuping Wang, Lei Xie:
U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning. CoRR abs/2310.04004 (2023)
2022
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCXXZB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCXXZB22
Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis. ICASSP 2022: 7237-7241
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeiYC0022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeiYC0022
Yi Lei, Shan Yang, Jian Cong, Lei Xie, Dan Su:
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion. INTERSPEECH 2022: 2563-2567
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SongCWZXJW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SongCWZXJW22
Kun Song, Jian Cong, Xinsheng Wang, Yongmao Zhang, Lei Xie, Ning Jiang, Haiying Wu:
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS. ISCSLP 2022: 71-75
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SongXWCZXYZS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SongXWCZXYZS22
Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su:
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation. ISCSLP 2022: 319-323
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-04421
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-04421
Xu Tan, Jiawei Chen, Haohe Liu, Jian Cong, Chen Zhang, Yanqing Liu, Xi Wang, Yichong Leng, Yuanhao Yi, Lei He, Frank K. Soong, Tao Qin, Sheng Zhao, Tie-Yan Liu:
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality. CoRR abs/2205.04421 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-00208
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00208
Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su:
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation. CoRR abs/2206.00208 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-01832
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-01832
Yi Lei, Shan Yang, Jian Cong, Lei Xie, Dan Su:
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion. CoRR abs/2207.01832 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17349
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17349
Kun Song, Jian Cong, Xinsheng Wang, Yongmao Zhang, Lei Xie, Ning Jiang, Haiying Wu:
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS. CoRR abs/2210.17349 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01087
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01087
Kun Song, Yongmao Zhang, Yi Lei, Jian Cong, Hanzhao Li, Lei Xie, Gang He, Jinfeng Bai:
DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by time-frequency domain supervision from DSP. CoRR abs/2211.01087 (2022)
2021
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CongYXS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CongYXS21
Jian Cong, Shan Yang, Lei Xie, Dan Su:
Glow-WaveGAN: Learning Speech Representations from GAN-Based Variational Auto-Encoder for High Fidelity Flow-Based Speech Synthesis. Interspeech 2021: 2182-2186
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CongYHLX021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CongYHLX021
Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, Dan Su:
Controllable Context-Aware Conversational Speech Synthesis. Interspeech 2021: 4658-4662
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-10828
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10828
Jian Cong, Shan Yang, Na Hu, Guangzhi Li, Lei Xie, Dan Su:
Controllable Context-aware Conversational Speech Synthesis. CoRR abs/2106.10828 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-10831
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10831
Jian Cong, Shan Yang, Lei Xie, Dan Su:
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis. CoRR abs/2106.10831 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-08813
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-08813
Yongmao Zhang, Jian Cong, Heyang Xue, Lei Xie, Pengcheng Zhu, Mengxiao Bi:
VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis. CoRR abs/2110.08813 (2021)
2020
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CongY0YW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CongY0YW20
Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training. INTERSPEECH 2020: 811-815
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-04265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-04265
Jian Cong, Shan Yang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training. CoRR abs/2008.04265 (2020)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2006
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CongC06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CongC06
Jian Cong, Suo Cong:
New Speech Encoding Algorithms for Ultra Low Bit Rate at 600/300 Bps. ICASSP (1) 2006: 709-712

1990 – 1999

see FAQ

What is the meaning of the colors in the publication lists?

1999
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icsc/CongCLWJ99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icsc/CongCLWJ99
Jian Cong, Suo Cong, Zaimin Li, Gansha Wu, Wen Jin:
Technology of Image Coding for Cell Losses in ATM Transmission. ICSC 1999: 509-516

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.