Dongchao Yang

2020 – today

2024
[j2]
persistent URL: https://dblp.org/rec/journals/taslp/YangLHWM24
Dongchao Yang, Songxiang Liu, Rongjie Huang, Chao Weng, Helen Meng:
InstructTTS: Modelling Expressive TTS in Discrete Latent Space With Natural Language Style Prompt. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2913-2925 (2024)
[c29]
persistent URL: https://dblp.org/rec/conf/aaai/HuangLYSCYWHHLR24
Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI 2024: 23802-23804
[c28]
persistent URL: https://dblp.org/rec/conf/acl/HuangZWYTYLW0CS24
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners. ACL (1) 2024: 10929-10942
[c27]
persistent URL: https://dblp.org/rec/conf/icassp/WangCYYW0M24
Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng:
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation. ICASSP 2024: 961-965
[c26]
persistent URL: https://dblp.org/rec/conf/icassp/HaiWYTDE24
Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. ICASSP 2024: 1196-1200
[c25]
persistent URL: https://dblp.org/rec/conf/iclr/LengGSJ0LLYZS0024
Yichong Leng, Zhifang Guo, Kai Shen, Zeqian Ju, Xu Tan, Eric Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiangyang Li, Sheng Zhao, Tao Qin, Jiang Bian:
PromptTTS 2: Describing and Generating Voices with Text Prompt. ICLR 2024
[c24]
persistent URL: https://dblp.org/rec/conf/icml/HuangHW0C0YYLGZ24
Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024
[c23]
persistent URL: https://dblp.org/rec/conf/icml/JuWS0XYLLST000024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024
[c22]
persistent URL: https://dblp.org/rec/conf/icml/YangT0HLGCSZ0ZW24
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Haohan Guo, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Zhou Zhao, Xixin Wu, Helen M. Meng:
UniAudio: Towards Universal Audio Generation with Large Language Models. ICML 2024
[c21]
persistent URL: https://dblp.org/rec/conf/mm/HuangWHXHYC00YL24
Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639
[i36]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-03100
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024)
[i35]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-03204
Detai Xin, Xu Tan, Kai Shen, Zeqian Ju, Dongchao Yang, Yuancheng Wang, Shinnosuke Takamichi, Hiroshi Saruwatari, Shujie Liu, Jinyu Li, Sheng Zhao:
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis. CoRR abs/2404.03204 (2024)
[i34]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-02328
Dongchao Yang, Dingdong Wang, Haohan Guo, Xueyuan Chen, Xixin Wu, Helen Meng:
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models. CoRR abs/2406.02328 (2024)
[i33]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-02940
Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng:
Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder. CoRR abs/2406.02940 (2024)
[i32]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-08336
Xueyuan Chen, Dongchao Yang, Dingdong Wang, Xixin Wu, Zhiyong Wu, Helen Meng:
CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction. CoRR abs/2406.08336 (2024)
[i31]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-10056
Dongchao Yang, Haohan Guo, Yuanyuan Wang, Rongjie Huang, Xiang Li, Xu Tan, Xixin Wu, Helen Meng:
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner. CoRR abs/2406.10056 (2024)
[i30]
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-13893
Dongchao Yang, Rongjie Huang, Yuanyuan Wang, Haohan Guo, Dading Chong, Songxiang Liu, Xixin Wu, Helen Meng:
SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models. CoRR abs/2408.13893 (2024)
[i29]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-00933
Haohan Guo, Fenglong Xie, Kun Xie, Dongchao Yang, Dake Guo, Xixin Wu, Helen Meng:
SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis. CoRR abs/2409.00933 (2024)
[i28]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-11630
Haohan Guo, Fenglong Xie, Dongchao Yang, Xixin Wu, Helen Meng:
Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation. CoRR abs/2409.11630 (2024)
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-12560
Yuanyuan Wang, Hangting Chen, Dongchao Yang, Zhiyong Wu, Helen Meng, Xixin Wu:
AudioComposer: Towards Fine-grained Audio Generation with Natural Language Descriptions. CoRR abs/2409.12560 (2024)
[i26]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-14085
Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-Wei Chang, Jiawei Du, Ke-Han Lu, Alexander H. Liu, Ho-Lam Chung, Yuan-Kuei Wu, Dongchao Yang, Songxiang Liu, Yi-Chiao Wu, Xu Tan, James R. Glass, Shinji Watanabe, Hung-yi Lee:
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models. CoRR abs/2409.14085 (2024)
2023
[j1]
persistent URL: https://dblp.org/rec/journals/taslp/YangYWWWZY23
Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-Sound Generation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1720-1733 (2023)
[c20]
persistent URL: https://dblp.org/rec/conf/apsipa/WangYYCZ23
Wen Wang, Dongchao Yang, Qichen Ye, Bowen Cao, Yuexian Zou:
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement. APSIPA ASC 2023: 2416-2423
[c19]
persistent URL: https://dblp.org/rec/conf/icassp/XinYCWZ23
Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou:
Improving Weakly Supervised Sound Event Detection with Causal Intervention. ICASSP 2023: 1-5
[c18]
persistent URL: https://dblp.org/rec/conf/icassp/XinYZ23
Yifei Xin, Dongchao Yang, Yuexian Zou:
Improving Text-Audio Retrieval by Text-Aware Attention Pooling and Prior Matrix Revised Loss. ICASSP 2023: 1-5
[c17]
persistent URL: https://dblp.org/rec/conf/icml/HuangHY0LLYLYZ23
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. ICML 2023: 13916-13932
[c16]
persistent URL: https://dblp.org/rec/conf/interspeech/XinYZ23
Yifei Xin, Dongchao Yang, Yuexian Zou:
Background-aware Modeling for Weakly Supervised Sound Event Detection. INTERSPEECH 2023: 1199-1203
[c15]
persistent URL: https://dblp.org/rec/conf/interspeech/YangLWYWZ23
Dongchao Yang, Songxiang Liu, Helin Wang, Jianwei Yu, Chao Weng, Yuexian Zou:
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS. INTERSPEECH 2023: 4798-4802
[i25]
persistent URL: https://dblp.org/rec/journals/corr/abs-2301-12661
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023)
[i24]
persistent URL: https://dblp.org/rec/journals/corr/abs-2301-13662
Dongchao Yang, Songxiang Liu, Rongjie Huang, Guangzhi Lei, Chao Weng, Helen Meng, Dong Yu:
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt. CoRR abs/2301.13662 (2023)
[i23]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-05678
Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou:
Improving Weakly Supervised Sound Event Detection with Causal Intervention. CoRR abs/2303.05678 (2023)
[i22]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-05681
Yifei Xin, Dongchao Yang, Yuexian Zou:
Improving Text-Audio Retrieval by Text-aware Attention Pooling and Prior Matrix Revised Loss. CoRR abs/2303.05681 (2023)
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2304-12995
Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. CoRR abs/2304.12995 (2023)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-02765
Dongchao Yang, Songxiang Liu, Rongjie Huang, Jinchuan Tian, Chao Weng, Yuexian Zou:
HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec. CoRR abs/2305.02765 (2023)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-18474
Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao:
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR abs/2305.18474 (2023)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-19269
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Unified Voice Synthesis With Discrete Representation. CoRR abs/2305.19269 (2023)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-01212
Wen Wang, Dongchao Yang, Qichen Ye, Bowen Cao, Yuexian Zou:
NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement. CoRR abs/2309.01212 (2023)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-02285
Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian:
PromptTTS 2: Describing and Generating Voices with Text Prompt. CoRR abs/2309.02285 (2023)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-00704
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng:
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. CoRR abs/2310.00704 (2023)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-04567
Jiarui Hai, Helin Wang, Dongchao Yang, Karan Thakkar, Najim Dehak, Mounya Elhilali:
DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction. CoRR abs/2310.04567 (2023)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2312-15463
Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng:
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation. CoRR abs/2312.15463 (2023)
2022
[c14]
persistent URL: https://dblp.org/rec/conf/clawar/TaoYHZCL22
Bo Tao, Dongchao Yang, Geng Huang, Zecui Zeng, Chen Chen, Teng Li:
Omnidirectional Motion Control Method of Quadruped Robot Based on 3D-CPG Oscillator Group. CLAWAR 2022: 301-312
[c13]
persistent URL: https://dblp.org/rec/conf/dcase/WangYZCW22
Helin Wang, Dongchao Yang, Yuexian Zou, Fan Cui, Yujun Wang:
Detect What You Want: Target Sound Detection. DCASE 2022
[c12]
persistent URL: https://dblp.org/rec/conf/dcase/YangWWZ22
Dongchao Yang, Helin Wang, Wenwu Wang, Yuexian Zou:
A Mixed Supervised Learning Framework For Target Sound Detection. DCASE 2022
[c11]
persistent URL: https://dblp.org/rec/conf/icassp/YangWZYW22
Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual Learning Framework for Few-Shot Sound Event Detection. ICASSP 2022: 811-815
[c10]
persistent URL: https://dblp.org/rec/conf/interspeech/YangWYZW22
Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. INTERSPEECH 2022: 1511-1515
[c9]
persistent URL: https://dblp.org/rec/conf/interspeech/WangYWYZ22
Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou:
Improving Target Sound Extraction with Timestamp Information. INTERSPEECH 2022: 1526-1530
[c8]
persistent URL: https://dblp.org/rec/conf/interspeech/XinYZ22
Yifei Xin, Dongchao Yang, Yuexian Zou:
Audio Pyramid Transformer with Domain Adaption for Weakly Supervised Sound Event Detection and Audio Classification. INTERSPEECH 2022: 1546-1550
[c7]
persistent URL: https://dblp.org/rec/conf/interspeech/ZhaoGYTZ22
Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou:
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. INTERSPEECH 2022: 5318-5322
[c6]
persistent URL: https://dblp.org/rec/conf/interspeech/ZhaoYGZZ22
Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. INTERSPEECH 2022: 5333-5337
[c5]
persistent URL: https://dblp.org/rec/conf/robio/LiYCZHTL22
Teng Li, Dongchao Yang, Chen Chen, Zecui Zeng, Geng Huang, Bo Tao, Jiahe Li:
A Mobile Robot Design for Efficient and Large-Scale Solar Panel Cleaning. ROBIO 2022: 70-75
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-00821
Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou:
Improving Target Sound Extraction with Timestamp Information. CoRR abs/2204.00821 (2022)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-01355
Zifeng Zhao, Dongchao Yang, Rongzhi Gu, Haoran Zhang, Yuexian Zou:
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches. CoRR abs/2204.01355 (2022)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-02088
Dongchao Yang, Helin Wang, Yuexian Zou, Wenwu Wang:
A Two-student Learning Framework for Mixed Supervised Target Sound Detection. CoRR abs/2204.02088 (2022)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-02143
Dongchao Yang, Helin Wang, Zhongjie Ye, Yuexian Zou, Wenwu Wang:
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection. CoRR abs/2204.02143 (2022)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-07375
Zifeng Zhao, Rongzhi Gu, Dongchao Yang, Jinchuan Tian, Yuexian Zou:
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction. CoRR abs/2204.07375 (2022)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2207-09983
Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-sound Generation. CoRR abs/2207.09983 (2022)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2211-02448
Dongchao Yang, Songxiang Liu, Jianwei Yu, Helin Wang, Chao Weng, Yuexian Zou:
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS. CoRR abs/2211.02448 (2022)
2021
[c4]
persistent URL: https://dblp.org/rec/conf/dcase/YeWYZ21
Zhongjie Ye, Helin Wang, Dongchao Yang, Yuexian Zou:
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information. DCASE 2021: 40-44
[c3]
persistent URL: https://dblp.org/rec/conf/icmlsc/ZhuYHWLT21
Heng Zhu, Dongchao Yang, Geng Huang, Qingyuan Wu, Teng Li, Bo Tao:
YOLOv3 with Asymmetric Intersection over Union Based Loss Function for Human Detection. ICMLSC 2021: 70-76
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/YangWZ21
Dongchao Yang, Helin Wang, Yuexian Zou:
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification. Interspeech 2021: 1159-1163
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2105-10340
Dongchao Yang, Helin Wang, Yuexian Zou:
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification. CoRR abs/2105.10340 (2021)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-04474
Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang:
A Mutual learning framework for Few-shot Sound Event Detection. CoRR abs/2110.04474 (2021)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-06100
Zhongjie Ye, Helin Wang, Dongchao Yang, Yuexian Zou:
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information. CoRR abs/2110.06100 (2021)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-10153
Dongchao Yang, Helin Wang, Yuexian Zou, Chao Weng:
Detect what you want: Target Sound Detection. CoRR abs/2112.10153 (2021)
2020
[c1]
persistent URL: https://dblp.org/rec/conf/robosoft/FangCSYZX020
Bin Fang, Yang Chen, Fuchun Sun, Dongchao Yang, Xu Zhang, Ziwei Xia, Huaping Liu:
A petal-array capacitive tactile sensor with micro-pin for robotic fingertip sensing. RoboSoft 2020: 452-457
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-08923
Chenyu You, Nuo Chen, Fenglin Liu, Dongchao Yang, Yuexian Zou:
Towards Data Distillation for End-to-end Spoken Conversational Question Answering. CoRR abs/2010.08923 (2020)
