Yao Qian
This is a disambiguation page, not the bibliography of an actual person. None of the publications listed here has been assigned to an actual author yet. If you know the true author of any of the publications listed below, you are welcome to contact us.
2020 – today
- 2024
- [j21] Jiacheng Wu, Gang Liu, Xiao Wang, Haojie Tang, Yao Qian: GAN-GA: infrared and visible image fusion generative adversarial network based on global awareness. Appl. Intell. 54(13-14): 7296-7316 (2024)
- [j20] Kaixin Li, Gang Liu, Xinjie Gu, Haojie Tang, Jinxin Xiong, Yao Qian: DANT-GAN: A dual attention-based of nested training network for infrared and visible image fusion. Digit. Signal Process. 145: 104316 (2024)
- [j19] Rui Chang, Gang Liu, Haojie Tang, Yao Qian, Jianchao Tang: RDGMEF: a multi-exposure image fusion framework based on Retinex decomposition and guided filter. Neural Comput. Appl. 36(20): 12083-12102 (2024)
- [j18] Mengliang Xing, Gang Liu, Haojie Tang, Yao Qian, Jun Zhang: CFNet: An infrared and visible image compression fusion network. Pattern Recognit. 156: 110774 (2024)
- [j17] Haojie Tang, Gang Liu, Yao Qian, Jiebang Wang, Jinxin Xiong: EgeFusion: Towards Edge Gradient Enhancement in Infrared and Visible Image Fusion With Multi-Scale Transform. IEEE Trans. Computational Imaging 10: 385-398 (2024)
- [c107] Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng: Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition. ICASSP 2024: 11046-11050
- [c106] Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Xuemei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang: i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. NAACL-HLT (Findings) 2024: 1615-1627
- [i30] Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng: CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. CoRR abs/2404.06690 (2024)
- [i29] Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Sheng Zhao, Michael Zeng: TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation. CoRR abs/2405.17809 (2024)
- [i28] Sanyuan Chen, Shujie Liu, Long Zhou, Yanqing Liu, Xu Tan, Jinyu Li, Sheng Zhao, Yao Qian, Furu Wei: VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers. CoRR abs/2406.05370 (2024)
- [i27] Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yanqing Liu, Junkun Chen, Sheng Zhao, Jinyu Li, Zhizheng Wu, Michael Zeng: Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation. CoRR abs/2409.04016 (2024)
- 2023
- [c105] Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang: i-Code: An Integrative and Composable Multimodal Learning Framework. AAAI 2023: 10880-10890
- [c104] Chenda Li, Yao Qian, Zhuo Chen, Dongmei Wang, Takuya Yoshioka, Shujie Liu, Yanmin Qian, Michael Zeng: Target Sound Extraction with Variable Cross-Modality Clues. ICASSP 2023: 1-5
- [c103] Heming Wang, Yao Qian, Hemin Yang, Naoyuki Kanda, Peidong Wang, Takuya Yoshioka, Xiaofei Wang, Yiming Wang, Shujie Liu, Zhuo Chen, DeLiang Wang, Michael Zeng: DATA2VEC-SG: Improving Self-Supervised Learning Representations for Speech Generation Tasks. ICASSP 2023: 1-5
- [c102] Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng: Code-Switching Text Generation and Injection in Mandarin-English ASR. ICASSP 2023: 1-5
- [c101] Chenda Li, Yao Qian, Zhuo Chen, Naoyuki Kanda, Dongmei Wang, Takuya Yoshioka, Yanmin Qian, Michael Zeng: Adapting Multi-Lingual ASR Models for Handling Multiple Talkers. INTERSPEECH 2023: 1314-1318
- [c100] Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng, Xuedong Huang: ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation. NeurIPS 2023
- [i26] Chenda Li, Yao Qian, Zhuo Chen, Dongmei Wang, Takuya Yoshioka, Shujie Liu, Yanmin Qian, Michael Zeng: Target Sound Extraction with Variable Cross-modality Clues. CoRR abs/2303.08372 (2023)
- [i25] Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng: Code-Switching Text Generation and Injection in Mandarin-English ASR. CoRR abs/2303.10949 (2023)
- [i24] Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang: i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. CoRR abs/2305.12311 (2023)
- [i23] Yuwei Fang, Mahmoud Khademi, Chenguang Zhu, Ziyi Yang, Reid Pryzant, Yichong Xu, Yao Qian, Takuya Yoshioka, Lu Yuan, Michael Zeng, Xuedong Huang: i-Code Studio: A Configurable and Composable Framework for Integrative AI. CoRR abs/2305.13738 (2023)
- [i22] Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Michael Zeng, Xuedong Huang: ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation. CoRR abs/2305.14838 (2023)
- [i21] Chenda Li, Yao Qian, Zhuo Chen, Naoyuki Kanda, Dongmei Wang, Takuya Yoshioka, Yanmin Qian, Michael Zeng: Adapting Multi-Lingual ASR Models for Handling Multiple Talkers. CoRR abs/2305.18747 (2023)
- [i20] Leying Zhang, Yao Qian, Linfeng Yu, Heming Wang, Xinkai Wang, Hemin Yang, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng: Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction. CoRR abs/2309.13874 (2023)
- 2022
- [j16] Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei: WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022)
- [c99] Junyi Ao, Rui Wang, Long Zhou, Chengyi Wang, Shuo Ren, Yu Wu, Shujie Liu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei: SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing. ACL (1) 2022: 5723-5738
- [c98] Heming Wang, Yao Qian, Xiaofei Wang, Yiming Wang, Chengyi Wang, Shujie Liu, Takuya Yoshioka, Jinyu Li, DeLiang Wang: Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction. ICASSP 2022: 6062-6066
- [c97] Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng: Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification. ICASSP 2022: 6147-6151
- [c96] Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu: Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training. ICASSP 2022: 6152-6156
- [c95] Chengyi Wang, Yu Wu, Sanyuan Chen, Shujie Liu, Jinyu Li, Yao Qian, Zhenglu Yang: Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision. ICASSP 2022: 7092-7096
- [c94] Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu: Wav2vec-Switch: Contrastive Learning from Original-Noisy Speech Pairs for Robust Speech Recognition. ICASSP 2022: 7097-7101
- [c93] Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng: Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding. ICASSP 2022: 7802-7806
- [c92] Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei: Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. INTERSPEECH 2022: 2658-2662
- [c91] Zhengyang Chen, Yao Qian, Bing Han, Yanmin Qian, Michael Zeng: A Comprehensive Study on Self-Supervised Distillation for Speaker Representation Learning. SLT 2022: 599-604
- [i19] Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei: Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. CoRR abs/2203.17113 (2022)
- [i18] Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang: i-Code: An Integrative and Composable Multimodal Learning Framework. CoRR abs/2205.01818 (2022)
- [i17] Mostafa Karimi, Changliang Liu, Ken'ichi Kumatani, Yao Qian, Tianyu Wu, Jian Wu: Deploying self-supervised learning in the wild for hybrid automatic speech recognition. CoRR abs/2205.08598 (2022)
- [i16] Gang Liu, Tianyan Zhou, Yong Zhao, Yu Wu, Zhuo Chen, Yao Qian, Jian Wu: The Microsoft System for VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.11266 (2022)
- [i15] Zhengyang Chen, Yao Qian, Bing Han, Yanmin Qian, Michael Zeng: A comprehensive study on self-supervised distillation for speaker representation learning. CoRR abs/2210.15936 (2022)
- 2021
- [c90] Yao Qian, Ximo Bian, Yu Shi, Naoyuki Kanda, Leo Shen, Zhen Xiao, Michael Zeng: Speech-Language Pre-Training for End-to-End Spoken Language Understanding. ICASSP 2021: 7458-7462
- [c89] Chengyi Wang, Yu Wu, Yao Qian, Ken'ichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang: UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data. ICML 2021: 10937-10947
- [c88] Ying Qin, Yao Qian, Anastassia Loukina, Patrick L. Lange, Abhinav Misra, Keelan Evanini, Tan Lee: Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output. ISCSLP 2021: 1-5
- [c87] Xinhao Wang, Keelan Evanini, Yao Qian, Matthew Mulholland: Automated Scoring of Spontaneous Speech from Young Learners of English Using Transformers. SLT 2021: 705-712
- [i14] Chengyi Wang, Yu Wu, Yao Qian, Ken'ichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang: UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data. CoRR abs/2101.07597 (2021)
- [i13] Yao Qian, Ximo Bian, Yu Shi, Naoyuki Kanda, Leo Shen, Zhen Xiao, Michael Zeng: Speech-language Pre-training for End-to-end Spoken Language Understanding. CoRR abs/2102.06283 (2021)
- [i12] Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu: Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition. CoRR abs/2110.04934 (2021)
- [i11] Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu: UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training. CoRR abs/2110.05752 (2021)
- [i10] Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng: Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification. CoRR abs/2110.05777 (2021)
- [i9] Junyi Ao, Rui Wang, Long Zhou, Shujie Liu, Shuo Ren, Yu Wu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei: SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing. CoRR abs/2110.07205 (2021)
- [i8] Rimita Lahiri, Ken'ichi Kumatani, Eric Sun, Yao Qian: Multilingual Speech Recognition using Knowledge Transfer across Learning Processes. CoRR abs/2110.07909 (2021)
- [i7] Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng: Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding. CoRR abs/2110.12138 (2021)
- [i6] Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei: WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021)
- [i5] Heming Wang, Yao Qian, Xiaofei Wang, Yiming Wang, Chengyi Wang, Shujie Liu, Takuya Yoshioka, Jinyu Li, DeLiang Wang: Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction. CoRR abs/2110.15430 (2021)
- [i4] Chengyi Wang, Yu Wu, Sanyuan Chen, Shujie Liu, Jinyu Li, Yao Qian, Zhenglu Yang: Self-Supervised Learning for speech recognition with Intermediate layer supervision. CoRR abs/2112.08778 (2021)
- 2020
- [j15] Yao Qian, Rutuja Ubale, Patrick L. Lange, Keelan Evanini, Vikram Ramanarayanan, Frank K. Soong: Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications. J. Signal Process. Syst. 92(8): 805-817 (2020)
- [c86] Yao Qian, Yu Shi, Michael Zeng: Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-Oriented Spoken Dialog. INTERSPEECH 2020: 3915-3919
2010 – 2019
- 2019
- [j14] Peng Cao, Yao Qian, Pan Xue, Danzhu Lu, Jie He, Zhiliang Hong: A Bipolar-Input Thermoelectric Energy-Harvesting Interface With Boost/Flyback Hybrid Converter and On-Chip Cold Starter. IEEE J. Solid State Circuits 54(12): 3362-3374 (2019)
- [c85] Rutuja Ubale, Vikram Ramanarayanan, Yao Qian, Keelan Evanini, Chee Wee Leong, Chong Min Lee: Native Language Identification from Raw Waveforms Using Deep Convolutional Neural Networks with Attentive Pooling. ASRU 2019: 403-410
- [c84] Xinhao Wang, Keelan Evanini, Yao Qian, Klaus Zechner: Using Very Deep Convolutional Neural Networks to Automatically Detect Plagiarized Spoken Responses. ASRU 2019: 764-771
- [c83] Xinhao Wang, Keelan Evanini, Matthew Mulholland, Yao Qian, James V. Bruno: Application of an Automatic Plagiarism Detection System in a Large-scale Assessment of English Speaking Proficiency. BEA@ACL 2019: 435-443
- [c82] Yao Qian, Patrick L. Lange, Keelan Evanini, Robert A. Pugh, Rutuja Ubale, Matthew Mulholland, Xinhao Wang: Neural Approaches to Automated Speech Scoring of Monologue and Dialogue Responses. ICASSP 2019: 8112-8116
- [c81] Chee Wee Leong, Katrina Roohr, Vikram Ramanarayanan, Michelle P. Martin-Raugh, Harrison Kell, Rutuja Ubale, Yao Qian, Zydrune Mladineo, Laura McCulla: Are Humans Biased in Assessment of Video Interviews? ICMI (Adjunct) 2019: 9:1-9:5
- [c80] Anastassia Loukina, Beata Beigman Klebanov, Patrick L. Lange, Yao Qian, Binod Gyawali, Nitin Madnani, Abhinav Misra, Klaus Zechner, Zuowei Wang, John Sabatini: Automated Estimation of Oral Reading Fluency During Summer Camp e-Book Reading with MyTurnToRead. INTERSPEECH 2019: 21-25
- [c79] Xinhao Wang, Su-Youn Yoon, Keelan Evanini, Klaus Zechner, Yao Qian: Automatic Detection of Off-Topic Spoken Responses Using Very Deep Convolutional Neural Networks. INTERSPEECH 2019: 4200-4204
- [c78] Peng Cao, Yao Qian, Pan Xue, Danzhu Lu, Jie He, Zhiliang Hong: An 84% Peak Efficiency Bipolar-Input Boost/Flyback Hybrid Converter With MPPT and on-Chip Cold Starter for Thermoelectric Energy Harvesting. ISSCC 2019: 420-422
- [c77] Vikram Ramanarayanan, Matthew Mulholland, Yao Qian: Scoring Interactional Aspects of Human-Machine Dialog for Language Learning and Assessment using Text Features. SIGdial 2019: 103-109
- [i3] Chee Wee Leong, Katrina Roohr, Vikram Ramanarayanan, Michelle P. Martin-Raugh, Harrison Kell, Rutuja Ubale, Yao Qian, Zydrune Mladineo, Laura McCulla: To Trust, or Not to Trust? A Study of Human Bias in Automated Video Interview Assessments. CoRR abs/1911.13248 (2019)
- 2018
- [j13] Yao Qian, Danzhu Lu, Jie He, Zhiliang Hong: An On-Chip Transformer-Based Self-Startup Hybrid SIDITO Converter for Thermoelectric Energy Harvesting. IEEE Trans. Circuits Syst. II Express Briefs 65-II(11): 1673-1677 (2018)
- [c76] Lei Chen, Jidong Tao, Shabnam Ghaffarzadegan, Yao Qian: End-to-End Neural Network Based Automated Speech Scoring. ICASSP 2018: 6234-6238
- [c75] Keelan Evanini, Matthew Mulholland, Rutuja Ubale, Yao Qian, Robert A. Pugh, Vikram Ramanarayanan, Aoife Cahill: Improvements to an Automated Content Scoring System for Spoken CALL Responses: the ETS Submission to the Second Spoken CALL Shared Task. INTERSPEECH 2018: 2379-2383
- [c74] Zhaoheng Ni, Rutuja Ubale, Yao Qian, Michael I. Mandel, Su-Youn Yoon, Abhinav Misra, David Suendermann-Oeft: Unusable Spoken Response Detection with BLSTM Neural Networks. ISCSLP 2018: 255-259
- [c73] Yao Qian, Rutuja Ubale, Patrick L. Lange, Keelan Evanini, Frank K. Soong: From Speech Signals to Semantics - Tagging Performance at Acoustic, Phonetic and Word Levels. ISCSLP 2018: 280-284
- [c72] Vikram Ramanarayanan, Robert Pugh, Yao Qian, David Suendermann-Oeft: Automatic Turn-Level Language Identification for Code-Switched Spanish-English Dialog. IWSDS 2018: 51-61
- [c71] Rutuja Ubale, Yao Qian, Keelan Evanini: Exploring End-To-End Attention-Based Neural Networks For Native Language Identification. SLT 2018: 84-91
- [c70] Yao Qian, Rutuja Ubale, Matthew Mulholland, Keelan Evanini, Xinhao Wang: A Prompt-Aware Neural Network Approach to Content-Based Scoring of Non-Native Spontaneous Speech. SLT 2018: 979-986
- 2017
- [j12] Yao Qian, Hongguang Zhang, Yanqin Chen, Yajie Qin, Danzhu Lu, Zhiliang Hong: A SIDIDO DC-DC Converter With Dual-Mode and Programmable-Capacitor-Array MPPT Control for Thermoelectric Energy Harvesting. IEEE Trans. Circuits Syst. II Express Briefs 64-II(8): 952-956 (2017)
- [c69] Yao Qian, Rutuja Ubale, Vikram Ramanarayanan, Patrick L. Lange, David Suendermann-Oeft, Keelan Evanini, Eugene Tsuprun: Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system. ASRU 2017: 569-576
- [c68] Yao Qian, Keelan Evanini, Patrick L. Lange, Robert A. Pugh, Rutuja Ubale, Frank K. Soong: Improving native language (L1) identification with better VAD and TDNN trained separately on native and non-native English corpora. ASRU 2017: 606-613
- [c67] Shervin Malmasi, Keelan Evanini, Aoife Cahill, Joel R. Tetreault, Robert A. Pugh, Christopher Hamill, Diane Napolitano, Yao Qian: A Report on the 2017 Native Language Identification Shared Task. BEA@EMNLP 2017: 62-75
- [c66] Yao Qian, Keelan Evanini, Xinhao Wang, Chong Min Lee, Matthew Mulholland: Bidirectional LSTM-RNN for Improving Automated Assessment of Non-Native Children's Speech. INTERSPEECH 2017: 1417-1421
- [c65] Yao Qian, Keelan Evanini, Xinhao Wang, David Suendermann-Oeft, Robert A. Pugh, Patrick L. Lange, Hillary R. Molloy, Frank K. Soong: Improving Sub-Phone Modeling for Better Native Language Identification with Non-Native English Speech. INTERSPEECH 2017: 2586-2590
- [c64] Keelan Evanini, Matthew Mulholland, Eugene Tsuprun, Yao Qian: Using an Automated Content Scoring Engine for Spoken CALL Responses: The ETS submission for the Spoken CALL Challenge. SLaTE 2017: 97-102
- [c63] Anastassia Loukina, Beata Beigman Klebanov, Patrick L. Lange, Binod Gyawali, Yao Qian: Developing speech processing technologies for shared book reading with a computer. WOCCI 2017: 46-51
- 2016
- [j11] Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai: Modeling F0 trajectories in hierarchically structured deep neural networks. Speech Commun. 76: 82-92 (2016)
- [c62] Yuchen Fan, Yao Qian, Frank K. Soong, Lei He: Unsupervised speaker adaptation for DNN-based TTS synthesis. ICASSP 2016: 5135-5139
- [c61] Yuchen Fan, Yao Qian, Frank K. Soong, Lei He: Speaker and language factorization in DNN-based TTS synthesis. ICASSP 2016: 5540-5544
- [c60] Matthew Mulholland, Melissa Lopez, Keelan Evanini, Anastassia Loukina, Yao Qian: A comparison of ASR and human errors for transcription of non-native spontaneous speech. ICASSP 2016: 5855-5859
- [c59] Yao Qian, Xinhao Wang, Keelan Evanini, David Suendermann-Oeft: Self-Adaptive DNN for Improving Spoken Language Proficiency Assessment. INTERSPEECH 2016: 3122-3126
- [c58] Yao Qian, Jidong Tao, David Suendermann-Oeft, Keelan Evanini, Alexei V. Ivanov, Vikram Ramanarayanan: Noise and Metadata Sensitive Bottleneck Features for Improving Speaker Recognition with Non-Native Speech Input. INTERSPEECH 2016: 3648-3652
- [c57] Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao: Learning Distributed Word Representations For Bidirectional LSTM Recurrent Neural Network. HLT-NAACL 2016: 527-533
- [c56] Yao Qian, Xinhao Wang, Keelan Evanini, David Suendermann-Oeft: Improving DNN-Based Automatic Recognition of Non-native Children Speech with Adult Speech. WOCCI 2016: 40-44
- 2015
- [j10] Wenping Hu, Yao Qian, Frank K. Soong, Yong Wang: Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers. Speech Commun. 67: 154-166 (2015)
- [c55] Zhou Yu, Vikram Ramanarayanan, David Suendermann-Oeft, Xinhao Wang, Klaus Zechner, Lei Chen, Jidong Tao, Aliaksei Ivanou, Yao Qian: Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech. ASRU 2015: 338-345
- [c54] Yuchen Fan, Yao Qian, Frank K. Soong, Lei He: Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis. ICASSP 2015: 4475-4479
- [c53] Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao: Word embedding for recurrent neural network based TTS synthesis. ICASSP 2015: 4879-4883
- [c52] Yuchen Fan, Yao Qian, Frank K. Soong, Lei He: Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis. INTERSPEECH 2015: 864-868
- [c51] Wenping Hu, Yao Qian, Frank K. Soong: An improved DNN-based approach to mispronunciation detection and diagnosis of L2 learners' speech. SLaTE 2015: 71-76
- [i2] Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao: Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. CoRR abs/1510.06168 (2015)
- [i1] Peilu Wang, Yao Qian, Frank K. Soong, Lei He, Hai Zhao: A Unified Tagging Solution: Bidirectional LSTM Recurrent Neural Network with Word Embedding. CoRR abs/1511.00215 (2015)
- 2014
- [j9] Weixun Gao, Qiying Cao, Yao Qian: Cross-Dialectal Voice Conversion with Neural Networks. IEICE Trans. Inf. Syst. 97-D(11): 2872-2880 (2014)
- [c50] Wenping Hu, Yao Qian, Frank K. Soong: A DNN-based acoustic modeling of tonal language and its application to Mandarin pronunciation training. ICASSP 2014: 3206-3210
- [c49] Yao Qian, Yuchen Fan, Wenping Hu, Frank K. Soong: On the training aspects of Deep Neural Network (DNN) for parametric TTS synthesis. ICASSP 2014: 3829-3833
- [c48] Yuchen Fan, Yao Qian, Feng-Long Xie, Frank K. Soong: TTS synthesis with bidirectional LSTM based recurrent neural networks. INTERSPEECH 2014: 1964-1968
- [c47] Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai: Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree. INTERSPEECH 2014: 2273-2277
- [c46] Feng-Long Xie, Yao Qian, Yuchen Fan, Frank K. Soong, Haifeng Li: Sequence error (SE) minimization training of neural network for voice conversion. INTERSPEECH 2014: 2283-2287
- [c45] Feng-Long Xie, Yao Qian, Frank K. Soong, Haifeng Li: Pitch transformation in neural network based voice conversion. ISCSLP 2014: 197-200
- [c44] Wenping Hu, Yao Qian, Frank K. Soong: A new Neural Network based logistic regression classifier for improving mispronunciation detection of L2 language learners. ISCSLP 2014: 245-249
- [c43] Danzhu Lu, Yao Qian, Zhiliang Hong: 4.3 An 87%-peak-efficiency DVS-capable single-inductor 4-output DC-DC buck converter with ripple-based adaptive off-time control. ISSCC 2014: 82-83
- [c42] Changqin Quan, Yao Qian, Fuji Ren: Dynamic facial expression recognition based on K-order emotional intensity model. ROBIO 2014: 1164-1168
- 2013
- [j8] Yao Qian, Frank K. Soong, Zhi-Jie Yan: A Unified Trajectory Tiling Approach to High Quality Speech Rendering. IEEE Trans. Speech Audio Process. 21(2): 280-290 (2013)
- [c41] Qiuli Li, Yao Qian, Danzhu Lu, Zhiliang Hong: VCCS controlled LDO with small on-chip capacitor. ASICON 2013: 1-4
- [c40] Yao Qian, Frank K. Soong, Xiaobo Zhou, Yundi Qian, Xiaotian Zhang: A fast table lookup based, statistical model driven non-uniform unit selection TTS. ICASSP 2013: 7957-7961
- [c39] Wenping Hu, Yao Qian, Frank K. Soong: A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL). INTERSPEECH 2013: 1886-1890
- [c38] Yao Qian, Fuji Ren, Changqin Quan: A new preprocessing algorithm and local binary pattern based facial expression recognition. SII 2013: 239-244
- 2012
- [j7] Lijuan Wang, Yao Qian, Matthew R. Scott, Gang Chen, Frank K. Soong: Computer-Assisted Audiovisual Language Learning. Computer 45(6): 38-47 (2012)
- [c37] Ji He, Yao Qian, Frank K. Soong, Sheng Zhao: Turning a Monolingual Speaker into Multilingual for a Mixed-language TTS. INTERSPEECH 2012: 963-966
- [c36] Yao Qian, Frank K. Soong: A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation. ISCSLP 2012: 165-169
- [c35] Xiaotian Zhang, Yao Qian, Hai Zhao, Frank K. Soong: Break index labeling of mandarin text via syntactic-to-prosodic tree mapping. ISCSLP 2012: 256-260
- [c34] Wenping Hu, Yao Qian, Frank K. Soong: Pitch accent detection and prediction with DCT features and CRF model. ISCSLP 2012: 266-270
- [c33] Darren Edge, Kai-Yin Cheng, Michael Whitney, Yao Qian, Zhijie Yan, Frank K. Soong: Tip tap tones: mobile microtraining of mandarin sounds. Mobile HCI (Companion) 2012: 215-216
- [c32] Darren Edge, Kai-Yin Cheng, Michael Whitney, Yao Qian, Zhijie Yan, Frank K. Soong: Tip tap tones: mobile microtraining of mandarin sounds. Mobile HCI 2012: 427-430
- 2011
- [j6] Yao Qian, Zhizheng Wu, Boyang Gao, Frank K. Soong: Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units. IEEE Trans. Speech Audio Process. 19(6): 1702-1710 (2011)
- [c31] Aki Kunikoshi, Yao Qian, Frank K. Soong, Nobuaki Minematsu: Improved F0 modeling and generation in voice conversion. ICASSP 2011: 4568-4571
- [c30] Yao Qian, Ji Xu, Frank K. Soong: A frame mapping based HMM approach to cross-lingual voice transformation. ICASSP 2011: 5120-5123
- [c29] Bo Peng, Yao Qian, Frank K. Soong, Bo Zhang: A New Phonetic Candidate Generator for Improving Search Query Efficiency. INTERSPEECH 2011: 1117-1120
- 2010
- [c28] Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Guoliang Zhang, Lijuan Wang: An HMM Trajectory Tiling (HTT) Approach to High Quality TTS - Microsoft Entry to Blizzard Challenge 2010. Blizzard Challenge 2010
- [c27] Qingqing Zhang, Frank K. Soong, Yao Qian, Zhijie Yan, Jielin Pan, Yonghong Yan: Improved modeling for F0 generation and V/U decision in HMM-based TTS. ICASSP 2010: 4606-4609
- [c26] Zhi-Jie Yan, Yao Qian, Frank K. Soong: Rich-context Unit Selection (RUS) approach to high quality TTS. ICASSP 2010: 4798-4801
- [c25] Yao Qian, Zhi-Jie Yan, Yi-Jian Wu, Frank K. Soong, Xin Zhuang, Shengyi Kong: An HMM trajectory tiling (HTT) approach to high quality TTS. INTERSPEECH 2010: 422-425
- [c24] Xin Zhuang, Yao Qian, Frank K. Soong, Yi-Jian Wu, Bo Zhang: Formant-based frequency warping for improving speaker adaptation in HMM TTS. INTERSPEECH 2010: 817-820
- [c23] Yao Qian, Zhizheng Wu, Xuezhe Ma, Frank K. Soong: Automatic prosody prediction and detection with Conditional Random Field (CRF) models. ISCSLP 2010: 135-138
2000 – 2009
- 2009
- [j5] Yao Qian, Frank K. Soong: A Multi-Space Distribution (MSD) and two-stream tone modeling approach to Mandarin speech recognition. Speech Commun. 51(12): 1169-1179 (2009)
- [j4] Yao Qian, Hui Liang, Frank K. Soong: A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS. IEEE Trans. Speech Audio Process. 17(6): 1231-1239 (2009)
- [c22] Yao Qian, Zhizheng Wu, Frank K. Soong: Improved prosody generation by maximizing joint likelihood of state and longer units. ICASSP 2009: 3781-3784
- [c21] Yining Chen, Yang Jiao, Yao Qian, Frank K. Soong: State mapping for cross-language speaker adaptation in TTS. ICASSP 2009: 4273-4276
- [c20] Yao Qian, Frank K. Soong, Miaomiao Wang, Zhizheng Wu: A minimum V/U error approach to F0 generation in HMM-based TTS. INTERSPEECH 2009: 408-411
- [c19] Zhi-Jie Yan, Yao Qian, Frank K. Soong: Rich context modeling for high quality HMM-based TTS. INTERSPEECH 2009: 1755-1758
- 2008
- [j3] Yao Qian, Frank K. Soong, Tan Lee: Tone-enhanced generalized character posterior probability (GCPP) for Cantonese LVCSR. Comput. Speech Lang. 22(4): 360-373 (2008)
- [c18] Hui Liang, Yao Qian, Frank K. Soong, Gongshen Liu: A cross-language state mapping approach to bilingual (Mandarin-English) TTS. ICASSP 2008: 4641-4644
- [c17] Yu Ting Yeung, Yao Qian, Tan Lee, Frank K. Soong: Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech. INTERSPEECH 2008: 1133-1136
- [c16] Yao Qian, Hui Liang, Frank K. Soong: Generating natural F0 trajectory with additive trees. INTERSPEECH 2008: 2126-2129
- [c15] Boyang Gao, Yao Qian, Zhizheng Wu, Frank K. Soong: Duration refinement by jointly optimizing state and longer unit likelihood. INTERSPEECH 2008: 2266-2269
- [c14] Lijuan Wang, Xiaojun Qian, Lei Ma, Yao Qian, Yining Chen, Frank K. Soong: A real-time text to audio-visual speech synthesis system. INTERSPEECH 2008: 2338-2341
- [c13] Yao Qian, Houwei Cao, Frank K. Soong: HMM-Based Mixed-Language (Mandarin-English) Speech Synthesis. ISCSLP 2008: 13-16
- [c12] Zhizheng Wu, Yao Qian, Frank K. Soong, Bo Zhang: Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech. ISCSLP 2008: 121-124
- 2007
- [c11] Sheng Qiang, Yao Qian, Frank K. Soong, Congfu Xu: Robust F0 modeling for Mandarin speech recognition in noise. INTERSPEECH 2007: 1801-1804
- [c10] Hui Liang, Yao Qian, Frank K. Soong: An HMM-based bilingual (Mandarin-English) TTS. SSW 2007: 137-142
- 2006
- [c9] Yao Qian, Frank K. Soong, Tan Lee: Tone-Enhanced Generalized Character Posterior Probability (GCPP) for Cantonese LVCSR. ICASSP (1) 2006: 133-136
- [c8] Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han: A multi-space distribution (MSD) approach to speech recognition of tonal languages. INTERSPEECH 2006
- [c7] Yao Qian, Frank K. Soong, Yining Chen, Min Chu: An HMM-Based Mandarin Chinese Text-To-Speech System. ISCSLP (Selected Papers) 2006: 223-232
- [c6] Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han: Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models. ISCSLP (Selected Papers) 2006: 445-453
- 2004
- [j2] Yujia Li, Tan Lee, Yao Qian: Analysis and modeling of F0 contours for cantonese text-to-speech. ACM Trans. Asian Lang. Inf. Process. 3(3): 169-180 (2004)
- [c5] Yao Qian, Tan Lee, Frank K. Soong: Tone information as a confidence measure for improving Cantonese LVCSR. INTERSPEECH 2004: 1965-1968
- 2003
- [c4] Yao Qian, Tan Lee, Yujia Li: Overlapped di-tone modeling for tone recognition in continuous Cantonese speech. INTERSPEECH 2003: 1845-1848
- 2002
- [c3] Yao Qian, Fang Chen: Assigning phrase accent to Chinese Text-to-Speech system. ICASSP 2002: 485-488
- [c2] Yujia Li, Tan Lee, Yao Qian: Acoustical F0 analysis of continuous cantonese speech. ISCSLP 2002
- 2001
- [j1] Min Chu, Yao Qian: Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts. Int. J. Comput. Linguistics Chin. Lang. Process. 6(1) (2001)
- [c1] Yao Qian, Min Chu, Hu Peng: Segmenting unrestricted Chinese text into prosodic words instead of lexical words. ICASSP 2001: 825-828
last updated on 2024-10-10 22:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license