Yong Zhao 0008
Person information
- affiliation: Microsoft Corporation, Redmond, WA, USA
- affiliation (former): Georgia Institute of Technology, Atlanta, GA, USA
Other persons with the same name
- Yong Zhao — disambiguation page
- Yong Zhao 0001 — Nazarbayev University, Astana, Kazakhstan (and 1 more)
- Yong Zhao 0002 — University of Oregon, College of Education, OR, USA
- Yong Zhao 0003 — University of Sheffield, Department of Electronic & Electrical Engineering, UK
- Yong Zhao 0004 — Ocean University of China, School of Mathematical Sciences, Qingdao, China (and 1 more)
- Yong Zhao 0005 — Northeastern University, College of Information Science and Engineering, Shenyang, China (and 3 more)
- Yong Zhao 0006 — Michigan State University, East Lansing, MI, USA
- Yong Zhao 0007 — Chang'an University, Mechanical Department, Xi'an, China
- Yong Zhao 0009 — University of Electronic Science & Technology of China, Chengdu, China (and 1 more)
- Yong Zhao 0010 — Peking University, Shenzhen Graduate School, Key Laboratory of Integrated Microsystems, China (and 5 more)
- Yong Zhao 0011 — Kyoto Institute of Technology, Japan
- Yong Zhao 0012 — China Institute of Water Resources and Hydropower Research, Beijing, China
- Yong Zhao 0013 — Sun Yat-sen University, School of Life Sciences, Guangzhou, China
- Yong Zhao 0014 — University of South Carolina, Department of Computer Science and Engineering, Columbia, SC, USA (and 1 more)
2020 – today
- 2023
- [c42] Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu:
Improving Transformer-Based Networks with Locality for Automatic Speaker Verification. ICASSP 2023: 1-5
- [i10] Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu:
Improving Transformer-based Networks With Locality For Automatic Speaker Verification. CoRR abs/2302.08639 (2023)
- 2022
- [i9] Gang Liu, Tianyan Zhou, Yong Zhao, Yu Wu, Zhuo Chen, Yao Qian, Jian Wu:
The Microsoft System for VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.11266 (2022)
- 2021
- [c41] Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020. ICASSP 2021: 5824-5828
- [c40] Tianyan Zhou, Yong Zhao, Jian Wu:
ResNeXt and Res2Net Structures for Speaker Verification. SLT 2021: 301-307
- 2020
- [c39] Yong Zhao, Tianyan Zhou, Zhuo Chen, Jian Wu:
Improving Deep CNN Networks with Long Temporal Context for Text-Independent Speaker Verification. ICASSP 2020: 6834-6838
- [i8] Tianyan Zhou, Yong Zhao, Jian Wu:
ResNeXt and Res2Net Structure for Speaker Verification. CoRR abs/2007.02480 (2020)
- [i7] Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020. CoRR abs/2010.11458 (2020)
2010 – 2019
- 2019
- [c38] Takuya Yoshioka, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Igor Abramovski, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang:
Advances in Online Audio-Visual Meeting Transcription. ASRU 2019: 276-283
- [c37] Tianyan Zhou, Yong Zhao, Jinyu Li, Yifan Gong, Jian Wu:
CNN with Phonetic Attention for Text-Independent Speaker Verification. ASRU 2019: 718-725
- [c36] Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. ICASSP 2019: 6216-6220
- [c35] Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-student Learning. ICASSP 2019: 6445-6449
- [i6] Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong:
Speaker Adaptation for End-to-End CTC Models. CoRR abs/1901.01239 (2019)
- [i5] Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-Student Learning. CoRR abs/1904.12399 (2019)
- [i4] Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. CoRR abs/1904.12406 (2019)
- [i3] Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou:
Advances in Online Audio-Visual Meeting Transcription. CoRR abs/1912.04979 (2019)
- 2018
- [c34] Liping Chen, Yong Zhao, Shi-Xiong Zhang, Jie Li, Guoli Ye, Frank K. Soong:
Exploring Sequential Characteristics in Speaker Bottleneck Feature for Text-Dependent Speaker Verification. ICASSP 2018: 5364-5368
- [c33] Yong Zhao, Jinyu Li, Shi-Xiong Zhang, Liping Chen, Yifan Gong:
Domain and Speaker Adaptation for Cortana Speech Recognition. ICASSP 2018: 5984-5988
- [c32] Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong:
Speaker Adaptation for End-to-End CTC Models. SLT 2018: 542-549
- [i2] Zhong Meng, Jinyu Li, Zhuo Chen, Yong Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training via Adversarial Learning. CoRR abs/1804.00732 (2018)
- 2017
- [j5] Yong Zhao, Biing-Hwang Fred Juang:
A comparative study of noise estimation algorithms for nonlinear compensation in robust speech recognition. Speech Commun. 89: 58-69 (2017)
- [c31] Yong Zhao, Jinyu Li, Kshitiz Kumar, Yifan Gong:
Extended low-rank plus diagonal adaptation for deep and recurrent neural networks. ICASSP 2017: 5040-5044
- [p1] Yifan Gong, Yan Huang, Kshitiz Kumar, Jinyu Li, Chaojun Liu, Guoli Ye, Shi-Xiong Zhang, Yong Zhao, Rui Zhao:
Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 401-417
- [i1] Shi-Xiong Zhang, Zhuo Chen, Yong Zhao, Jinyu Li, Yifan Gong:
End-to-End Attention based Text-Dependent Speaker Verification. CoRR abs/1701.00562 (2017)
- 2016
- [c30] Yong Zhao, Jinyu Li, Yifan Gong:
Low-rank plus diagonal adaptation for deep neural networks. ICASSP 2016: 5005-5009
- [c29] Shi-Xiong Zhang, Zhuo Chen, Yong Zhao, Jinyu Li, Yifan Gong:
End-to-End attention based text-dependent speaker verification. SLT 2016: 171-178
- 2015
- [c28] Yong Zhao, Jinyu Li, Jian Xue, Yifan Gong:
Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data. ICASSP 2015: 4310-4314
- [c27] Kshitiz Kumar, Ziad Al Bawab, Yong Zhao, Chaojun Liu, Benoît Dumoulin, Yifan Gong:
Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation. INTERSPEECH 2015: 702-706
- 2013
- [b1] Yong Zhao:
Nonlinear compensation and heterogeneous data modeling for robust speech recognition. Georgia Institute of Technology, Atlanta, GA, USA, 2013
- [c26] Yong Zhao, Biing-Hwang Juang:
Modeling heterogeneous data sources for speech recognition using synchronous hidden Markov models. ICASSP 2013: 7403-7407
- 2012
- [j4] Qiang Fu, Yong Zhao, Biing-Hwang Juang:
Automatic Speech Recognition Based on Non-Uniform Error Criteria. IEEE Trans. Speech Audio Process. 20(3): 780-793 (2012)
- [j3] Yong Zhao, Biing-Hwang Juang:
Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition. IEEE Trans. Speech Audio Process. 20(8): 2191-2206 (2012)
- [c25] Yong Zhao, Biing-Hwang Juang:
Exploiting sparsity in stranded hidden Markov models for automatic speech recognition. ACSCC 2012: 1623-1625
- [c24] Yong Zhao, Andrej Ljolje, Diamantino Caseiro, Biing-Hwang Juang:
A general discriminative training algorithm for speech recognition using weighted finite-state transducers. ICASSP 2012: 4217-4220
- [c23] Yong Zhao, Biing-Hwang Juang:
Stranded Gaussian mixture hidden Markov models for robust speech recognition. ICASSP 2012: 4301-4304
- 2011
- [c22] Yong Zhao, Biing-Hwang Juang:
Non-linear noise compensation for robust speech recognition using Gauss-Newton method. ICASSP 2011: 4796-4799
- 2010
- [c21] Yong Zhao, Biing-Hwang Juang:
On noise estimation for robust speech recognition using vector Taylor series. ICASSP 2010: 4290-4293
- [c20] Yong Zhao, Biing-Hwang Juang:
A comparative study of noise estimation algorithms for VTS-based robust speech recognition. INTERSPEECH 2010: 2090-2093
2000 – 2009
- 2009
- [c19] Yong Zhao, Sunghwan Shin, Enrique Robledo-Arnuncio, Biing-Hwang Juang:
A study on recognizing distorted speech over local distributed transducer networks. ICASSP 2009: 4181-4184
- 2007
- [c18] Xinqiang Ni, Yining Chen, Min Chu, Frank K. Soong, Yong Zhao, Ping Zhang:
Agreement Learning for Automatic Accent Annotation. ICASSP (4) 2007: 829-832
- [c17] Dacheng Lin, Yong Zhao, Frank K. Soong, Min Chu, Jieyu Zhao:
Iterative unit selection with unnatural prosody detection. INTERSPEECH 2007: 2909-2912
- [c16] Lijuan Wang, Min Chu, Yaya Peng, Yong Zhao, Frank K. Soong:
Perceptual annotation of expressive speech. SSW 2007: 46-51
- [c15] Yong Zhao, Chengsuo Zhang, Frank K. Soong, Min Chu, Xi Xiao:
Measuring attribute dissimilarity with HMM KL-divergence for speech synthesis. SSW 2007: 206-210
- 2006
- [j2] Lijuan Wang, Yong Zhao, Min Chu, Frank K. Soong, Jian-Lai Zhou, Zhigang Cao:
Context-Dependent Boundary Model for Refining Boundaries Segmentation of TTS Units. IEICE Trans. Inf. Syst. 89-D(3): 1082-1091 (2006)
- [j1] Min Chu, Yong Zhao, Eric Chang:
Modeling stylized invariance and local variability of prosody in text-to-speech synthesis. Speech Commun. 48(6): 716-726 (2006)
- [c14] Min Chu, Yining Chen, Yong Zhao, Yusheng Li, Frank K. Soong:
A Study on How Human Annotations Benefit the TTS Voice. Blizzard Challenge 2006
- [c13] Yong Zhao, Peng Liu, Yusheng Li, Yining Chen, Min Chu:
Measuring Target Cost in Unit Selection with Kl-Divergence Between Context-Dependent HMMS. ICASSP (1) 2006: 725-728
- [c12] Yining Chen, Jia-Li You, Min Chu, Yong Zhao, Jin-Lin Wang:
Identifying Language Origin of Person Names With N-Grams of Different Units. ICASSP (1) 2006: 729-732
- [c11] Min Lai, Yining Chen, Min Chu, Yong Zhao, Fangyu Hu:
A Hierarchical Approach to Automatic Stress Detection in English Sentences. ICASSP (1) 2006: 753-756
- [c10] Jia-Li You, Yining Chen, Min Chu, Yong Zhao, Jin-Lin Wang:
Identify language origin of personal names with normalized appearance number of web pages. INTERSPEECH 2006
- [c9] Yong Zhao, Di Peng, Lijuan Wang, Min Chu, Yining Chen, Peng Yu, Jun Guo:
Constructing stylistic synthesis databases from audio books. INTERSPEECH 2006
- [c8] Min Chu, Yong Zhao, Yining Chen, Lijuan Wang, Frank K. Soong:
The Paradigm for Creating Multi-lingual Text-To-Speech Voice Databases. ISCSLP (Selected Papers) 2006: 736-747
- 2005
- [c7] Lijuan Wang, Yong Zhao, Min Chu, Frank K. Soong, Zhigang Cao:
Phonetic transcription verification with generalized posterior probability. INTERSPEECH 2005: 1949-1952
- [c6] Yong Zhao, Lijuan Wang, Min Chu, Frank K. Soong, Zhigang Cao:
Refining phoneme segmentations using speaker-adaptive context dependent boundary models. INTERSPEECH 2005: 2557-2560
- [c5] Yining Chen, Yong Zhao, Min Chu:
Customizing base unit set with speech database in TTS systems. INTERSPEECH 2005: 2561-2564
- 2004
- [c4] Lijuan Wang, Yong Zhao, Min Chu, Jian-Lai Zhou, Zhigang Cao:
Refining segmental boundaries for TTS database using fine contextual-dependent boundary models. ICASSP (1) 2004: 641-644
- 2003
- [c3] Min Chu, Hu Peng, Yong Zhao, Zhengyu Niu, Eric Chang:
Microsoft Mulan - a bilingual TTS system. ICASSP (1) 2003: 264-267
- [c2] Yong Zhao, Min Chu, Hu Peng, Eric Chang:
Custom-tailoring TTS voice font - keeping the naturalness when reducing database size. INTERSPEECH 2003: 2957-2960
- 2002
- [c1] Hu Peng, Yong Zhao, Min Chu:
Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation. INTERSPEECH 2002: 2613-2616
last updated on 2024-12-23 20:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license