default search action
Chunyang Wu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c30]Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models. ICASSP 2024: 10216-10220 - [c29]Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of a Multilingual ASR Model. ICASSP 2024: 12201-12205 - [c28]Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. ICASSP 2024: 12406-12410 - [c27]Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691 - [c26]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355 - [c25]Shuyi Chen, Tian Pan, Chunyang Wu, Guohao Ruan, Jiao Zhang, Tao Huang, Yunjie Liu:
D-Router: Decoupled Content Routers with Remote Content Store. ICC 2024: 3016-3021 - [c24]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532 - [i23]Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024) - [i22]Frank Seide, Morrie Doulaty, Yangyang Shi, Yashesh Gaur, Junteng Jia, Chunyang Wu:
Speech ReaLLM - Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time. CoRR abs/2406.09569 (2024) - [i21]Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurélien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Rozière, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris McConnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-Olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab AlBadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Grégoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel M. Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer van der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, et al.:
The Llama 3 Herd of Models. CoRR abs/2407.21783 (2024) - [i20]Wonjune Kang, Junteng Jia, Chunyang Wu, Wei Zhou, Egor Lakomkin, Yashesh Gaur, Leda Sari, Suyoun Kim, Ke Li, Jay Mahadeokar, Ozlem Kalinli:
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech. CoRR abs/2410.01162 (2024) - [i19]Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. CoRR abs/2410.03752 (2024) - 2023
- [c23]Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5 - [c22]Qiang Qian, Chunyang Wu, Li Zhang, Shaojun Xie, Shian Guo:
Improved Repetitive Control with Enhanced Active Damping Method for 400Hz Inverter. IECON 2023: 1-6 - [c21]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245 - [i18]Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023) - [i17]Shuo Liu, Leda Sari, Chunyang Wu, Gil Keren, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli:
Towards Selection of Text-to-speech Data to Augment ASR Training. CoRR abs/2306.00998 (2023) - [i16]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023) - [i15]Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models. CoRR abs/2309.01947 (2023) - [i14]Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. CoRR abs/2309.10917 (2023) - [i13]Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli:
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model. CoRR abs/2309.13018 (2023) - [i12]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023) - 2022
- [j5]Chunyang Wu, Fei Zhu, Quan Liu:
Explore the weakness: Instructive exploration adversarial robust reinforcement learning. J. King Saud Univ. Comput. Inf. Sci. 34(10 Part A): 8152-8161 (2022) - [c20]Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281 - [i11]Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022) - 2021
- [c19]Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao:
Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications. ICASSP 2021: 6778-6782 - [c18]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition. ICASSP 2021: 6783-6787 - [c17]Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He:
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech 2021: 146-150 - [c16]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046 - [c15]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111 - [c14]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557 - [c13]Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer:
Streaming Attention-Based Models with Augmented Memory for End-To-End Speech Recognition. SLT 2021: 8-14 - [c12]Chunyang Wu, Bei-Wei Zhang, Jinhai Li:
Denoising method of ECG signal with power threshold function under wavelet transform and smoothing filter✱. WI/IAT (Workshop/Special Session) 2021: 327-333 - [i10]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021) - [i9]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021) - [i8]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021) - [i7]Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021) - 2020
- [c11]Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang:
Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory. INTERSPEECH 2020: 2132-2136 - [c10]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression for Transformer Based Speech Recognition. INTERSPEECH 2020: 4996-5000 - [i6]Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang:
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory. CoRR abs/2005.08042 (2020) - [i5]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression For Transformer Based Speech Recognition. CoRR abs/2005.09137 (2020) - [i4]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Michael L. Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition. CoRR abs/2010.10759 (2020) - [i3]Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao:
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications. CoRR abs/2010.14665 (2020) - [i2]Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer:
Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition. CoRR abs/2011.07120 (2020)
2010 – 2019
- 2018
- [b1]Chunyang Wu:
Structured deep neural networks for speech recognition. University of Cambridge, UK, 2018 - [j4]Chunyang Wu, Mark J. F. Gales, Anton Ragni, Penny Karanasou, Khe Chai Sim:
Improving Interpretability and Regularization in Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 256-265 (2018) - 2017
- [j3]Penny Karanasou, Chunyang Wu, Mark J. F. Gales, Philip C. Woodland:
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 818-828 (2017) - [j2]Lun Liu, Elisabete A. Silva, Chunyang Wu, Hui Wang:
A machine learning-based method for the large-scale evaluation of the qualities of the urban environment. Comput. Environ. Urban Syst. 65: 113-125 (2017) - [c9]Anton Ragni, Chunyang Wu, Mark J. F. Gales, J. Vasilakes, Kate M. Knill:
Stimulated training for automatic speech recognition and keyword search in limited resource conditions. ICASSP 2017: 4830-4834 - [c8]Chunyang Wu, Mark J. F. Gales:
Deep Activation Mixture Model for Speech Recognition. INTERSPEECH 2017: 1611-1615 - 2016
- [c7]Chunyang Wu, Penny Karanasou, Mark J. F. Gales:
Combining i-vector representation and structured neural networks for rapid adaptation. ICASSP 2016: 5000-5004 - [c6]Chunyang Wu, Penny Karanasou, Mark J. F. Gales, Khe Chai Sim:
Stimulated Deep Neural Network for Speech Recognition. INTERSPEECH 2016: 400-404 - [i1]Lun Liu, Hui Wang, Chunyang Wu:
A machine learning method for the large-scale evaluation of urban visual environment. CoRR abs/1608.03396 (2016) - 2015
- [j1]Fan Wu, Zuying Wei, Chunyang Wu, Guihai Chen, Bo Sheng:
Incentive-compatible adaptive-width channel allocation for non-cooperative wireless networks. Int. J. Sens. Networks 18(3/4): 227-237 (2015) - [c5]Shuo Zang, Hai Zhao, Chunyang Wu, Rui Wang:
A novel word reordering method for statistical machine translation. FSKD 2015: 843-848 - [c4]Chunyang Wu, Mark J. F. Gales:
Multi-basis adaptive neural network for rapid adaptation in speech recognition. ICASSP 2015: 4315-4319 - 2012
- [c3]Xiaotian Zhang, Chunyang Wu, Hai Zhao:
Chinese Coreference Resolution via Ordered Filtering. EMNLP-CoNLL Shared Task 2012: 95-99 - [c2]Chunyang Wu, Fan Wu, Guihai Chen, Bo Sheng:
AMPLE: A Novel Incentive Approach to Adaptive-Width Channel Allocation in Multi-hop, Non-cooperative Wireless Networks. WASA 2012: 43-59 - [c1]Chunyang Wu, Hai Zhao:
Regression with Phrase Indicators for Estimating MT Quality. WMT@NAACL-HLT 2012: 152-156
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-12 21:58 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint