default search action

combined dblp search
author search
venue search
publication search

ask others

Khe Chai Sim

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimHMSSM0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimHMSSM0S24
Khe Chai Sim, Zhouyuan Huo, Tsendsuren Munkhdalai, Nikhil Siddhartha, Adam Stooke, Zhong Meng, Bo Li, Tara N. Sainath:
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models. ICASSP 2024: 6900-6904
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GargHSSCAMKWMHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GargHSSCAMKWMHS24
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English with Audio Classification. ICASSP 2024: 12356-12360
[c112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangPSMHLS0QCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangPSMHLS0QCSZ24
Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-19709
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-19709
Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara N. Sainath, Pedro Moreno Mengibar:
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models. CoRR abs/2403.19709 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09173
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09173
Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar:
TransformerFAM: Feedback attention is working memory. CoRR abs/2404.09173 (2024)
2023
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SongWPCJVCHWSRS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SongWPCJVCHWSRS23
Gan Song, Zelin Wu, Golan Pundak, Angad Chandorkar, Kandarp Joshi, Xavier Velez, Diamantino Caseiro, Ben Haynor, Weiran Wang, Nikhil Siddhartha, Pat Rondon, Khe Chai Sim:
Contextual Spelling Correction with Large Language Models. ASRU 2023: 1-8
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuoSLHSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuoSLHSS23
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning from Speech Foundation Model Using Hierarchical Feature Fusion. ICASSP 2023: 1-5
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HwangSZS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HwangSZS23
Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman:
Comparison of Soft and Hard Target RNN-T Distillation for Large-Scale ASR. ICASSP 2023: 1-5
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiHHBPSSZHSB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHHBPSSZHSB23
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. ICASSP 2023: 1-5
[c107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuMRPSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuMRPSL23
Zelin Wu, Tsendsuren Munkhdalai, Pat Rondon, Golan Pundak, Khe Chai Sim, Christopher Li:
Dual-Mode NAM: Effective Top-K Context Injection for End-to-End ASR. INTERSPEECH 2023: 221-225
[c106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuoSHMSM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuoSHMSM23
Zhouyuan Huo, Khe Chai Sim, Dongseong Hwang, Tsendsuren Munkhdalai, Tara N. Sainath, Pedro Moreno Mengibar:
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods. INTERSPEECH 2023: 556-560
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-01496
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-01496
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. CoRR abs/2302.01496 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01789
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01789
Dongseong Hwang, Changwan Ryu, Khe Chai Sim:
Edit Distance based RL for RNNT decoding. CoRR abs/2306.01789 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09996
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09996
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English With Audio Classification. CoRR abs/2309.09996 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12963
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00178
Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-04627
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04627
Liam Collins, Shanshan Wu, Sewoong Oh, Khe Chai Sim:
Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning. CoRR abs/2310.04627 (2023)
2022
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ZhangPHQGSJXHWZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ZhangPHQGSJXHWZ22
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. IEEE J. Sel. Top. Signal Process. 16(6): 1519-1532 (2022)
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BaiLZBSSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaiLZBSSS22
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. ICASSP 2022: 6402-6406
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HwangMHSGQSSBH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HwangMHSGQSSBH22
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He:
Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning. ICASSP 2022: 6627-6631
[c103]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MunkhdalaiSCGCS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MunkhdalaiSCGCS22
Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Trevor Strohman, Françoise Beaufays:
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition. ICASSP 2022: 6632-6636
[c102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BreinerRVGMSGCM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BreinerRVGMSGCM22
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. INTERSPEECH 2022: 694-698
[c101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HwangSHS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HwangSHS22
Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo, Trevor Strohman:
Pseudo Label Is Better Than Human Label. INTERSPEECH 2022: 1421-1425
[c100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PundakMS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PundakMS22
Golan Pundak, Tsendsuren Munkhdalai, Khe Chai Sim:
On-the-fly ASR Corrections with Audio Exemplars. INTERSPEECH 2022: 3148-3152
[c99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuoHSGMSSB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuoHSGMSSB22
Zhouyuan Huo, Dongseong Hwang, Khe Chai Sim, Shefali Garg, Ananya Misra, Nikhil Siddhartha, Trevor Strohman, Françoise Beaufays:
Incremental Layer-Wise Self-Supervised Learning for Efficient Unsupervised Speech Domain Adaptation On Device. INTERSPEECH 2022: 4845-4849
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/QiuMHS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/QiuMHS22
David Qiu, Tsendsuren Munkhdalai, Yanzhang He, Khe Chai Sim:
Context-Aware Neural Confidence Estimation for Rare Word Speech Recognition. SLT 2022: 31-37
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MunkhdalaiWPSLRS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MunkhdalaiWPSLRS22
Tsendsuren Munkhdalai, Zelin Wu, Golan Pundak, Khe Chai Sim, Jiayang Li, Pat Rondon, Tara N. Sainath:
NAM+: Towards Scalable End-to-End Contextual Biasing for Adaptive ASR. SLT 2022: 190-196
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/StookeSCMS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/StookeSCMS22
Adam Stooke, Khe Chai Sim, Mason Chua, Tsendsuren Munkhdalai, Trevor Strohman:
Internal Language Model Personalization of E2E Automatic Speech Recognition Using Random Encoder Features. SLT 2022: 213-220
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-12668
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-12668
Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo, Trevor Strohman:
Pseudo Label Is Better Than Human Label. CoRR abs/2203.12668 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-00706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-00706
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. CoRR abs/2207.00706 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-03067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-03067
Sandy Ritchie, You-Chi Cheng, Mingqing Chen, Rajiv Mathews, Daan van Esch, Bo Li, Khe Chai Sim:
Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning. CoRR abs/2208.03067 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05793
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05793
Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman:
Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR. CoRR abs/2210.05793 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-02712
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02712
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion. CoRR abs/2211.02712 (2022)
2021
[c95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MisraHHGSNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MisraHHGSNS21
Ananya Misra, Dongseong Hwang, Zhouyuan Huo, Shefali Garg, Nikhil Siddhartha, Arun Narayanan, Khe Chai Sim:
A Comparison of Supervised and Unsupervised Pre-Training of End-to-End Models. Interspeech 2021: 731-735
[c94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimCGCMB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimCGCMB21
Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Tsendsuren Munkhdalai, Françoise Beaufays:
Robust Continuous On-Device Personalization for Automatic Speech Recognition. Interspeech 2021: 1284-1288
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-10259
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-10259
Katrin Tomanek, Françoise Beaufays, Julie Cattiau, Angad Chandorkar, Khe Chai Sim:
On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech. CoRR abs/2106.10259 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-13226
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-13226
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. CoRR abs/2109.13226 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-00155
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-00155
Zhouyuan Huo, Dongseong Hwang, Khe Chai Sim, Shefali Garg, Ananya Misra, Nikhil Siddhartha, Trevor Strohman, Françoise Beaufays:
Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device. CoRR abs/2110.00155 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-00165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-00165
Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He:
Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning. CoRR abs/2110.00165 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-02220
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02220
Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Trevor Strohman, Françoise Beaufays:
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition. CoRR abs/2110.02220 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-08137
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-08137
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. CoRR abs/2111.08137 (2021)
2020
[c93]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GooneratneSZKBM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GooneratneSZKBM20
Mary Gooneratne, Khe Chai Sim, Petr Zadrazil, Andreas Kabel, Françoise Beaufays, Giovanni Motta:
Low-Rank Gradient Approximation for Memory-Efficient on-Device Training of Deep Neural Network. ICASSP 2020: 3017-3021
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-08885
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-08885
Mary Gooneratne, Khe Chai Sim, Petr Zadrazil, Andreas Kabel, Françoise Beaufays, Giovanni Motta:
Low-rank Gradient Approximation For Memory-Efficient On-device Training of Deep Neural Network. CoRR abs/2001.08885 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c92]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SimJMZBBGKKLZZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SimJMZBBGKKLZZ19
Khe Chai Sim, Leif Johnson, Giovanni Motta, Lillian Zhou, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang:
Personalization of End-to-End Speech Recognition on Mobile Devices for Named Entities. ASRU 2019: 23-30
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeymannS019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeymannS019
Jahn Heymann, Khe Chai Sim, Bo Li:
Improving CTC Using Stimulated Learning for Sequence Modeling. ICASSP 2019: 5701-5705
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeSPMAZRKWPLBSL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeSPMAZRKWPLBSL19
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385
[c89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimZB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimZB19
Khe Chai Sim, Petr Zadrazil, Françoise Beaufays:
An Investigation into On-Device Personalization of End-to-End Automatic Speech Recognition Models. INTERSPEECH 2019: 774-778
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06678
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06678
Khe Chai Sim, Petr Zadrazil, Françoise Beaufays:
An Investigation Into On-device Personalization of End-to-end Automatic Speech Recognition Models. CoRR abs/1909.06678 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-09251
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-09251
Khe Chai Sim, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang, Leif Johnson, Giovanni Motta, Lillian Zhou:
Personalization of End-to-end Speech Recognition On Mobile Devices For Named Entities. CoRR abs/1912.09251 (2019)
2018
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuGRKS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuGRKS18
Chunyang Wu, Mark J. F. Gales, Anton Ragni, Penny Karanasou, Khe Chai Sim:
Improving Interpretability and Regularization in Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 256-265 (2018)
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KoppulaSC18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KoppulaSC18
Skanda Koppula, Khe Chai Sim, Kean K. Chin:
Understanding Recurrent Neural State Using Memory Signatures. ICASSP 2018: 2396-2400
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSSBWNCWR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSSBWNCWR18
Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yanghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model. ICASSP 2018: 4749-4753
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SamarakoonMS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SamarakoonMS18
Lahiru Samarakoon, Brian Mak, Khe Chai Sim:
learning Effective Factorized Hidden Layer Bases Using Student-Teacher Training for LSTM Acoustic Model Adaptation. ICASSP 2018: 5954-5958
[c85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimNMTPSHLB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimNMTPSHLB18
Khe Chai Sim, Arun Narayanan, Ananya Misra, Anshuman Tripathi, Golan Pundak, Tara N. Sainath, Parisa Haghani, Bo Li, Michiel Bacchiani:
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition. INTERSPEECH 2018: 892-896
[c84]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/NarayananMSPTEH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/NarayananMSPTEH18
Arun Narayanan, Ananya Misra, Khe Chai Sim, Golan Pundak, Anshuman Tripathi, Mohamed Elfeky, Parisa Haghani, Trevor Strohman, Michiel Bacchiani:
Toward Domain-Invariant Speech Recognition via Large Scale Training. SLT 2018: 441-447
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/BagbyRS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/BagbyRS18
Tom Bagby, Kanishka Rao, Khe Chai Sim:
Efficient Implementation of Recurrent Neural Network Transducer in Tensorflow. SLT 2018: 506-512
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-03816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-03816
Skanda Koppula, Khe Chai Sim, Kean K. Chin:
Understanding Recurrent Neural State Using Memory Signatures. CoRR abs/1802.03816 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-05312
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-05312
Arun Narayanan, Ananya Misra, Khe Chai Sim, Golan Pundak, Anshuman Tripathi, Mohamed Elfeky, Parisa Haghani, Trevor Strohman, Michiel Bacchiani:
Toward domain-invariant speech recognition via large scale training. CoRR abs/1808.05312 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06621
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018)
2017
[c82]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SimNBSB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SimNBSB17
Khe Chai Sim, Arun Narayanan, Tom Bagby, Tara N. Sainath, Michiel Bacchiani:
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow. ASRU 2017: 258-264
[c81]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SamarakoonSM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SamarakoonSM17
Lahiru Samarakoon, Khe Chai Sim, Brian Mak:
An investigation into learning effective speaker subspaces for robust unsupervised DNN adaptation. ICASSP 2017: 5035-5039
[c80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSNCBMSSPCSWWV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSNCBMSSPCSWWV17
Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403
[c79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SamarakoonMS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SamarakoonMS17
Lahiru Samarakoon, Brian Mak, Khe Chai Sim:
Learning Factorized Transforms for Unsupervised Adaptation of LSTM-RNN Acoustic Models. INTERSPEECH 2017: 744-748
[c78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimN17
Khe Chai Sim, Arun Narayanan:
An Efficient Phone N-Gram Forward-Backward Computation Using Dense Matrix Multiplication. INTERSPEECH 2017: 1646-1650
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/SimQMSKT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/SimQMSKT17
Khe Chai Sim, Yanmin Qian, Gautam Mantena, Lahiru Samarakoon, Souvik Kundu, Tian Tan:
Adaptation of Deep Neural Network Acoustic Models for Robust Automatic Speech Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 219-243
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-01541
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01541
Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model. CoRR abs/1712.01541 (2017)
2016
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/Sim16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/Sim16
Khe Chai Sim:
Sensitivity-Characterised Activity Neurogram (SCAN) for Visualising and Understanding the Inner Workings of Deep Neural Network. IEICE Trans. Inf. Syst. 99-D(10): 2423-2430 (2016)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SamarakoonS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SamarakoonS16
Lahiru Samarakoon, Khe Chai Sim:
Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2241-2250 (2016)
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KunduMQTDS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KunduMQTDS16
Souvik Kundu, Gautam Mantena, Yanmin Qian, Tian Tan, Marc Delcroix, Khe Chai Sim:
Joint acoustic factor learning for robust deep neural network based automatic speech recognition. ICASSP 2016: 5025-5029
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SamarakoonS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SamarakoonS16
Lahiru Samarakoon, Khe Chai Sim:
On combining i-vectors and discriminative adaptation methods for unsupervised speaker normalization in DNN acoustic models. ICASSP 2016: 5275-5279
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanQYKLSXZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanQYKLSXZ16
Tian Tan, Yanmin Qian, Dong Yu, Souvik Kundu, Liang Lu, Khe Chai Sim, Xiong Xiao, Yu Zhang:
Speaker-aware training of LSTM-RNNS for acoustic modelling. ICASSP 2016: 5280-5284
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanS16
Shawn Tan, Khe Chai Sim:
Towards implicit complexity control using variable-depth deep neural networks for automatic speech recognition. ICASSP 2016: 5965-5969
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKGS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKGS16
Chunyang Wu, Penny Karanasou, Mark J. F. Gales, Khe Chai Sim:
Stimulated Deep Neural Network for Speech Recognition. INTERSPEECH 2016: 400-404
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SamarakoonS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SamarakoonS16
Lahiru Samarakoon, Khe Chai Sim:
Subspace LHUC for Fast Adaptation of Deep Neural Network Acoustic Models. INTERSPEECH 2016: 1593-1597
[c71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KunduSG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KunduSG16
Souvik Kundu, Khe Chai Sim, Mark J. F. Gales:
Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition. INTERSPEECH 2016: 2359-2363
[c70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SamarakoonS16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SamarakoonS16a
Lahiru Samarakoon, Khe Chai Sim:
Multi-Attribute Factorized Hidden Layer Adaptation for DNN Acoustic Models. INTERSPEECH 2016: 3484-3488
[c69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PrasadS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PrasadS16
Animesh Prasad, Khe Chai Sim:
Microphone Distance Adaptation Using Cluster Adaptive Training for Robust Far Field Speech Recognition. INTERSPEECH 2016: 3823-3827
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TanS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TanS16
Shawn Tan, Khe Chai Sim:
Learning utterance-level normalisation using Variational Autoencoders for robust automatic speech recognition. SLT 2016: 43-49
[c67]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SamarakoonS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SamarakoonS16
Lahiru Samarakoon, Khe Chai Sim:
Low-rank bases for factorized hidden layer adaptation of DNN acoustic models. SLT 2016: 652-658
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MantenaS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MantenaS16
Gautam Mantena, Khe Chai Sim:
Entropy-based pruning of hidden units to reduce DNN parameters. SLT 2016: 672-679
2015
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/Sim15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/Sim15
Khe Chai Sim:
On constructing and analysing an interpretable brain model for the DNN based on hidden activity patterns. ASRU 2015: 22-29
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SamarakoonS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SamarakoonS15
Lahiru Samarakoon, Khe Chai Sim:
Learning factorized feature transforms for speaker normalization. ASRU 2015: 145-152
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/TanSG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/TanSG15
Shawn Tan, Khe Chai Sim, Mark J. F. Gales:
Improving the interpretability of deep neural networks with stimulated learning. ASRU 2015: 617-623
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangS15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangS15a
Hengguan Huang, Khe Chai Sim:
An investigation of augmenting speaker representations to improve speaker normalisation for DNN-based speech recognition. ICASSP 2015: 4610-4613
2014
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuS14
Shilin Liu, Khe Chai Sim:
Temporally Varying Weight Regression: A Semi-Parametric Trajectory Model for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 151-160 (2014)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiS14
Bo Li, Khe Chai Sim:
A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 22(8): 1296-1305 (2014)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangS14
Guangsen Wang, Khe Chai Sim:
Regression-Based Context-Dependent Modeling of Deep Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(11): 1660-1669 (2014)
[c61]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/WangNS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/WangNS14
Xuancong Wang, Hwee Tou Ng, Khe Chai Sim:
A Beam-Search Decoder for Disfluency Detection. COLING 2014: 1457-1467
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WangSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangSN14
Xuancong Wang, Khe Chai Sim, Hwee Tou Ng:
Combining Punctuation and Disfluency Prediction: An Empirical Study. EMNLP 2014: 121-130
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuS14
Shilin Liu, Khe Chai Sim:
On combining DNN and GMM with unsupervised speaker adaptation for robust automatic speech recognition. ICASSP 2014: 195-199
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiS14
Bo Li, Khe Chai Sim:
An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition. ICASSP 2014: 200-204
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BuQSYY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BuQSYY14
Suliang Bu, Yanmin Qian, Khe Chai Sim, Yongbin You, Kai Yu:
Second order vector taylor series based robust speech recognition. ICASSP 2014: 1769-1773
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangS14
Guangsen Wang, Khe Chai Sim:
Refinements of regression-based context-dependent modelling of deep neural networks for automatic speech recognition. ICASSP 2014: 3022-3026
[c55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS14
Bo Li, Khe Chai Sim:
Modeling long temporal contexts for robust DNN-based speech recognition. INTERSPEECH 2014: 353-357
[c54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuS14
Shilin Liu, Khe Chai Sim:
Joint adaptation and adaptive training of TVWR for robust automatic speech recognition. INTERSPEECH 2014: 636-640
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Sim14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Sim14
Khe Chai Sim:
A multimodal stroke-based predictive input for efficient Chinese text entry on mobile devices. SLT 2014: 448-453
2013
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DuanFLSW13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DuanFLSW13
Zhiyan Duan, Haotian Fang, Bo Li, Khe Chai Sim, Ye Wang:
The NUS sung and spoken lyrics corpus: A quantitative comparison of singing and speech. APSIPA 2013: 1-9
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WangS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WangS13
Guangsen Wang, Khe Chai Sim:
Context dependent acoustic keyword spotting using deep neural network. APSIPA 2013: 1-10
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiS13
Bo Li, Khe Chai Sim:
Improving robustness of deep neural networks via spectral masking for automatic speech recognition. ASRU 2013: 279-284
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WangS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WangS13
Guangsen Wang, Khe Chai Sim:
Context-dependent modelling of deep neural network using logistic regression. ASRU 2013: 338-343
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiuS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiuS13
Shilin Liu, Khe Chai Sim:
Multi-stream temporally varying weight regression for cross-lingual speech recognition. ASRU 2013: 434-439
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Sim13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Sim13
Khe Chai Sim:
Approximated Parallel Model Combination for efficient noise-robust speech recognition. ICASSP 2013: 7383-7387
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiS13a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiS13a
Bo Li, Khe Chai Sim:
Noise adaptive front-end normalization based on Vector Taylor Series for Deep Neural Networks in robust speech recognition. ICASSP 2013: 7408-7412
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuS13
Shilin Liu, Khe Chai Sim:
Parameter clustering for temporally varying weight regression for automatic speech recognition. INTERSPEECH 2013: 1796-1800
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangS13
Xiaoxuan Wang, Khe Chai Sim:
Integrating conditional random fields and joint multi-gram model with syllabic features for grapheme-to-phone conversion. INTERSPEECH 2013: 2321-2325
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuS13a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuS13a
Shilin Liu, Khe Chai Sim:
An investigation of temporally varying weight regression for noise robust speech recognition. INTERSPEECH 2013: 2963-2967
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiTS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiTS13
Bo Li, Yu Tsao, Khe Chai Sim:
An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition. INTERSPEECH 2013: 3002-3006
2012
[c41]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/Sim12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Sim12
Khe Chai Sim:
Probabilistic Integration of Partial Lexical Information for Noise Robust Haptic Voice Recognition. ACL (1) 2012: 31-39
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangS12a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangS12a
Guangsen Wang, Khe Chai Sim:
An investigation of tied-mixture GMM based triphone state clustering. ICASSP 2012: 4717-4720
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuS12
Shilin Liu, Khe Chai Sim:
Implicit trajectory modelling using temporally varying weight regression for automatic speech recognition. ICASSP 2012: 4761-4764
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/SimZYL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/SimZYL12
Khe Chai Sim, Shengdong Zhao, Kai Yu, Hank Liao:
ICMI'12 grand challenge: haptic voice recognition. ICMI 2012: 363-370
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/MoonS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/MoonS12
Seungwhan Moon, Khe Chai Sim:
Design and implementation of the note-taking style haptic voice recognition for mobile devices. ICMI 2012: 533-538
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/WangLLWWS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/WangLLWWS12
Guangsen Wang, Bo Li, Shilin Liu, Xuancong Wang, Xiaoxuan Wang, Khe Chai Sim:
Improving mandarin predictive text input by augmenting pinyin initials with speech and tonal information. ICMI 2012: 545-550
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/Sim12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/Sim12
Khe Chai Sim:
Speak-as-you-swipe (SAYS): a multimodal interface combining speech and gesture keyboard synchronously for continuous mobile text entry. ICMI 2012: 555-560
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangNS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangNS12
Xuancong Wang, Hwee Tou Ng, Khe Chai Sim:
Dynamic Conditional Random Fields for Joint Sentence Boundary and Punctuation Prediction. INTERSPEECH 2012: 1384-1387
[c33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS12a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS12a
Bo Li, Khe Chai Sim:
A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition. INTERSPEECH 2012: 1772-1775
[c32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AzimWS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AzimWS12
Aisha S. Azim, Xiaoxuan Wang, Khe Chai Sim:
A Weighted Combination of Speech with Text-based Models for Arabic Diacritization. INTERSPEECH 2012: 2334-2337
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/ZhouSTW12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhouSTW12
Yinsheng Zhou, Khe Chai Sim, Patsy Tan, Ye Wang:
MOGAT: mobile games with auditory training for children with cochlear implants. ACM Multimedia 2012: 429-438
2011
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LeeYLKS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LeeYLKS11
Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, Khe Chai Sim:
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 19(4): 861-870 (2011)
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SimL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SimL11
Khe Chai Sim, Minh-Thang Luong:
A Trajectory-based Parallel Model Combination with a unified static and dynamic parameter compensation for noisy speech recognition. ASRU 2011: 107-112
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangS11
Guangsen Wang, Khe Chai Sim:
Sequential Classification Criteria for NNs in Automatic Speech Recognition. INTERSPEECH 2011: 441-444
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangS11a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangS11a
Guangsen Wang, Khe Chai Sim:
Comparison of Smoothing Techniques for Robust Context Dependent Acoustic Modelling in Hybrid NN/HMM Systems. INTERSPEECH 2011: 457-460
2010
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tois/ChiaSLN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tois/ChiaSLN10
Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou Ng:
Statistical lattice-based spoken document retrieval. ACM Trans. Inf. Syst. 28(1): 2:1-2:30 (2010)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/tomccap/MaddageSL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tomccap/MaddageSL10
Namunu Chinthaka Maddage, Khe Chai Sim, Haizhou Li:
Word level automatic alignment of music and lyrics using vocal synthesis. ACM Trans. Multim. Comput. Commun. Appl. 6(3): 19:1-19:16 (2010)
[c27]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/eusipco/LeeLYKS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/LeeLYKS10
Kong-Aik Lee, Haizhou Li, Chang Huai You, Tomi Kinnunen, Khe Chai Sim:
Discrete expected likelihood kernel for SVM-based speaker verification. EUSIPCO 2010: 591-595
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Sim10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Sim10
Khe Chai Sim:
A minimum variance asynchronous Detection Error Trade-off performance analysis for multi-class detection problems. ICASSP 2010: 4458-4461
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimL10
Khe Chai Sim, Kong-Aik Lee:
Adaptive score fusion using Weighted Logistic Linear Regression for spoken language recognition. ICASSP 2010: 5018-5021
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sim10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sim10
Khe Chai Sim:
Probabilistic state clustering using conditional random field for context-dependent acoustic modelling. INTERSPEECH 2010: 70-73
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS10
Bo Li, Khe Chai Sim:
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems. INTERSPEECH 2010: 526-529
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS10a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS10a
Bo Li, Khe Chai Sim:
Hidden logistic linear regression for support vector machine based phone verification. INTERSPEECH 2010: 2614-2617
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL10
Khe Chai Sim, Shilin Liu:
Semi-parametric trajectory modelling using temporally varying feature mapping for speech recognition. INTERSPEECH 2010: 2982-2985
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Sim10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Sim10
Khe Chai Sim:
Haptic Voice Recognition: Augmenting speech modality with touch events for efficient speech recognition. SLT 2010: 73-78

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/Sim09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/Sim09
Khe Chai Sim:
Discriminative Product-of-Expert acoustic mapping for cross-lingual phone recognition. ASRU 2009: 546-551
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL09
Khe Chai Sim, Haizhou Li:
Stream-based context-sensitive phone mapping for cross-lingual speech recognition. INTERSPEECH 2009: 3019-3022
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/slte/Sim09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slte/Sim09
Khe Chai Sim:
Improving phone verification using state-level posterior features and support vector machine for automatic mispronunciation detection. SLaTE 2009: 133-136
2008
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SimL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SimL08
Khe Chai Sim, Haizhou Li:
On Acoustic Diversification Front-End for Spoken Language Identification. IEEE Trans. Speech Audio Process. 16(5): 1029-1037 (2008)
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimL08
Khe Chai Sim, Haizhou Li:
Robust phone set mapping using decision tree clustering for cross-lingual phone recognition. ICASSP 2008: 4309-4312
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL08
Khe Chai Sim, Haizhou Li:
Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition. INTERSPEECH 2008: 2715-2718
[c13]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/paclic/LiMLSSTZY08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/paclic/LiMLSSTZY08
Haizhou Li, Bin Ma, Kong-Aik Lee, Khe Chai Sim, Hanwu Sun, Rong Tong, Donglai Zhu, Changhuai You:
NIST 2007 Language Recognition Evaluation: From the Perspective of IIR. PACLIC 2008: 46-57
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/ChiaSLN08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/ChiaSLN08
Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou Ng:
A lattice-based approach to query-by-example spoken document retrieval. SIGIR 2008: 363-370
2007
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/SimG07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/SimG07
Khe Chai Sim, Mark J. F. Gales:
Discriminative semi-parametric trajectory model for speech recognition. Comput. Speech Lang. 21(4): 669-687 (2007)
[c11]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LiSKD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiSKD07
Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, Minghui Dong:
Semantic Transliteration of Personal Names. ACL 2007
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TomalinGLSSWWY07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TomalinGLSSWWY07
Marcus Tomalin, Mark J. F. Gales, X. Andrew Liu, Khe Chai Sim, Rohit Sinha, Lan Wang, Philip C. Woodland, Kai Yu:
Improving Speech Transcription for Mandarin-English Translation. ICASSP (4) 2007: 97-100
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimBGSW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimBGSW07
Khe Chai Sim, William J. Byrne, Mark J. F. Gales, Hichem Sahbi, Philip C. Woodland:
Consensus Network Decoding for Statistical Machine Translation System Combination. ICASSP (4) 2007: 105-108
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimL07
Khe Chai Sim, Haizhou Li:
Fusion of contrastive acoustic models for parallel phonotactic spoken language identification. INTERSPEECH 2007: 170-173
2006
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SimG06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SimG06
Khe Chai Sim, Mark J. F. Gales:
Minimum phone error training of precision matrix models. IEEE Trans. Speech Audio Process. 14(3): 882-889 (2006)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SinhaGKLSW06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SinhaGKLSW06
Rohit Sinha, Mark J. F. Gales, Do Yeong Kim, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland:
The Cu-Htk Mandarin Broadcast News Transcription System. ICASSP (1) 2006: 1077-1080
2005
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimG05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimG05
Khe Chai Sim, Mark J. F. Gales:
Adaptation of Precision Matrix Models on Large Vocabulary Continuous Speech Recognition. ICASSP (1) 2005: 97-100
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GalesJLSWY05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GalesJLSWY05
Mark J. F. Gales, Bin Jia, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland, Kai Yu:
Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System. ICASSP (1) 2005: 841-844
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuGSY05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuGSY05
Xunying Liu, Mark J. F. Gales, Khe Chai Sim, Kai Yu:
Investigation of Acoustic Modeling Techniques for LVCSR Systems. ICASSP (1) 2005: 849-852
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimCEGMSW05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimCEGMSW05
Do Yeong Kim, Ho Yin Chan, Gunnar Evermann, Mark J. F. Gales, David Mrva, Khe Chai Sim, Philip C. Woodland:
Development of the CU-HTK 2004 Broadcast News Transcription Systems. ICASSP (1) 2005: 861-864
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimG05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimG05
Khe Chai Sim, Mark J. F. Gales:
Temporally varying model parameters for large vocabulary continuous speech recognition. INTERSPEECH 2005: 2137-2140
2004
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimG04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimG04
Khe Chai Sim, Mark J. F. Gales:
Basis superposition precision matrix modelling for large vocabulary continuous speech recognition. ICASSP (1) 2004: 801-804

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.