Kshitiz Kumar

2020 – today

2023
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06327
Mohammad Soleymanpour, Mahmoud Al Ismail, Fahimeh Bahmaninezhad, Kshitiz Kumar, Jian Wu:
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss. CoRR abs/2308.06327 (2023)
2022
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TompkinsKW22
Daniel Tompkins, Kshitiz Kumar, Jian Wu:
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study. ICASSP 2022: 1016-1020
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03514
Daniel Tompkins, Kshitiz Kumar, Jian Wu:
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study. CoRR abs/2202.03514 (2022)
2021
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasKW21
Amit Das, Kshitiz Kumar, Jian Wu:
Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts. ICASSP 2021: 6244-6248
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AfshanKW21
Amber Afshan, Kshitiz Kumar, Jian Wu:
Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models. Interspeech 2021: 4084-4088
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00099
Amber Afshan, Kshitiz Kumar, Jian Wu:
Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models. CoRR abs/2107.00099 (2021)
2020
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarSKW20
Kshitiz Kumar, Emilian Stoimenov, Hosam Khalil, Jian Wu:
Fast and Slow Acoustic Model. INTERSPEECH 2020: 541-545
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarRGW20
Kshitiz Kumar, Bo Ren, Yifan Gong, Jian Wu:
Bandpass Noise Generation and Augmentation for Unified ASR. INTERSPEECH 2020: 1683-1687
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarLGW20
Kshitiz Kumar, Chaojun Liu, Yifan Gong, Jian Wu:
1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM. INTERSPEECH 2020: 2107-2111
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JoshiZMKL20
Vikas Joshi, Rui Zhao, Rupesh R. Mehta, Kshitiz Kumar, Jinyu Li:
Transfer Learning Approaches for Streaming End-to-End Speech Recognition System. INTERSPEECH 2020: 2152-2156
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-05086
Vikas Joshi, Rui Zhao, Rupesh R. Mehta, Kshitiz Kumar, Jinyu Li:
Transfer Learning Approaches for Streaming End-to-End Speech Recognition System. CoRR abs/2008.05086 (2020)

2010 – 2019

2019
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarAG19
Kshitiz Kumar, Tasos Anastasakos, Yifan Gong:
Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier. ICASSP 2019: 2712-2716
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarG19
Kshitiz Kumar, Yifan Gong:
Static and Dynamic State Predictions for Acoustic Model Combination. ICASSP 2019: 2782-2786
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-01239
Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong:
Speaker Adaptation for End-to-End CTC Models. CoRR abs/1901.01239 (2019)
2018
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/isnn/ZhouKAKKS18
Xinhui Zhou, Chiman Kwan, Bulent Ayhan, Chanwoo Kim, Kshitiz Kumar, Richard M. Stern:
A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition. ISNN 2018: 494-502
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiLZKG18
Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong:
Speaker Adaptation for End-to-End CTC Models. SLT 2018: 542-549
2017
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoLKG17
Yong Zhao, Jinyu Li, Kshitiz Kumar, Yifan Gong:
Extended low-rank plus diagonal adaptation for deep and recurrent neural networks. ICASSP 2017: 5040-5044
[p1]
  persistent URL:
  - https://dblp.org/rec/books/sp/17/GongHKLLYZZZ17
Yifan Gong, Yan Huang, Kshitiz Kumar, Jinyu Li, Chaojun Liu, Guoli Ye, Shi-Xiong Zhang, Yong Zhao, Rui Zhao:
Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 401-417
2016
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuWKG16
Chaojun Liu, Yongqiang Wang, Kshitiz Kumar, Yifan Gong:
Investigations on speaker adaptation of LSTM RNN models for speech recognition. ICASSP 2016: 5020-5024
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarLG16
Kshitiz Kumar, Chaojun Liu, Yifan Gong:
Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile. ICASSP 2016: 5285-5289
2015
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarBZLDG15
Kshitiz Kumar, Ziad Al Bawab, Yong Zhao, Chaojun Liu, Benoît Dumoulin, Yifan Gong:
Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation. INTERSPEECH 2015: 702-706
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarLYG15
Kshitiz Kumar, Chaojun Liu, Kaisheng Yao, Yifan Gong:
Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation. INTERSPEECH 2015: 1091-1095
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarLG15
Kshitiz Kumar, Chaojun Liu, Yifan Gong:
Delta-melspectra features for noise robustness to DNN-based ASR systems. INTERSPEECH 2015: 2445-2448
2014
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarLG14
Kshitiz Kumar, Chaojun Liu, Yifan Gong:
Normalization of ASR confidence classifier scores via confidence mapping. INTERSPEECH 2014: 1199-1203
2013
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangKLGD13
Po-Sen Huang, Kshitiz Kumar, Chaojun Liu, Yifan Gong, Li Deng:
Predicting speech recognition confidence using deep learning with word identity and score features. ICASSP 2013: 7413-7417
2011
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/us/Kumar18a
Kshitiz Kumar:
A Spectro-Temporal Framework for Compensation of Reverberation for Speech Recognition. Carnegie Mellon University, USA, 2011
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarSRS11
Kshitiz Kumar, Rita Singh, Bhiksha Raj, Richard M. Stern:
Gammatone sub-band magnitude-domain dereverberation for ASR. ICASSP 2011: 4604-4607
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarKS11
Kshitiz Kumar, Chanwoo Kim, Richard M. Stern:
Delta-spectral cepstral coefficients for robust speech recognition. ICASSP 2011: 4784-4787
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimKS11
Chanwoo Kim, Kshitiz Kumar, Richard M. Stern:
Binaural sound source separation motivated by auditory processing. ICASSP 2011: 5072-5075
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarRSS11
Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard M. Stern:
An iterative least-squares technique for dereverberation. ICASSP 2011: 5488-5491
2010
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarS10
Kshitiz Kumar, Richard M. Stern:
Maximum-likelihood-based cepstral inverse filtering for blind speech dereverberation. ICASSP 2010: 4282-4285

2000 – 2009

2009
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/asru/KimKS09
Chanwoo Kim, Kshitiz Kumar, Richard M. Stern:
Robust speech recognition using a Small Power Boosting algorithm. ASRU 2009: 243-248
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/KumarNMLRP09
Kshitiz Kumar, Jirí Navrátil, Etienne Marcheret, Vit Libal, Ganesh N. Ramaswamy, Gerasimos Potamianos:
Audio-visual speech synchronization detection using a bimodal linear prediction model. CVPR Workshops 2009: 53-59
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarNMLP09
Kshitiz Kumar, Jirí Navrátil, Etienne Marcheret, Vit Libal, Gerasimos Potamianos:
Robust audio-visual speech synchrony detection by generalized bimodal linear prediction. INTERSPEECH 2009: 2251-2254
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimKRS09
Chanwoo Kim, Kshitiz Kumar, Bhiksha Raj, Richard M. Stern:
Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain. INTERSPEECH 2009: 2495-2498
2008
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KumarWWS08
Kshitiz Kumar, Qi Wu, Yiming Wang, Marios Savvides:
Noise robust speaker identification using Bhattacharyya distance in adapted Gaussian models space. EUSIPCO 2008: 1-4
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarS08
Kshitiz Kumar, Richard M. Stern:
Environment-invariant compensation for reverberation using linear post-filtering for minimum distortion. ICASSP 2008: 4121-4124
2007
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarCS07
Kshitiz Kumar, Tsuhan Chen, Richard M. Stern:
Profile View Lip Reading. ICASSP (4) 2007: 429-432
