default search action
Van Hai Do
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j4]Bao Thang Ta, Nhat Minh Le, Van Hai Do:
Transfer learning methods for low-resource speech accent recognition: A case study on Vietnamese language. Eng. Appl. Artif. Intell. 132: 107895 (2024) - [c30]Bao Thang Ta, Minh Khang Pham, Nhat Minh Le, Van Hai Do:
Human Behavior Modeling in Speech Transcribing Process via Pretrained Speech Recognition Models. IJCNN 2024: 1-6 - 2023
- [c29]Ngoc-Anh Nguyen Thi, Bao Thang Ta, Nhat Minh Le, Van Hai Do:
An Automatic Pipeline For Building Emotional Speech Dataset. APSIPA ASC 2023: 1030-1035 - [c28]Minh Tu Le, Bao Thang Ta, Nhat Minh Le, Phi Le Nguyen, Van Hai Do:
A Gaussian Distribution Labeling Method for Speech Quality Assessment. CSoNet 2023: 27-38 - [c27]Bao Thang Ta, Minh Tu Le, Nhat Minh Le, Van Hai Do:
Probing Speech Quality Information in ASR Systems. INTERSPEECH 2023: 541-545 - [c26]Dinh Son Dang, Tung Lam Nguyen, Bao Thang Ta, Tien Thanh Nguyen, Thi Ngoc Anh Nguyen, Dang Linh Le, Nhat Minh Le, Van Hai Do:
LightVoc: An Upsampling-Free GAN Vocoder Based On Conformer And Inverse Short-time Fourier Transform. INTERSPEECH 2023: 3043-3047 - 2022
- [c25]Bao Thang Ta, Xuan Vuong Dang, Quang Tien Duong, Nhat Minh Le, Van Hai Do:
Improving Vietnamese Accent Recognition Using ASR Transfer Learning. O-COCOSDA 2022 2022: 1-6 - [c24]Quang Tien Duong, Duc Huy Nguyen, Bao Thang Ta, Nhat Minh Le, Van Hai Do:
Improving Self-supervised Audio Representation based on Contrastive Learning with Conformer Encoder. SoICT 2022: 270-275 - 2021
- [c23]Quang Tien Duong, Van Hai Do:
Development of Accent Recognition Systems for Vietnamese Speech. O-COCOSDA 2021: 174-179 - 2020
- [c22]Van Hai Do, Van Tuan Mai:
Agent/Client Speech Identification for Mixed-Channel Conversation in Customer Service Call Centers. IALP 2020: 197-200
2010 – 2019
- 2018
- [j3]Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark A. Hasegawa-Johnson:
Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription. IEEE ACM Trans. Audio Speech Lang. Process. 26(3): 501-514 (2018) - [c21]Quoc Bao Nguyen, Van Tuan Mai, Quang Trung Le, Ba Quyen Dam, Van Hai Do:
Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions. SoICT 2018: 222-226 - 2017
- [c20]Mark Hasegawa-Johnson, Preethi Jyothi, Wenda Chen, Van Hai Do:
Mismatched crowdsourcing: Mining latent skills to acquire speech transcriptions. ACSSC 2017: 1277-1281 - [c19]Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327 - [c18]Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson:
Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition. INTERSPEECH 2017: 734-738 - [c17]Quoc Bao Nguyen, Van Hai Do, Ba Quyen Dam, Minh Hung Le:
Development of a Vietnamese speech recognition system for Viettel call center. O-COCOSDA 2017: 1-5 - 2016
- [c16]Thi-Nga Ho, Tze Yuang Chong, Van Hai Do, Van Tung Pham, Eng Siong Chng:
Improving Efficiency of Sentence Boundary Detection by Feature Selection. ACIIDS (2) 2016: 594-603 - [c15]Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson:
Speech recognition of under-resourced languages using mismatched transcriptions. IALP 2016: 112-115 - [c14]Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li:
Approximate search of audio queries by using DTW with phone time boundary and data augmentation. ICASSP 2016: 6030-6034 - [c13]Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044 - [c12]Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson:
Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages. INTERSPEECH 2016: 3863-3867 - [c11]Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson:
A many-to-one phone mapping approach for cross-lingual speech recognition. RIVF 2016: 120-124 - 2015
- [b1]Van Hai Do:
Acoustic modeling for speech recognition under limited training data conditions. Nanyang Technological University, Singapore, 2015 - [j2]Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages. Int. J. Asian Lang. Process. 23(1): 21-33 (2015) - [c10]Van Hai Do, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Distance metric learning for kernel density-based acoustic model under limited training data conditions. APSIPA 2015: 54-58 - [c9]Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, Haizhou Li:
On the study of very low-resource language keyword search. APSIPA 2015: 358-364 - [c8]Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, Haizhou Li:
Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation. APSIPA 2015: 594-98 - [c7]Haihua Xu, Van Hai Do, Xiong Xiao, Engsiong Chng:
A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition. INTERSPEECH 2015: 2132-2136 - 2014
- [j1]Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages. IEICE Trans. Inf. Syst. 97-D(2): 285-295 (2014) - [c6]Van Hai Do, Xiong Xiao, Chng Eng Siong, Haizhou Li:
Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR. INTERSPEECH 2014: 6-10 - [c5]Mirco Ravanelli, Van Hai Do, Adam Janin:
TANDEM-bottleneck feature combination using hierarchical Deep Neural Networks. ISCSLP 2014: 113-117 - 2013
- [c4]Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context-dependent phone mapping for LVCSR of under-resourced languages. INTERSPEECH 2013: 500-504 - [c3]Korbinian Riedhammer, Van Hai Do, James Hieronymus:
A study on LVCSR and keyword search for tagalog. INTERSPEECH 2013: 2529-2533 - 2012
- [c2]Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
A Phone Mapping Technique for Acoustic Modeling of Under-Resourced Languages. IALP 2012: 233-236 - [c1]Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context dependant phone mapping for cross-lingual acoustic modeling. ISCSLP 2012: 16-20
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:06 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint