default search action
ISCSLP 2012: Kowloon Tong, China
- 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, Kowloon Tong, China, December 5-8, 2012. IEEE 2012, ISBN 978-1-4673-2506-6
- Jia Pan, Cong Liu, Zhiguo Wang, Yu Hu, Hui Jiang:
Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMS in acoustic modeling. 301-305 - Maolin Wang, Shengnan Xiong, Jiayun Li, Ziyu Xiong:
A study on the coarticulation of bi-syllabic words in Chinese. 426-430 - Su Jun Leow, Tze Siong Lau, Alvina Goh, Han Meng Peh, Teck Khim Ng, Sabato Marco Siniscalchi, Chin-Hui Lee:
A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting. 112-116 - Zhengqi Wen, Jianhua Tao, Hao Che:
Statistical modification based post-filtering technique for HMM-based speech synthesis. 146-149 - Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis. 84-87 - Wai-Sum Lee:
A cross-dialect comparison of vowel dispersion and vowel variability. 25-29 - Feng-Long Xie, Yi-Jian Wu, Frank K. Soong:
Cross validation and Minimum Generation Error for improved model clustering in HMM-based TTS. 60-63 - Ying-Lang Chang, Jen-Tzung Chien:
Bayesian nonparametric language models. 188-192 - Chenhao Zhang, Thomas Fang Zheng, Ruxin Chen:
Text-Dependent Speaker Recognition with long-term features based on functional data analysis. 340-344 - Xuan Ji, Jing Wang, Hailong He, Jingming Kuang:
The lossless adaptive arithmetic coding based on context for ITU-T G.719 at variable rate. 210-214 - Qiang Wang, Zhiyuan Guo, Gang Liu, Jun Guo:
Boundary-expanding locality sensitive hashing. 358-362 - Wei-Fan Chen, Chin-Kuan Kuo, Yih-Ru Wang, Sin-Horng Chen:
A syllable-based prosody modeling for L1 and L2 English speeches. 281-285 - Kui Wu, Yan Song, Wu Guo, Li-Rong Dai:
Intra-conversation intra-speaker variability compensation for speaker clustering. 330-334 - Liang He, Jia Li:
Discriminant local information distance preserving projection for text-independent speaker recognition. 349-352 - Xingyu Na, Xiang Xie, Jingming Kuang, Yaling He:
An improved tone labeling and prediction method with non-uniform segmentation of F0 contour. 252-255 - Xiaotian Zhang, Yao Qian, Hai Zhao, Frank K. Soong:
Break index labeling of mandarin text via syntactic-to-prosodic tree mapping. 256-260 - Cheng Hsien Lin, Po Kai Huang, Cheng-Yuan Lin, Chih-Chung Kuo:
Effective sentence selection based on phone/model coverage maximization for speaker adaptation in HMM-based speech synthesis. 74-78 - I-Fan Su, Sin-Ting Yeung, Brendan S. Weekes, Sam-Po Law:
Locus of orthographic facilitation effect in spoken word production: Evidence from cantonese Chinese. 440-444 - Lei Xie, Chenglin Xu, Xiaoxuan Wang:
Prosody-based sentence boundary detection in Chinese broadcast news. 261-265 - Wai-Sum Lee:
Articulatory and spectral characteristics of Cantonese vowels. 45-49 - Cheng-Yuan Lin, Chien-Hung Huang, Chih-Chung Kuo:
A simple and effective pitch re-estimation method for rich prosody and speaking styles in HMM-based speech synthesis. 286-290 - Zhiyang He, Ping Lv, Wei Li, Ji Wu:
A synchronized pruning composition algorithm of weighted finite state transducers for large vocabulary speech recognition. 11-15 - Chao-Hong Liu, Chung-Hsien Wu, David Sarwono:
Alternative hypothesis generation using a weighted kernel feature matrix for ASR substitution error correction. 1-5 - Qinghua Wu, Xiao-Lei Zhang, Ping Lv, Ji Wu:
Perceptual similarity between audio clips and feature selection for its measurement. 387-391 - Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Exploring mutual information for GMM-based spectral conversion. 50-54 - Jinfu Ni, Yoshinori Shiga, Hisashi Kawai, Hideki Kashioka:
Resonance-based spectral deformation in HMM-based speech synthesis. 88-92 - Yong Xu, Wu Guo, Li-Rong Dai:
A hybrid fragment / syllable-based system for improved OOV term detection. 378-382 - Yong Xu, Wu Guo, Shan Su, Li-Rong Dai:
Spoken term detection for OOV terms based on triphone confusion matrix. 98-102 - Maolin Wang, Wei Shi, Ruixian Huang, Ziyu Xiong:
The temporal effect of speaking rate, focus and prosody in Chinese. 445-449 - Ruofei Chen, Cheung-Fat Chan:
Hierarchical clustering and robust identification for block-based autoregressive speech parameter estimation. 103-107 - Shixiang Lu, Wei Wei, Xiaoyin Fu, Lichun Fan, Bo Xu:
Phrase-based data selection for language model adaptation in spoken language translation. 193-196 - Syu-Siang Wang, Jeih-Weih Hung, Yu Tsao:
A study on cepstral sub-band normalization for robust ASR. 141-145 - Chen Zhao, Hongcui Wang, Songgun Hyon, Jianguo Wei, Jianwu Dang:
Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese. 345-348 - Duy Khanh Ninh, Masanori Morise, Yoichi Yamashita:
Incorporating dynamic features into minimum generation error training for HMM-based speech synthesis. 55-59 - Guo Li, Peggy Mok:
Preliminary study on the interlanguage speech intelligibility benefit for English-Mandarin bilingual l2 learners. 409-412 - Huijun Ding, Tan Lee, Ing Yann Soon:
Two objective measures for speech distortion and noise reduction evaluation of enhanced speech signals. 117-121 - Tao Jiang, Zhiyong Wu, Jia Jia, Lianhong Cai:
Perceptual clustering based unit selection optimization for concatenative text-to-speech synthesis. 64-68 - Weifeng Li, Qingmin Liao:
Keyword-specific normalization based keyword spotting for spontaneous speech. 233-237 - Siu Wa Lee, Minghui Dong, Haizhou Li:
A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis. 150-154 - Yuguang Wang, Hongcui Wang, Jiaqi Gao, Jianguo Wei, Jianwu Dang:
Detailed morphological analysis of mandarin sustained steady vowels. 413-416 - Chunrong Li, Zhiyong Wu, Fanbo Meng, Helen M. Meng, Lianhong Cai:
Detection and emphatic realization of contrastive word pairs for expressive text-to-speech synthesis. 93-97 - Yan Li, Si Li, Weiran Xu, Jun Guo:
Analyzing semantic orientation of terms using Affinity Propagation. 30-34 - Xixin Wu, Zhiyong Wu, Jia Jia, Lianhong Cai:
Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers. 363-367 - Van Hai Do, Xiong Xiao, Engsiong Chng, Haizhou Li:
Context dependant phone mapping for cross-lingual acoustic modeling. 16-20 - Xiaoyin Fu, Wei Wei, Lichun Fan, Shixiang Lu, Bo Xu:
Nesting hierarchical phrase-based model for speech-to-speech translation. 368-372 - Yu Zou, Yan Wang, Wei He:
Diachronic contrastive analysis on read speech in broadcast news: Evidence from pitch and duration. 291-295 - Masashi Unoki, Xugang Lu:
Unified denoising and dereverberation method used in restoration of MTF-based power envelope. 215-219 - Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling the tradeoff property in a regularization framework for noise reduction. 201-205 - Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling. 311-314 - Fei Chen, Tian Guan, Lena L. N. Wong:
Effects of excitation spread on the intelligibility of Mandarin speech in cochlear implant simulations. 35-39 - Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong, Haizhou Li:
An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition. 131-135 - Wenping Hu, Yao Qian, Frank K. Soong:
Pitch accent detection and prediction with DCT features and CRF model. 266-270 - Dazuo Wang, Xiuxiu Wang, Gang Peng:
Effects of carriers on Mandarin tone categorical perception. 417-421 - Yao Qian, Frank K. Soong:
A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation. 165-169 - Pengfei Liu, Ka-Wa Yuen, Wai-Kim Leung, Helen M. Meng:
mENUNCIATE: Development of a computer-aided pronunciation training system on a cross-platform framework for mobile, speech-enabled application development. 170-173 - Yingying Gao, Weibin Zhu:
How to describe speech emotion more completely - An investigation on Chinese broadcast news speech. 450-453 - Cheung-Chi Leung, Bin Ma, Haizhou Li:
Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers. 108-111 - Cuiling Zhang:
Acoustic analysis of disguised voices with raised and lowered pitch. 353-357 - Jian Xu, Zhi-Jie Yan, Qiang Huo:
A comparative study of fMPE and RDLT approaches to LVCSR. 21-24 - Jian Zhang, Risheng Xia, Zhonghua Fu, Junfeng Li, Yonghong Yan:
A fast two-microphone noise reduction algorithm based on power level ratio for mobile phone. 206-209 - Jian Xu, Zhi-Jie Yan, Qiang Huo:
A feature-transform based approach to unsupervised task adaptation and personalization. 229-232 - Kuan-Lang Huang, Tai-Shih Chi:
TDOA information based vad for robust speech recognition in directional and diffuse noise field. 126-130 - Dac-Thang Hoang, Hsiao-Chuan Wang:
A phone segmentation method and its evaluation on Mandarin speech corpus. 373-377 - Mengxue Cao, Aijun Li, Qiang Fang, Jianguo Wei, Chan Song, Jianwu Dang:
Acoustic and articulatory analysis on Japanese vowels in emotional speech. 40-44 - Po-Yi Shih, Bo-Wei Chen, Jhing-Fa Wang, Jhing-Wei Wu:
Enhanced lengthening cancellation using bidirectional pitch similarity alignment for spontaneous speech. 238-242 - Jinfu Ni, Yoshinori Shiga, Hisashi Kawai, Hideki Kashioka:
Experiments on unsupervised statistical parametric speech synthesis. 155-159 - Yasuaki Kanai, Masashi Unoki:
Robust voice activity detection using empirical mode decomposition and modulation spectrum analysis. 400-404 - Kun Li, Helen M. Meng:
Perceptually-motivated assessment of automatically detected lexical stress in L2 learners' speech. 179-183 - Na Li, Yu Qiao:
Voice conversion using Bayesian mixture of Probabilistic Linear Regressions and dynamic kernel features. 69-73 - Chen-Yu Yang, Georgina Brown, Liang Lu, Junichi Yamagishi, Simon King:
Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation. 220-223 - Junhong Zhao, Weiqiang Zhang, Hua Yuan, Jia Liu, Shanhong Xia:
Automatic pitch accent detection using auto-context with acoustic features. 247-251 - Xian-Jun Xia, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai:
Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech. 160-164 - Zhanlei Yang, Wenju Liu, Hao Chao:
An improved steady segment based decoding algorithm by using response probability for LVCSR. 306-310 - Yang Li, Xunying Liu, Lan Wang:
Structured modeling based on generalized variable parameter HMMs and speaker adaptation. 136-140 - Wei Rao, Man-Wai Mak:
Alleviating the small sample-size problem in i-vector based speaker verification. 335-339 - Chen-Yu Chiang, Sabato Marco Siniscalchi, Yih-Ru Wang, Sin-Horng Chen, Chin-Hui Lee:
A study on cross-language knowledge integration in Mandarin LVCSR. 315-319 - Lichun Fan, Dengfeng Ke, Xiaoyin Fu, Shixiang Lu, Bo Xu:
Power-normalized PLP (PNPLP) feature for robust speech recognition. 224-228 - Jia Jia, Wai-Kim Leung, Ye Tian, Lianhong Cai, Helen M. Meng:
Analysis on mispronunciations in CAPT based on computational speech perception. 174-178 - Ching-feng Yeh, Yiu-Chang Lin, Lin-Shan Lee:
Minimum Phone Error model training on merged acoustic units for transcribing bilingual code-switched speech. 320-324 - Guoli Ye, Brian Mak:
Speaker-ensemble hidden Markov modeling for automatic speech recognition. 6-10 - Song Wang, Shen Liu, Jianguo Wei, Qiang Fang, Jianwu Dang:
Reconstruction of vocal tract based on multi-source image information. 396-399 - Ye Tian, Jia Jia, Yongxin Wang, Lianhong Cai:
A real-time tone enhancement method for continuous Mandarin speeches. 405-408 - Chiu-yu Tseng, Chao-yu Su:
Information allocation and prosodic expressiveness in continuous speech: A Mandarin cross-genre analysis. 243-246 - Yi-Chin Huang, Chung-Hsien Wu, Sz-Ting Weng:
Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis. 79-83 - Chan Song, Jianguo Wei, Qiang Fang, Shen Liu, Yuguang Wang, Jianwu Dang:
Tongue shape synthesis based on Active Shape Model. 383-386 - Hua Yuan, Junhong Zhao, Jia Liu:
Improve mispronunciation detection with Tandem feature. 184-187 - Bin Li, Rong Rong:
Tones in whispered Mandarin. 422-425 - Ting Zou, Jinsong Zhang, Wen Cao:
A comparative study of perception of tone 2 and tone 3 in Mandarin by native speakers and Japanese learners. 431-435 - Sagun Dhakhwa, Jens Allwood:
Self documentation of endangered languages. 392-395 - Jun Du, Qiang Huo:
Synthesized stereo-based stochastic mapping with data selection for robust speech recognition. 122-125 - Hongwei Ding, Daniel Hirst:
A preliminary investigation of the third tone sandhi in standard Chinese with a prosodic corpus. 436-439 - Xin Chen, Jian Cheng:
Acoustic modeling for native and non-native Mandarin speech recognition. 325-329 - Yinghao Li, Jinghua Zhang, Jiangping Kong:
The coarticulation resistance of consonants in standard Chinese - An electropalatographic and acoustic study. 454-458 - Aijun Li, Qiang Fang, Yuan Jia, Jianwu Dang:
More targets? Simulating emotional intonation of mandarin with PENTA. 271-275 - Yuan Jia, Aijun Li:
Phonetic realization of accent from Chinese English learners in various dialectal regions. 296-300 - Xinhui Hu, Youzheng Wu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting sentences from web resources for constructing spontaneous Chinese language model. 197-200 - Helen M. Meng:
Welcome message from the conference chair. - Brian Mak, Bin Ma:
Welcome message from the technical program chairs.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.