default search action
Yoshinori Shiga
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [j8]Takuma Okamoto, Keisuke Matsubara, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Neural speech-rate conversion with multispeaker WaveNet vocoder. Speech Commun. 138: 1-12 (2022) - 2021
- [j7]Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU. IEEE Access 9: 94923-94933 (2021) - [c42]Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Noise Level Limited Sub-Modeling for Diffusion Probabilistic Vocoders. ICASSP 2021: 6029-6033 - [c41]Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC. ICASSP 2021: 7058-7062 - 2020
- [c40]Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Transformer-Based Text-to-Speech with Weighted Forced Attention. ICASSP 2020: 6729-6733 - [p2]Hiroaki Kato, Shoji Harada, Tasuku Kitade, Yoshinori Shiga:
Multilingualization of Speech Processing. Speech-to-Speech Translation 2020: 1-20 - [p1]Yoshinori Shiga, Jinfu Ni, Kentaro Tachibana, Takuma Okamoto:
Text-to-Speech Synthesis. Speech-to-Speech Translation 2020: 39-52
2010 – 2019
- 2019
- [c39]Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Tacotron-Based Acoustic Model Using Phoneme Alignment for Practical Neural Text-to-Speech Systems. ASRU 2019: 214-221 - [c38]Saly Keo, Soky Kak, Yoshinori Shiga, Hiroaki Kato, Hisashi Kawai:
HMM-based TTS System Framework. CIFEr 2019: 1 - [c37]Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Investigations of Real-time Gaussian Fftnet and Parallel Wavenet Neural Vocoders with Simple Acoustic Features. ICASSP 2019: 7020-7024 - [c36]Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders. INTERSPEECH 2019: 1308-1312 - [c35]Jinfu Ni, Yoshinori Shiga, Hisashi Kawai:
Duration Modeling with Global Phoneme-Duration Vectors. INTERSPEECH 2019: 4465-4469 - 2018
- [c34]Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features. ICASSP 2018: 5654-5658 - [c33]Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation. ICASSP 2018: 5664-5668 - [c32]Jinfu Ni, Yoshinori Shiga, Hisashi Kawai:
Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors. INTERSPEECH 2018: 2823-2827 - [c31]Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Improving FFTNet Vocoder with Noise Shaping and Subband Approaches. SLT 2018: 304-311 - 2017
- [j6]Shigeki Matsuda, Teruaki Hayashi, Yutaka Ashikari, Yoshinori Shiga, Hidenori Kashioka, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Development of the "VoiceTra" Multi-Lingual Speech Translation System. IEICE Trans. Inf. Syst. 100-D(4): 621-632 (2017) - [j5]Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga:
Sentence Selection Based on Extended Entropy Using Phonetic and Prosodic Contexts for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 1107-1116 (2017) - [c30]Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Subband wavenet with overlapped single-sideband filterbanks. ASRU 2017: 698-704 - [c29]Jinfu Ni, Yoshinori Shiga, Hisashi Kawai:
Global Syllable Vectors for Building TTS Front-End with Deep Learning. INTERSPEECH 2017: 769-773 - 2016
- [j4]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Superpositional HMM-Based Intonation Synthesis Using a Functional F0 Model. J. Signal Process. Syst. 82(2): 273-286 (2016) - [c28]Jinfu Ni, Yoshinori Shiga, Hisashi Kawai:
Using Zero-Frequency Resonator to Extract Multilingual Intonation Structure. INTERSPEECH 2016: 1522-1526 - [c27]Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai:
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework. INTERSPEECH 2016: 2288-2292 - 2015
- [j3]Komei Sugiura, Yoshinori Shiga, Hisashi Kawai, Teruhisa Misu, Chiori Hori:
A cloud robotics approach towards dialogue-oriented robot speech. Adv. Robotics 29(7): 449-456 (2015) - [c26]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Extraction of pitch register from expressive speech in Japanese. ICASSP 2015: 4764-4768 - [c25]Ye Kyaw Thu, Win Pa Pa, Jinfu Ni, Yoshinori Shiga, Andrew M. Finch, Chiori Hori, Hisashi Kawai, Eiichiro Sumita:
HMM based myanmar text to speech system. INTERSPEECH 2015: 2237-2241 - [c24]Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito:
Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts. INTERSPEECH 2015: 3491-3495 - 2014
- [j2]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis. IEEE J. Sel. Top. Signal Process. 8(2): 239-250 (2014) - [c23]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Tuning intonation with pitch accent decomposition for HMM-based expressive speech synthesis. APSIPA 2014: 1-10 - [c22]Komei Sugiura, Yoshinori Shiga, Hisashi Kawai, Teruhisa Misu, Chiori Hori:
Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach. ICRA 2014: 2237-2242 - [c21]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Superpositional HMM-based intonation synthesis using a functional F0 model. ISCSLP 2014: 270-274 - 2013
- [c20]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Improvements to HMM-based speech synthesis based on parameter generation with rich context models. INTERSPEECH 2013: 364-368 - [c19]Jinfu Ni, Yoshinori Shiga, Chiori Hori, Yutaka Kidawara:
A targets-based superpositional model of fundamental frequency contours applied to HMM-based speech synthesis. INTERSPEECH 2013: 1052-1056 - [c18]Shigeki Matsuda, Xinhui Hu, Yoshinori Shiga, Hideki Kashioka, Chiori Hori, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Multilingual Speech-to-Speech Translation System: VoiceTra. MDM (2) 2013: 229-233 - 2012
- [c17]Yoshinori Shiga:
Effect of anti-aliasing filtering on the quality of speech from an HMM-based synthesizer. ICASSP 2012: 4525-4528 - [c16]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai, Sakriani Sakti, Satoshi Nakamura:
An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis. INTERSPEECH 2012: 1139-1142 - [c15]Jinfu Ni, Yoshinori Shiga, Hisashi Kawai, Hideki Kashioka:
Resonance-based spectral deformation in HMM-based speech synthesis. ISCSLP 2012: 88-92 - [c14]Jinfu Ni, Yoshinori Shiga, Hisashi Kawai, Hideki Kashioka:
Experiments on unsupervised statistical parametric speech synthesis. ISCSLP 2012: 155-159 - 2011
- [c13]Teruhisa Misu, Etsuo Mizukami, Yoshinori Shiga, Shinichi Kawamoto, Hisashi Kawai, Satoshi Nakamura:
Analysis on Effects of Text-to-Speech and Avatar Agent in Evoking Users' Spontaneous Listener's Reactions. IWSDS 2011: 77-89 - [c12]Teruhisa Misu, Etsuo Mizukami, Yoshinori Shiga, Shinichi Kawamoto, Hisashi Kawai, Satoshi Nakamura:
Toward Construction of Spoken Dialogue System that Evokes Users' Spontaneous Backchannels. SIGDIAL Conference 2011: 259-265 - 2010
- [c11]Yoshinori Shiga, Tomoki Toda, Shinsuke Sakai, Jinfu Ni, Hisashi Kawai, Keiichi Tokuda, Minoru Tsuzaki, Satoshi Nakamura:
NICT Blizzard Challenge 2010 Entry. Blizzard Challenge 2010 - [c10]Yoshinori Shiga, Tomoki Toda, Shinsuke Sakai, Hisashi Kawai:
Improved training of excitation for HMM-based parametric speech synthesis. INTERSPEECH 2010: 809-812
2000 – 2009
- 2009
- [c9]Ranniery Maia, Tomoki Toda, Shinsuke Sakai, Yoshinori Shiga, Jinfu Ni, Hisashi Kawai, Keiichi Tokuda, Minoru Tsuzaki, Satoshi Nakamura:
The NICT Entry for the Blizzard Challenge 2009: an Enhanced HMM-based Speech Synthesis System with Trajectory Training considering Global Variance and State-Dependent Mixed Excitation. Blizzard Challenge 2009 - [c8]Yoshinori Shiga:
Pulse density representation of spectrum for statistical speech processing. INTERSPEECH 2009: 1771-1774 - 2007
- [j1]Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine, Yoshinori Shiga:
An F0 contour control model using an F0 contour codebook. Syst. Comput. Jpn. 38(1): 62-72 (2007) - 2004
- [c7]Yoshinori Shiga, Simon King:
Source-filter separation for articulation-to-speech synthesis. INTERSPEECH 2004: 1913-1916 - [c6]Yoshinori Shiga, Simon King:
Estimating detailed spectral envelopes using articulatory clustering. INTERSPEECH 2004: 2485-2488 - [c5]Yoshinori Shiga, Simon King:
Accurate spectral envelope estimation for articulation-to-speech synthesis. SSW 2004: 19-24 - 2003
- [c4]Yoshinori Shiga, Simon King:
Estimating the spectral envelope of voiced speech using multi-frame analysis. INTERSPEECH 2003: 1737-1740 - [c3]Yoshinori Shiga, Simon King:
Estimation of voice source and vocal tract characteristics based on multi-frame analysis. INTERSPEECH 2003: 1749-1752
1990 – 1999
- 1998
- [c2]Yoshinori Shiga, Hiroshi Matsuura, Tsuneo Nitta:
Segmental duration control based on an articulatory model. ICSLP 1998 - 1994
- [c1]Yoshinori Shiga, Yoshiyuki Hara, Tsuneo Nitta:
A novel segment-concatenation algorithm for a cepstrum-based synthesizer. ICSLP 1994: 1783-1786
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-22 00:33 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint