Antonio Bonafonte

Publication Name: Primeras Jornadas de Tecnologıa …

Download (.pdf)

Publisher: Citeseer

Publication Date: Jan 1, 2003

Publication Name: Proc. of …

Research Interests:
Speech Recognition, Statistical Machine Translation, Language Resources, Spontaneous speech, and Speech Translation

Download (.pdf)

In this paper, an extension of n-grams is proposed. In this extension, the memory of the model (n) is not fixed a priori. Instead, first, large memories are accepted and afterwards, merging criteria are applied to reduce complexity and to... more

In this paper, an extension of n-grams is proposed. In this extension, the memory of the model (n) is not fixed a priori. Instead, first, large memories are accepted and afterwards, merging criteria are applied to reduce complexity and to ensure reliable estimations. The results show how the perplexity obtained with x-grams is smaller than that of n-grams. Furthermore, the

Research Interests:
Language Model

Page 1. Towards Robust Glottal Source Modeling Javier Pérez, Antonio Bonafonte Department of Signal Theory and Communication TALP Research Center Technical University of Catalonia (UPC), Barcelona, Spain {javierp,antonio}@gps.tsc.upc.edu... more

Page 1. Towards Robust Glottal Source Modeling Javier Pérez, Antonio Bonafonte Department of Signal Theory and Communication TALP Research Center Technical University of Catalonia (UPC), Barcelona, Spain {javierp,antonio}@gps.tsc.upc.edu ...

Publisher: isca-speech.org

Publication Date: 2009

Publication Name: Tenth Annual Conference of the …

Publisher: isca-speech.org

Publication Date: 2005

Publication Name: Ninth European Conference on Speech …

Download (.pdf)

Publication Date: 2009

Research Interests:
Natural Language Processing, Computational Linguistics, Greedy Algorithms, Algorithm, and Procesamiento del Lenguaje Natural

Applying a recently presented text-independent speech alignment technique based on unit selection to the training of a voice conversion system suggested that the more training data was available, the less speaker-specific information was... more

Applying a recently presented text-independent speech alignment technique based on unit selection to the training of a voice conversion system suggested that the more training data was available, the less speaker-specific information was learned. This paradoxical effect contradicts experience we have from other corpus-based applications as speech recognition or synthesis. There, the performance usually gains with increasing amount of data.

Publication Date: 2000

Research Interests:
Speech Recognition, Speech Processing, Voice Conversion, and Unit Selection

Download (.pdf)

Research Interests:
Speech Recognition

There are many exhaustive works that deal with the use of models for segmental duration. The aim of this paper is to evaluate some of the properties mentioned in literature and evaluate factorial and sum-of-products models in front of a... more

There are many exhaustive works that deal with the use of models for segmental duration. The aim of this paper is to evaluate some of the properties mentioned in literature and evaluate factorial and sum-of-products models in front of a list- like approach for Catalan language as a base for a most exhaustive study on duration in this language. Sum-of-products

ABSTRACT

Resumen En este artıculo se presentan dos nuevos sistemas para las segmentación de voz en fonemas. Uno basado en un clustering acústico previo a un alineado por programación dinámica y el segundo basado en una corrección especıfica de las... more

Resumen En este artıculo se presentan dos nuevos sistemas para las segmentación de voz en fonemas. Uno basado en un clustering acústico previo a un alineado por programación dinámica y el segundo basado en una corrección especıfica de las fronteras mediante un ...

Research Interests:
Speech Synthesis, English language, Database Design, Human Interaction, Speech analysis, and English Language

Download (.pdf)

Publication Date: 1993

Research Interests:
Efficient Algorithm for ECG Coding

Download (.pdf)

Unit selection speech synthesis techniques lead the speech synthesis state of the art. Automatic segmentation of databases is necessary in order to build new voices. They may contain errors and segmentation processes may introduce some... more

Unit selection speech synthesis techniques lead the speech synthesis state of the art. Automatic segmentation of databases is necessary in order to build new voices. They may contain errors and segmentation processes may introduce some more. Quality systems require a significant effort to find and correct these segmentation errors. Phonetic transcription is crucial and is one of the manually supervised

Publication Date: 2006

Publication Name: International Conference on Acoustics, Speech, and Signal Processing

Research Interests:
Speech Synthesis, Speech Acoustics, Speech Recognition, Text to Speech, Quality system, and Unit Selection

Publication Date: 2000

Download (.pdf)

Hidden Markov Modeling (HMM) techniques have been applied successfully to speech recognition problems. However, it has been claimed [1]-[5] that a major weakness of HMM is that the state duration probability density functions (SDPDF) are... more

Hidden Markov Modeling (HMM) techniques have been applied successfully to speech recognition problems. However, it has been claimed [1]-[5] that a major weakness of HMM is that the state duration probability density functions (SDPDF) are exponential, which is not appropriate for speech signals. In order to cope with this deficiency some authors have proposed to model explicitly the state duration.

Publication Date: 2000

Research Interests:
Speech Recognition, hidden Markov model, Efficient Algorithm for ECG Coding, First-Order Logic, PROBABILITY DENSITY, and 4 moreProbability Density Function, Baum Welch, Exponential Function, and Gamma Function

Publication Date: 1993

Research Interests:
Database Design

Download (.pdf)

Publication Date: 2004

Publication Name: Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004.

Research Interests:
Signal Processing, Speech Recognition, Voice Conversion, Time Domain, and Frequency Domain

Download (.pdf)

Publication Date: 2005

Publication Name: Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.

Research Interests:
Speech Segmentation, Decision Tree, and Boundary Detection

Download (.pdf)

Publication Date: 2005

Publication Name: Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.

Research Interests:
Voice Conversion, LINEAR PREDICTIVE CODING, and Time Domain

Download (.pdf)

Abstract Many of the research efforts in voice morphing, or also called voice conversion (VC), has been carried out in the field of vocal tract mapping. It has been studied that in the vocal tract parameters there is the most relevant... more

Abstract Many of the research efforts in voice morphing, or also called voice conversion (VC), has been carried out in the field of vocal tract mapping. It has been studied that in the vocal tract parameters there is the most relevant part of the information about speaker ...

Publication Date: 2006

Publication Name: 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings

... The selected context dependent phones are those mphones and rigth context dependent phones which appear more than 100 times on the acoustic training data. 3 4 5 4. EVALUATION OF SETHOS 91.3 I 75.8 91.9 I 77.5 91.3 76.0 ...

Publication Date: 1996

Publication Name: Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96

Research Interests:
Spoken language

In this paper, the occupancy of the HMM states is modeled by means of a Markov chain. A linear estimator is introduced to compute the probabilities of the Markov chain. The distribution functions (DF) represents accurately the observed... more

In this paper, the occupancy of the HMM states is modeled by means of a Markov chain. A linear estimator is introduced to compute the probabilities of the Markov chain. The distribution functions (DF) represents accurately the observed data. Representing the DF as a Markov chain allows the use of standard HMM recognizers. The increase of complexity is negligible in

Publication Date: 1996

Publication Name: Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96

Research Interests:
Speech Recognition and Markov chain

Download (.pdf)

Publication Date: 2005

Publication Name: IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.

The synthesis quality is influenced by many important factors, among which the correctness of the grapheme-to-phoneme (g2p) conversion is one of the crucial ones. Automatic letter-to-sound systems have been in the center of attention for... more

The synthesis quality is influenced by many important factors, among which the correctness of the grapheme-to-phoneme (g2p) conversion is one of the crucial ones. Automatic letter-to-sound systems have been in the center of attention for the last decade. One of the most effective and promising methods resulted to be the so-called ldquopronunciation by analogyrdquo method, based on the analogy in the grapheme context, allowing derivation of the correct pronunciation for a new word from the parts of similar words present in the dictionary. This paper aims at further development of this method. Novel scoring strategies for determining the best pronunciations were proposed. A word error rate reduction of 1.5-2.5 percent is obtained. A detailed analysis shows that one of the new strategies consistently outperforms the others. The results obtained are compared to other g2p methods using the same data.

Publication Date: 2009

Publication Name: 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests:
Speech Synthesis

Publication Date: 2003

Publication Name: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

Research Interests:
Facial Animation and Facial Expression Recognition

Download (.pdf)

This paper presents the baseline text-to-speech system developed at UPC (Ogmios) plus our recent work on speech prosody generation and the procedures to create high quality language resources for speech synthesis. These contributions have... more

This paper presents the baseline text-to-speech system developed at UPC (Ogmios) plus our recent work on speech prosody generation and the procedures to create high quality language resources for speech synthesis. These contributions have been evaluated within the TC-STAR European project, which is focused on speech-to-speech translation. Several presented contributions have been developed in order to adapt the TTS component

Publication Date: 2000

Research Interests:
Speech Synthesis, Speech Recognition, Language Resources, System Development, Text to Speech, and 2 moreText to Speech synthesis and Speech Translation

Download (.pdf)

... of IEEE Conf. on Computer Vision, Puerto Rico, 1997. [9] С Padgett, G. Cottrell, Identifyingemotion in static face images, in Proc. Of the 2nd Joint Symp. on Neural Computation, Vol.5, pp.91-101, La Jolla, CA, Uni. of California, San... more

... of IEEE Conf. on Computer Vision, Puerto Rico, 1997. [9] С Padgett, G. Cottrell, Identifyingemotion in static face images, in Proc. Of the 2nd Joint Symp. on Neural Computation, Vol.5, pp.91-101, La Jolla, CA, Uni. of California, San Diego. ...

Publication Date: 2002

Publication Name: IEEE International Conference on Acoustics Speech and Signal Processing

Research Interests:
Speech Acoustics, Emotion Recognition, hidden Markov model, Facial Animation, and Facial Expression Recognition

... Full-size table. View Within Article. As training material we have used phonetically balanced sentences uttered by task-independent 680 speakers from four dialectal zones and including over 236 000 phonemes. This corpus comprises five... more

... Full-size table. View Within Article. As training material we have used phonetically balanced sentences uttered by task-independent 680 speakers from four dialectal zones and including over 236 000 phonemes. This corpus comprises five hours and a half of continuous speech. ...

Publication Date: 2000

Publication Name: Speech Communication

Research Interests:
Cognitive Science, Linguistics, Speech Communication, Parameter estimation, Speech, and 2 moreDecision Tree and Continuous Speech Recognition

Publication Date: 2013

Publication Name: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests:
Speech Processing and Gaussian processes

ABSTRACT In the literature many intonation models are trained using pa-rameters extracted sentence-by-sentence on contours interpolated in the unvoiced segments. This may introduce a bias in the final param-eters and a reduction of the... more

ABSTRACT In the literature many intonation models are trained using pa-rameters extracted sentence-by-sentence on contours interpolated in the unvoiced segments. This may introduce a bias in the final param-eters and a reduction of the generalization of the model due to ...

Publication Date: 2008

Publication Name: 2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests:
Speech Synthesis, Missing Data, IS success, Parameter Extraction, Training Algorithm, and 2 moreGaussian noise and Synthetic Data Generation

Publication Date: 2007

Publication Name: Lecture Notes in Computer Science

Research Interests:
Speech Synthesis

Download (.pdf)

Publication Date: 2010

Publication Name: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests:
Speech Synthesis, Speech Acoustics, and Point of View

Download (.pdf)

Publication Name: Primeras Jornadas de Tecnologıa …

Publisher: Citeseer

Publication Date: Jan 1, 2003

Publication Name: Proc. of …

Research Interests: Speech Recognition, Statistical Machine Translation, Language Resources, Spontaneous speech, and Speech Translation<div>()</div>

Research Interests: Language Model<div>()</div>

Publisher: isca-speech.org

Publication Date: 2009

Publication Name: Tenth Annual Conference of the …

Publisher: isca-speech.org

Publication Date: 2005

Publication Name: Ninth European Conference on Speech …

Publication Date: 2009

Research Interests: Natural Language Processing, Computational Linguistics, Greedy Algorithms, Algorithm, and Procesamiento del Lenguaje Natural<div>()</div>

Publication Date: 2000

Research Interests: Speech Recognition, Speech Processing, Voice Conversion, and Unit Selection<div>()</div>

Research Interests: Speech Recognition<div>()</div>

Research Interests: Speech Synthesis, English language, Database Design, Human Interaction, Speech analysis, and English Language<div>()</div>

Publication Date: 1993

Research Interests: Efficient Algorithm for ECG Coding<div>()</div>

Publication Date: 2006

Publication Name: International Conference on Acoustics, Speech, and Signal Processing

Research Interests: Speech Synthesis, Speech Acoustics, Speech Recognition, Text to Speech, Quality system, and Unit Selection<div>()</div>

Publication Date: 2000

Publication Date: 2000

Publication Date: 1993

Research Interests: Database Design<div>()</div>

Publication Date: 2004

Publication Name: Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004.

Research Interests: Signal Processing, Speech Recognition, Voice Conversion, Time Domain, and Frequency Domain<div>()</div>

Publication Date: 2005

Publication Name: Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.

Research Interests: Speech Segmentation, Decision Tree, and Boundary Detection<div>()</div>

Publication Date: 2005

Publication Name: Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.

Research Interests: Voice Conversion, LINEAR PREDICTIVE CODING, and Time Domain<div>()</div>

Publication Date: 2006

Publication Name: 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings

Publication Date: 1996

Publication Name: Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96

Research Interests: Spoken language<div>()</div>

Publication Date: 1996

Publication Name: Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96

Research Interests: Speech Recognition and Markov chain<div>()</div>

Publication Date: 2005

Publication Name: IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.

Publication Date: 2009

Publication Name: 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests: Speech Synthesis<div>()</div>

Publication Date: 2003

Publication Name: 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

Research Interests: Facial Animation and Facial Expression Recognition<div>()</div>

Publication Date: 2000

Publication Date: 2002

Publication Name: IEEE International Conference on Acoustics Speech and Signal Processing

Research Interests: Speech Acoustics, Emotion Recognition, hidden Markov model, Facial Animation, and Facial Expression Recognition<div>()</div>

Publication Date: 2000

Publication Name: Speech Communication

Publication Date: 2013

Publication Name: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests: Speech Processing and Gaussian processes<div>()</div>

Publication Date: 2008

Publication Name: 2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Publication Date: 2007

Publication Name: Lecture Notes in Computer Science

Research Interests: Speech Synthesis<div>()</div>

Publication Date: 2010

Publication Name: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing

Research Interests: Speech Synthesis, Speech Acoustics, and Point of View<div>()</div>

Log In

Research Interests:
Speech Recognition, Statistical Machine Translation, Language Resources, Spontaneous speech, and Speech Translation

Research Interests:
Language Model

Research Interests:
Natural Language Processing, Computational Linguistics, Greedy Algorithms, Algorithm, and Procesamiento del Lenguaje Natural

Research Interests:
Speech Recognition, Speech Processing, Voice Conversion, and Unit Selection

Research Interests:
Speech Recognition

Research Interests:
Speech Synthesis, English language, Database Design, Human Interaction, Speech analysis, and English Language

Research Interests:
Efficient Algorithm for ECG Coding

Research Interests:
Speech Synthesis, Speech Acoustics, Speech Recognition, Text to Speech, Quality system, and Unit Selection

Research Interests:
Database Design

Research Interests:
Signal Processing, Speech Recognition, Voice Conversion, Time Domain, and Frequency Domain

Research Interests:
Speech Segmentation, Decision Tree, and Boundary Detection

Research Interests:
Voice Conversion, LINEAR PREDICTIVE CODING, and Time Domain

Research Interests:
Spoken language

Research Interests:
Speech Recognition and Markov chain

Research Interests:
Speech Synthesis

Research Interests:
Facial Animation and Facial Expression Recognition

Research Interests:
Speech Acoustics, Emotion Recognition, hidden Markov model, Facial Animation, and Facial Expression Recognition

Research Interests:
Speech Processing and Gaussian processes

Research Interests:
Speech Synthesis

Research Interests:
Speech Synthesis, Speech Acoustics, and Point of View