Stochastic processes

Showing 1 - 20 of 21 Results

article
November 2008
A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
IEICE - Transactions on Information and Systems (TROIS), Volume E91-D, Issue 11, Pages 2693–2700, https://doi.org/10.1093/ietisy/e91-d.11.2693

In a hidden Markov model (HMM), state duration probabilities decrease exponentially with time, which fails to adequately represent the temporal structure of speech. One of the solutions to this problem is integrating state duration probability ...
Total Citations: 0
article
September 2008
HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
IEICE - Transactions on Information and Systems (TROIS), Volume E91-D, Issue 9, Pages 2360–2364, https://doi.org/10.1093/ietisy/e91-d.9.2360

In this paper, we propose a new mask estimation method for the computational auditory scene analysis (CASA) of speech using two microphones. The proposed method is based on a hidden Markov model (HMM) in order to incorporate an observation that the mask ...
Total Citations: 1
article
May 2007
A Hidden Semi-Markov Model-Based Speech Synthesis System
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 5, Pages 825–834, https://doi.org/10.1093/ietisy/e90-d.5.825

A statistical speech synthesis system based on the hidden Markov model (HMM) was recently proposed. In this system, spectrum, excitation, and duration of speech are modeled simultaneously by context-dependent HMMs, and speech parameter vector sequences ...
Total Citations: 19
article
March 2007
State Duration Modeling for HMM-Based Speech Synthesis
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 3, Pages 692–693, https://doi.org/10.1093/ietisy/e90-d.3.692

This paper describes the explicit modeling of a state duration's probability density function in HMM-based speech synthesis. We redefine, in a statistically correct manner, the probability of staying in a state for a time interval used to obtain the ...
Total Citations: 1
article
February 2007
Adaptive Tuning of Buffer Pool Size in Database Server Based on Iterative Algorithm
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 2, Pages 594–597, https://doi.org/10.1093/ietisy/e90-d.2.594

One of the factors greatly affecting the performance of a database server is the size-division of buffer pools. This letter proposes an adaptive control method of the buffer pool sizes. This method obtains the nearly optimal division using only observed ...
Total Citations: 0
article
February 2007
Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 2, Pages 554–561, https://doi.org/10.1093/ietisy/e90-d.2.554

In real-time speech recognition applications, there is a need to implement a fast and reliable adaptation algorithm. We propose a method to reduce adaptation time of the rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. We use ...
Total Citations: 0
article
February 2007
Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training
- Junichi Yamagishi,
- Takao Kobayashi
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 2, Pages 533–543, https://doi.org/10.1093/ietisy/e90-d.2.533

In speaker adaptation for speech synthesis, it is desirable to convert both voice characteristics and prosodic features such as F0 and phone duration. For simultaneous adaptation of spectrum, F0 and phone duration within the HMM framework, we need to ...
Total Citations: 13
article
February 2007
Constructing a Multilayered Boundary to Defend against Intrusive Anomalies
- Zonghua Zhang,
- Hong Shen
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 2, Pages 490–499, https://doi.org/10.1093/ietisy/e90-d.2.490

We propose a model for constructing a multilayered boundary in an information system to defend against intrusive anomalies by correlating a number of parametric anomaly detectors. The model formulation is based on two observations. First, anomaly ...
Total Citations: 0
article
January 2007
Two-Band Excitation for HMM-Based Speech Synthesis
- Sang-Jin Kim,
- Minsoo Hahn
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 1, Pages 378–381, https://doi.org/10.1093/ietisy/e90-1.1.378

This letter describes a two-band excitation model for HMM-based speech synthesis. The HMM-based speech synthesis system generates speech from the HMM training data of the spectral and excitation parameters. Synthesized speech has a typical quality of "...
Total Citations: 2
article
January 2007
Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005
IEICE - Transactions on Information and Systems (TROIS), Volume E90-D, Issue 1, Pages 325–333, https://doi.org/10.1093/ietisy/e90-1.1.325

In January 2005, an open evaluation of corpus-based text-to-speech synthesis systems using common speech datasets, named Blizzard Challenge 2005, was conducted. Nitech group participated in this challenge, entering an HMM-based speech synthesis system ...
Total Citations: 19
article
September 2006
Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast
IEICE - Transactions on Information and Systems (TROIS), Volume E89-D, Issue 9, Pages 2553–2561, https://doi.org/10.1093/ietisy/e89-d.9.2553

In this paper, we propose a robust statistical framework for extracting scenes from a baseball broadcast video. We apply multi-stream hidden Markov models (HMMs) to control the weights among different features. To achieve a large robustness against new ...
Total Citations: 2
article
July 2006
HHMM Based Recognition of Human Activity (This paper was presented at MVA2005.)
IEICE - Transactions on Information and Systems (TROIS), Volume E89-D, Issue 7, Pages 2180–2185, https://doi.org/10.1093/ietisy/e89-d.7.2180

In this paper, we present a method for recognition of human activity as a series of actions from an image sequence. The difficulty with the problem is that there is a chicken-egg dilemma that each action needs to be extracted in advance for its ...
Total Citations: 4
article
July 2006
Surface Reconstruction from Stereo Data Using a Three-Dimensional Markov Random Field Model
- Hotaka Takizawa,
- Shinji Yamamoto
IEICE - Transactions on Information and Systems (TROIS), Volume E89-D, Issue 7, Pages 2028–2035, https://doi.org/10.1093/ietisy/e89-d.7.2028

In the present paper, we propose a method for reconstructing the surfaces of objects from stereo data. Both the fitness of stereo data to surfaces and interrelation between the surfaces are defined in the framework of a three-dimensional (3-D) Markov ...
Total Citations: 1
article
May 2006
Non-saturated Throughput Analysis of IEEE 802.11 Ad Hoc Networks
- Changchun Xu,
- Zongkai Yang
IEICE - Transactions on Information and Systems (TROIS), Volume E89-D, Issue 5, Pages 1676–1678, https://doi.org/10.1093/ietisy/e89-d.5.1676

This letter presents a simple but accurate analytical model to evaluate the throughput of IEEE 802.11 distributed coordination function in non-saturated conditions. The influence of offered load on the throughput of both basic and RTS/CTS access ...
Total Citations: 2
article
March 2006
A Non-stationary Noise Suppression Method Based on Particle Filtering and Polyak Averaging
- Masakiyo Fujimoto,
- Satoshi Nakamura
IEICE - Transactions on Information and Systems (TROIS), Volume E89-D, Issue 3, Pages 922–930, https://doi.org/10.1093/ietisy/e89-d.3.922

This paper addresses a speech recognition problem in non-stationary noise environments: the estimation of noise sequences. To solve this problem, we present a particle filter-based sequential noise estimation method for front-end processing of speech ...
Total Citations: 1
article
March 2006
What HMMs Can Do
- Jeff A. Bilmes
IEICE - Transactions on Information and Systems (TROIS), Volume E89-D, Issue 3, Pages 869–891, https://doi.org/10.1093/ietisy/e89-d.3.869

Since their inception almost fifty years ago, hidden Markov models (HMMs) have become the predominant methodology for automatic speech recognition (ASR) systems; today, most state-of-the-art speech systems are HMM-based. There have been a number ...
Total Citations: 7
article
December 2005
Robust Speech Recognition Using Discrete-Mixture HMMs
IEICE - Transactions on Information and Systems (TROIS), Volume E88-D, Issue 12, Pages 2811–2818, https://doi.org/10.1093/ietisy/e88-d.12.2811

This paper introduces new methods of robust speech recognition using discrete-mixture HMMs (DMHMMs). The aim of this work is to develop robust speech recognition for adverse conditions that contain both stationary and non-stationary noise. In particular,...
Total Citations: 0
article
December 2005
Behavioral Analysis of a Fault-Tolerant Software System with Rejuvenation
- Koichiro Rinsaka,
- Tadashi Dohi
IEICE - Transactions on Information and Systems (TROIS), Volume E88-D, Issue 12, Pages 2681–2690, https://doi.org/10.1093/ietisy/e88-d.12.2681

In recent years, considerable attention has been devoted to continuously running software systems whose performance characteristics are smoothly degrading in time. Software aging often affects the performance of a software system and eventually causes ...
Total Citations: 5
article
September 2005
Tree-Structured Clustering Methods for Piecewise Linear-Transformation-Based Noise Adaptation
IEICE - Transactions on Information and Systems (TROIS), Volume E88-D, Issue 9, Pages 2168–2176, https://doi.org/10.1093/ietisy/e88-d.9.2168

This paper proposes the application of tree-structured clustering to the processing of noisy speech collected under various SNR conditions in the framework of piecewise-linear transformation (PLT)-based HMM adaptation for noisy speech. Three kinds of ...
Total Citations: 0
article
June 2005
Extension of Hidden Markov Models for Multiple Candidates and Its Application to Gesture Recognition
IEICE - Transactions on Information and Systems (TROIS), Volume E88-D, Issue 6, Pages 1239–1247, https://doi.org/10.1093/ietisy/e88-d.6.1239

We propose a modified Hidden Markov Model (HMM) with a view to improve gesture recognition using a moving camera. The conventional HMM is formulated so as to deal with only one feature candidate per frame. However, for a mobile robot, the background and ...
Total Citations: 0
