Erich Zwyssig

Bruno Kessler Foundation, Speech-Acoustic Scene Analysis and Interpretation, Department Member

Followers

Following

Public Views

Interests

Uploads

Papers by Erich Zwyssig

Speech processing using digital MEMS microphones

Download

Recognition of overlapping speech using digital MEMS microphone arrays

The Sheffield Wargames Corpus

Download

Determining the number of speakers in a meeting using microphone array features

Download

On the effect of SNR and superdirective beamforming in speaker diarisation in meetings

This paper examines the effect of sensor performance on speaker diarisation in meetings and inves... more This paper examines the effect of sensor performance on speaker diarisation in meetings and investigates the use of more advanced beamforming techniques, beyond the typically employed delay-sum beamformer, for mitigating the effects of poorer sensor performance. We present superdirective beamforming and investigate how different time difference of arrival (TDOA) smoothing and beamforming techniques influence the performance of state-of-the-art diarisation systems. We produced and transcribed a new corpus of meetings recorded in the instrumented meeting room using a high SNR analogue and a newly developed low SNR digital MEMS microphone array (DMMA.2). This research demonstrates that TDOA smoothing has a significant effect on the diarisation error rate and that simple noise reduction and beamforming schemes suffice to overcome audio signal degradation due to the lower SNR of modern MEMS microphones.

Digital microphone array-design, implementation and speech recognition experiments

Download

A digital microphone array for distant speech recognition

In this paper, the design, implementation and testing of a digital microphone array is presented.... more In this paper, the design, implementation and testing of a digital microphone array is presented. The array uses digital MEMS microphones which integrate the microphone, amplifier and analogue to digital converter on a single chip in place of the analogue microphones and external audio interfaces currently used. The device has the potential to be smaller, cheaper and more flexible than typical analogue arrays, however the effect on speech recognition performance of using digital microphones is as yet unknown. In order to evaluate the effect, an analogue array and the new digital array are used to simultaneously record test data for a speech recognition experiment. Initial results employing no adaptation show that performance using the digital array is significantly worse (14% absolute WER) than the analogue device. Subsequent experiments using MLLR and CMLLR channel adaptation reduce this gap, and employing MLLR for both channel and speaker adaptation reduces the difference between the arrays to 4.5% absolute WER.

Download