About Me by Ian McLoughlin
Research by Ian McLoughlin
Various topics in wireless communications and networking explained (with links to papers)
This webpage overviews most of my recent speech work, with links to various papers.
Books by Ian McLoughlin
Applied Speech and Audio Processing is a MATLAB-based, one-stop resource that blends speech and h... more Applied Speech and Audio Processing is a MATLAB-based, one-stop resource that blends speech and hearing research in describing the key techniques of speech and audio processing. This practically oriented text provides MATLAB examples throughout to illustrate the concepts discussed and to give the reader hands-on experience with important techniques. Chapters on basic audio processing and the characteristics of speech and hearing lay the foundations of speech signal processing, which are built upon in subsequent sections explaining audio handling, coding, compression, and analysis techniques. The final chapter explores a number of advanced topics that use these techniques, including psychoacoustic modelling, a subject which underpins MP3 and related audio formats. With its hands-on nature and numerous MATLAB examples, this book is ideal for graduate students and practitioners working with speech or audio systems.
Papers by Ian McLoughlin
Abstract—Voice-over-IP is expected to become a popular service offered by the internet. Thus, it ... more Abstract—Voice-over-IP is expected to become a popular service offered by the internet. Thus, it is important to ensure high quality of service. In this paper, we look at two standards proposed for evaluating the intelligibility of Chinese speech. Adopting the philosophy and methodology of the Diagnostic Rhyme Test (DRT) for testing English speech, the Chinese Diagnostic Rhyme Test (CDRT) evaluates the six elementary phonemic attributes of Chinese words.
Proceedings of 13th International Conference on Digital Signal Processing, 2000
CELP coders commonly use line spectral pairs (LSP) to represent linear prediction parameters, giv... more CELP coders commonly use line spectral pairs (LSP) to represent linear prediction parameters, giving stable filters and efficient coding. However, manipulation of LSPs can alter frequencies within the represented signals. This paper describes two computationally efficient LSP-based processing methods designed to enhance the intelligibility of speech degraded by acoustic interference
2009 Ieee 20th International Symposium on Personal Indoor and Mobile Radio Communications, Sep 1, 2009
The 9th International Symposium on Chinese Spoken Language Processing, Sep 1, 2014
Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays - FPGA '14, 2014
PloS one, 2014
A key problem in spoken language identification (LID) is to design effective representations whic... more A key problem in spoken language identification (LID) is to design effective representations which are specific to language information. For example, in recent years, representations based on both phonotactic and acoustic features have proven their effectiveness for LID. Although advances in machine learning have led to significant improvements, LID performance is still lacking, especially for short duration speech utterances. With the hypothesis that language information is weak and represented only latently in speech, and is largely dependent on the statistical properties of the speech content, existing representations may be insufficient. Furthermore they may be susceptible to the variations caused by different speakers, specific content of the speech segments, and background noise. To address this, we propose using Deep Bottleneck Features (DBF) for spoken LID, motivated by the success of Deep Neural Networks (DNN) in speech recognition. We show that DBFs can form a low-dimensio...
Proceedings of the IEEE 6th Circuits and Systems Symposium on Emerging Technologies: Frontiers of Mobile and Wireless Communication (IEEE Cat. No.04EX710), 2004
Circuits, Systems, and Signal Processing, 2014
ABSTRACT This paper develops, simulates and experimentally evaluates a novel method based on non-... more ABSTRACT This paper develops, simulates and experimentally evaluates a novel method based on non-contact low frequency (LF) ultrasound which can determine, from airborne reflection, whether the lips of a subject are open or closed. The method is capable of accurately distinguishing between open and closed lip states through the use of a low-complexity detection algorithm, and is highly robust to interfering audible noise. A novel voice activity detector is implemented and evaluated using the proposed method and shown to detect voice activity with high accuracy, even in the presence of high levels of background noise. The lip state detector is evaluated at a number of angles of incidence to the mouth and under various conditions of background noise. The underlying mouth state detection technique relies upon an inaudible LF ultrasonic excitation, generated in front of the face of a user, either reflecting back from their face as a simple echo in the closed mouth state or resonating inside the open mouth and vocal tract, affecting the spectral response of the reflected wave when the mouth is open. The difference between echo and resonance behaviours is used as the basis for automated lip opening detection, which implies determining whether the mouth is open or closed at the lips. Apart from this, potential applications include use in voice generation prosthesis for speech impaired patients, or as a hands-free control for electrolarynx and similar rehabilitation devices. It is also applicable to silent speech interfaces and may have use for speech authentication.
2014 IEEE 79th Vehicular Technology Conference (VTC Spring), 2014
2006 IEEE Singapore International Conference on Communication Systems, ICCS 2006, 2006
Abstract This paper presents the performance investigation and FPGA implementation aspects of a r... more Abstract This paper presents the performance investigation and FPGA implementation aspects of a real-time adaptive MIMO DFE system. MIMO communication systems have been a strong research topic for several years because of spectral efficiency advantages, ...
Uploads
About Me by Ian McLoughlin
Research by Ian McLoughlin
Books by Ian McLoughlin
Papers by Ian McLoughlin