2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM)
This paper presents an efficient deep learning framework for long-term monitoring of acoustic events from hydrophone big data. The large-scale noisy ONC (Ocean Networks Canada) data may contain rare acoustic events, which can be automatically recognized by utilizing a deep convolutional neural network. Few works have been reported on deep learning for the recognition of different kinds of marine mammal calls, which is nevertheless crucial for many applications such as marine navigation. In the proposed scheme, deep learning feature sets are adopted and processed by a support vector machine (SVM) classifier. The proposed method is tested with 28685 minutes of data, spanning a single year with 5573 whale calls/acoustic events, using a human operator's annotations. The experimental results show that the average recognition accuracy rates using deep feature learning are 98.69% (two-class) and 94.48% (multi-class), respectively, for the proposed recognition scheme, which outperforms the MFCC-based method.
2015 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM), 2015
The mathematical theory of the time-compression overlap-add scheme for radio communications is presented. A simple time-shifting model is developed to explore performance in AWGN and multipath Rayleigh fading channels. Equivalent multirate filter structures for both the transmit and receive processes are developed and used to analyze the spectrum of the transmitted signal as it relates to the spectrum of the message signal.
A Class-AB microwave-photonic link is presented as a means to reduce noise associated with DC-bias. By operating with two complementary, approximately half-wave-rectified, optical sources and generating the difference in a balanced detector, linear modulation with high modulation efficiency is achieved with near zero DC. Using two mode-converter intensity modulators, we demonstrate a 5 dB reduction in shot noise and elimination of intensity noise, relative to a quadrature-biased Mach-Zehnder, for the same received RF signal power. Index Terms: Microwave-photonic, analog links, subcarrier, low-bias, intensity-noise suppression, class AB.
This paper discusses an evolution in North Indian instruments through the design of technology to capture gestures from a performing artist. Modified traditional instruments use sensor technology and microcontrollers to digitize gestures, enabling a computer to analyze performance to synthesize sound and visual meaning. Specifically, systems were built to capture data from three traditional North Indian instruments: the tabla (a pair of tonal hand drums), the dholak (a barrel-shaped folk drum played by two people), and the sitar (a 19-stringed, gourd-shelled instrument). This paper discusses how these instruments are modified to capture gestural movement and how these signals are mapped to sounds and graphical feedback, and shows examples of the new instruments being used in live performance. The hardware is built to preserve the techniques passed down through generations of tradition; however, modified performance techniques with the aid of a laptop are also introduced.
Using MPEG-4 Advanced Audio Coder (AAC) music as an example of a streaming application, we investigate the improvement of error performance for the streaming service by link-adaptation and power-control techniques in an Enhanced General Packet Radio Services (EGPRS) cellular network. A low packet error rate and low error variability are essential in providing short error-burst length so that error concealment techniques can be effectively applied to music packets. In this paper, we study the effects of a combined link adaptation and power control scheme (referred to as the error-based scheme) for achieving a target error rate and reducing error variability. By simulation, we compare the error performance of the error-based scheme at both the EGPRS block and AAC frame level with another adaptation algorithm (referred to as the throughput-based scheme) whose goal is to maximize overall network throughput. It is found that when offered a similar traffic load, the former scheme can...
This convention paper has been reproduced from the author’s advance manuscript, without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.
2017 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM), 2017
We introduce a new scheme that integrates the Time Compression OverLap-Add (TC-OLA) spread spectrum technique into radar systems, more specifically the Linear Frequency Modulation Pulse Compression (LFM-PC) radar. This technique increases the signal-to-noise ratio (SNR) and, as a consequence, enables a greater processing gain compared to traditional LFM-PC radar systems. In addition, TC-OLA allows the radar designer to control the spreading of the signal and therefore provides better immunity against powerful jamming techniques. In our simulation, we extend the conventional LFM-PC radar model by appropriately adding Time Compression (TC) and Overlap-Add (OLA) blocks at the transmitter and receiver, respectively. The performance of the proposed system and of the conventional LFM system is evaluated under AWGN and under one of the smart jamming techniques, called Convolution Noise Jamming (CNJ), using different Constant False Alarm Rate (CFAR) algorithms, namely, Cell-Average (CA), Grea...
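The cell-averaging CFAR detector named in the abstract above can be illustrated with a minimal textbook sketch. This is not the authors' implementation: the function name `ca_cfar`, its parameters, and their default values are hypothetical, and the threshold multiplier assumes a square-law detector with exponentially distributed noise power.

```python
def ca_cfar(power, num_train=8, num_guard=2, pfa=1e-3):
    """Return indices of cells that exceed a cell-averaging CFAR threshold.

    power     : sequence of non-negative power samples (assumed square-law output)
    num_train : training cells on EACH side of the cell under test (hypothetical default)
    num_guard : guard cells on EACH side of the cell under test (hypothetical default)
    pfa       : desired probability of false alarm
    """
    n_total = 2 * num_train
    # Threshold multiplier for exponentially distributed noise power (assumption).
    alpha = n_total * (pfa ** (-1.0 / n_total) - 1.0)
    half = num_train + num_guard
    detections = []
    # Only test cells that have a full window of training cells on both sides.
    for i in range(half, len(power) - half):
        lead = power[i - half : i - num_guard]          # training cells before the guard band
        lag = power[i + num_guard + 1 : i + half + 1]   # training cells after the guard band
        noise = (sum(lead) + sum(lag)) / n_total        # local noise power estimate
        if power[i] > alpha * noise:
            detections.append(i)
    return detections
```

For example, calling `ca_cfar(power)` on a flat noise floor with one strong cell flags only that cell, because the guard cells keep the target's own energy out of the local noise estimate for the cell under test.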
The University of Victoria has a new centre for computer music and digital media called MISTIC (Music Intelligence and Sound Technology Interdisciplinary Centre). Research and educational activities pertaining to MISTIC are described.
In this paper, we review 2nd and 3rd generation wireless telephony systems, both in terms of technology and service concepts. We point out some of the technical and service issues that may affect the functionality and performance from the user point of view. In particular, we focus on one technical and one service concept important for long-term success of 3rd generation systems: transmit diversity to increase capacity (quantity), and context-sensitive applications to increase user ac-
This paper proposes a novel method for anomaly and quality detection of marine mammal sounds using a multitaper spectrogram and hydrophone big data. The proposed method aims to automatically detect anomalies, such as high-frequency vessel noise and Doppler noise, in sperm whale (SPW) sound, as well as to assess the quality of the sound. A new signature function derived from a multitaper spectrogram detects anomalies in the data, and a new anomaly distortion measure classifies the sound quality as good or bad. The proposed method is tested with 1905 minutes of data spanning a single year, using a human operator's annotations. The experimental results reveal that the proposed multitaper-spectrogram-based approach is efficient in detecting anomalies as well as sperm whale sound quality for hydrophone big data, and high detection accuracy (>85%) is achieved for raw input hydrophone data.
This paper presents a new radar system that combines any number and any kind of radar signals and transmits them simultaneously, thereby obtaining all the advantages of each of the radar waveforms used. This new system is based on a low-rate coded time compression overlap-add (TC-OLA) technique, and allows radar designers to address the challenge of having many features, such as high time resolution, high Doppler tolerance, low sidelobe level, and low probability of intercept (LPI), in a single waveform. Since different types of radar waveforms usually offer different features, the new system allows the designers to obtain optimality via pluralism. The performance evaluation of the new radar system shows its superiority when examined under noise and heavy-tailed clutter background over the classical radar system and other methods of combination by operating on $N$-radar signals simultaneously, making use of features from each waveform, canceling the Doppler from a Doppler-intolerant...
Acoustic musical instruments enable very rich and subtle control when used by experienced musicians. Musicology has traditionally focused on analysis of scores and more recently audio recordings. However, most music from around the world is not notated, and many nuances of music performance are hard to recover from audio recordings. In this chapter, we describe hyperinstruments, i.e., acoustic instruments that are augmented with digital sensors for capturing performance information and in some cases offering additional playing possibilities. Direct sensors are integrated onto the physical instrument, possibly requiring modifications. Indirect sensors such as cameras and microphones can be used to analyze performer gestures without requiring modifications to the instrument. We describe some representative case studies of hyperinstruments from our own research as well as some representative case studies of the types of musicological analysis one can perform using this approach, such ...
Papers by Peter Driessen