research-article

Custom architecture for multicore audio beamforming systems

Authors:

Dimitris Theodoropoulos,

Georgi Kuzmanov,

Georgi GaydadjievAuthors Info & Claims

ACM Transactions on Embedded Computing Systems (TECS), Volume 13, Issue 2

Article No.: 19, Pages 1 - 26

https://doi.org/10.1145/2514641.2514646

Published: 30 September 2013 Publication History

Abstract

The audio Beamforming (BF) technique utilizes microphone arrays to extract acoustic sources recorded in a noisy environment. In this article, we propose a new approach for rapid development of multicore BF systems. Research on literature reveals that the majority of such experimental and commercial audio systems are based on desktop PCs, due to their high-level programming support and potential of rapid system development. However, these approaches introduce performance bottlenecks, excessive power consumption, and increased overall cost. Systems based on DSPs require very low power, but their performance is still limited. Custom hardware solutions alleviate the aforementioned drawbacks, however, designers primarily focus on performance optimization without providing a high-level interface for system control and test. In order to address the aforementioned problems, we propose a custom platform-independent architecture for reconfigurable audio BF systems. To evaluate our proposal, we implement our architecture as a heterogeneous multicore reconfigurable processor and map it onto FPGAs. Our approach combines the software flexibility of General-Purpose Processors (GPPs) with the computational power of multicore platforms. In order to evaluate our system we compare it against a BF software application implemented to a low-power Atom 330, a middle-ranged Core2 Duo, and a high-end Core i3. Experimental results suggest that our proposed solution can extract up to 16 audio sources in real time under a 16-microphone setup. In contrast, under the same setup, the Atom 330 cannot extract any audio sources in real time, while the Core2 Duo and the Core i3 can process in real time only up to 4 and 6 sources respectively. Furthermore, a Virtex4-based BF system consumes more than an order less energy compared to the aforementioned GPP-based approaches.

References

[1]

Acoustic Camera. 2013. http://www.acoustic-camera.com.

[2]

Analog Devices Inc. 2004. SHARC processor adsp-21262. http://www.analog.com/en/evaluation/21262-ezlite/eb.html.

[3]

Beracoechea, J., Torres-Guijarro, S., Garcia, L., and Casajus-Quiros, F. 2006. On building immersive audio applications using robust adaptive beamforming and joint audio-video source localization. EURASIP J. Appl. Signal Process. 2006, 196.

Digital Library

[4]

Berkhout, A., De Vries, D., and Vogel, P. 1993. Acoustic control by wave field synthesis. J. Acoust. Soc. Amer. 93, 2764--2778.

[5]

Buchner, H., Spors, S., Kellermann, W., and Rabenstein, R. 2002. Full-duplex communication systems using loudspeaker arrays and microphone arrays. In Proceedings of the IEEE International Conference on Multimedia and Expo. 509--512.

[6]

Cedrick, R. 2005. Documentation of the microphone array mark III. http://www.nist.gov/smartspace/downloads/Microphone_Array_Mark_III.pdf.

[7]

Cutler, R., Rui, Y., Gupta, A., Cadiz, J., Tashev, I., He, L., Colburn, A., Zhang Z., Liu, Z., and Silverberg, S. 2002. Distributed meetings: A meeting capture and broadcasting system. In Proceedings of the International Conference on Multimedia. 503--512.

Digital Library

[8]

Farina, A., Glasgal, R., Armelloni, E., and Torger, A. 2001. Ambiophonic principles for the recording and reproduction of surround. In Proceedings of the 19^th AES International Conference.

[9]

Fiala, M., Green, D., and Roth, G. 2004. A panoramic video and acoustic beamforming sensor for videoconferencing. In Proceedings of the IEEE International Conference on Haptic, Audio and Visual Environments and their Applications. 47--52.

[10]

Fillinger, A., Diduch, L., Hamchi, I., Degre, S., and Stanford, V. 2007. NIST smart data flow system II: Speaker localization. In Proceedings of the 6^th International Conference on Information Processing in Sensor Networks. 549--550.

Digital Library

[11]

Theile, G. 2001. Multichannel natural music recording based on pychoacoustics principles. In Proceedings of the 19^th AES International Conference.

[12]

Intel Corporation. 2013. http://ark.intel.com/products/35641.

[13]

Intel Corporation. 2013. http://ark.intel.com/products/36500.

[14]

Intel Corporation. 2013. http://ark.intel.com/products/49020.

[15]

Kapralos, B., Jenkin, M., and Milios, E. 2003. Audio-visual localization of multiple speakers in a video teleconferencing setting. Int. J. Imaging Syst. Technol. 13, 1, 95--105.

[16]

Kyriakakis, C. 1998. Fundamental and technological limitations of immersive audio systems. Proc. IEEE 86, 941--951.

[17]

Mei, G., Xu, R., Lao, D., and Kwan, C. 2006. Real-time speaker verification with a microphone array. In Proceedings of the International Conference on Pervasive Systems and Computing.

[18]

Mihov, S.G., Gleghorn, T., and Tashev, I. 2008. Enhanced sound capture system for small devices. In Proceedings of the International Conference of Information, Communication and Energy Systems.

[19]

Mit Csail: Mit Project Oxygen. 2004. http://oxygen.lcs.mit.edu/.

[20]

Mouchtaris, A., Reveliotis, P., and Kyriakakis, C. 2000. Inverse of filter design for immersive audio rendering over loudspeakers. IEEE Trans. Multimedia 2, 77--87.

Digital Library

[21]

Nilsen, C.-I. C. and Hafizovic, I. 2009. Digital beamforming using a GPU. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. 609--612.

Digital Library

[22]

Polycom Inc. 2009. Polycom CX5000 unified conference station. http://www.polycom.com/products-services/products-for-microsoft/lync-optimized/cx5000-unified-conference-station.html.

[23]

Sallberg, B., Swartling, M., Grbic, N., and Ingvar C. 2006. Real-time implementation of a blind beamformer for subband speech enhancement using kurtosis maximization. In Proceedings of the International Workshop on Acoustic Echo and Noise Control. 485--489.

[24]

Snow, W. 1955. Basic principles of stereophonic sound. IRE Trans. Audio 2, 42--53.

[25]

Squarehead Technology. 2013. Audio scope zoom audio. http://www.sqhead.com/.

[26]

Teutsch, H., Spors, S., Herbordt, W., Kellermann, W., and Rabenstein, R. 2003. An integrated real-time system for immersive audio applications. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 67--70.

[27]

Texas Instruments Inc. 2002. TMS320C62x/C67x power consumption summary. http://www.ti.com/lit/an/spra486c/spra486c.pdf.

[28]

Theodoropoulos, D., Kuzmanov, G., and Gaydadjiev, G. 2009. A reconfigurable beamformer for audio applications. In Proceedings of the IEEE Symposium on Application Specific Processors. 80--87.

[29]

Theodoropoulos, D., Kuzmanov, G., and Gaydadjiev, G. 2010. Minimalistic architecture for reconfigurable audio beamforming. In Proceedings of the International Conference on Field-Programmable Technology. 503--506.

[30]

Veen, B. V. and Buckley, K. 1988. Beamforming: A versatile approach to spatial filtering. IEEE ASSP Mag. 5, 4--24.

[31]

Wall, K. and Lockwood, G. R. 2005. Modern implementation of a realtime 3d beamformer and scan converter system. In Proceedings of the IEEE Ultrasonics Symposium. 1400--1403.

[32]

Weinstein, E., Steele, K., Agarwal, A., and Glass, J. 2004. LOUD: A 1020-node modular microphone array and beamformer for intelligent computing spaces. MIT/LCS Tech. memo MIT-LCS-TM-642.

[33]

Xilinx Inc. 2010a. The simple microblaze microcontroller concept. http://xilinx.eetrend.com/files-eetrend-xilinx/article/201105/1839-3354-xapp1141.pdf.

[34]

Xilinx Inc. 2010b. XPower estimator user guide. http://www.xilinx.com/support/documentation/sw_manuals/xilinx12_4/ise_n_power_user_guide.htm.

[35]

Xilinx Inc. 2007. Implementing a real-time beamformer on an fpga platform. XCell J. 36--40. http://www.techonline.com/electrical-engineers/education-training/tech-papers/4137944/Implementing-a-Real-Time-Beamformer-on-an-FPGA-Platform.

[36]

Yermeche, Z., Sallberg, B., Grbic, N., and Claesson, I. 2007. Real-time implementation of a subband beamforming algorithm for dual microphone speech enhancement. In Proceedings of the IEEE International Symposium on Circuits and Systems. 353--356.

[37]

Yiu, C. K., Ho, C. H., Lu, Y., Shi, X., and Luk, W. 2008. Reconfigurable acceleration of microphone array algorithms for speech enhancement. In Proceedings of the International Conference on Application-Specific Systems, Architectures and Processors. 203--208.

Digital Library

Index Terms

Custom architecture for multicore audio beamforming systems
1. Hardware
  1. Emerging technologies
  2. Very large scale integration design

Recommendations

Wave field synthesis for 3D audio: architectural prospectives
CF '09: Proceedings of the 6th ACM conference on Computing frontiers

In this paper, we compare the architectural perspectives of the Wave Field Synthesis (WFS) 3D-audio algorithm mapped on three different platforms: a General Purpose Processor (GPP), a Graphics Processor Unit (GPU) and a Field Programmable Gate Array (...
A Many-Core Co-Processor for Embedded Parallel Computing on FPGA
DSD '15: Proceedings of the 2015 Euromicro Conference on Digital System Design

Single processor architectures are unable to provide the required performance of high performance embedded systems. Parallel processing based on general-purpose processors can achieve these performances with a considerable increase of required ...
A Reconfigurable Processor Architecture Combining Multi-core and Reconfigurable Processing Unit
CIT '10: Proceedings of the 2010 10th IEEE International Conference on Computer and Information Technology

It’s a promising way to improve performance significantly by adding reconfigurable processing unit to a general purpose processor. In this paper, a Reconfigurable Multi-Core (RMC) architecture combining general multi-core and reconfigurable logic is ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Embedded Computing Systems

ACM Transactions on Embedded Computing Systems Volume 13, Issue 2

Special issue on application-specific processors

September 2013

254 pages

ISSN:1539-9087

EISSN:1558-3465

DOI:10.1145/2514641

Issue’s Table of Contents

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

ACM Journals for the Design of Smart and Connected Systems

Publication History

Published: 30 September 2013

Accepted: 01 May 2012

Revised: 01 February 2012

Received: 01 March 2011

Published in TECS Volume 13, Issue 2

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

Seventh Framework Programme
Sixth Framework Programme
Artemisia SMECY
Artemisia project

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
256
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Issue’s Table of Contents