Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Custom architecture for multicore audio beamforming systems

Published: 30 September 2013 Publication History

Abstract

The audio Beamforming (BF) technique utilizes microphone arrays to extract acoustic sources recorded in a noisy environment. In this article, we propose a new approach for rapid development of multicore BF systems. Research on literature reveals that the majority of such experimental and commercial audio systems are based on desktop PCs, due to their high-level programming support and potential of rapid system development. However, these approaches introduce performance bottlenecks, excessive power consumption, and increased overall cost. Systems based on DSPs require very low power, but their performance is still limited. Custom hardware solutions alleviate the aforementioned drawbacks, however, designers primarily focus on performance optimization without providing a high-level interface for system control and test. In order to address the aforementioned problems, we propose a custom platform-independent architecture for reconfigurable audio BF systems. To evaluate our proposal, we implement our architecture as a heterogeneous multicore reconfigurable processor and map it onto FPGAs. Our approach combines the software flexibility of General-Purpose Processors (GPPs) with the computational power of multicore platforms. In order to evaluate our system we compare it against a BF software application implemented to a low-power Atom 330, a middle-ranged Core2 Duo, and a high-end Core i3. Experimental results suggest that our proposed solution can extract up to 16 audio sources in real time under a 16-microphone setup. In contrast, under the same setup, the Atom 330 cannot extract any audio sources in real time, while the Core2 Duo and the Core i3 can process in real time only up to 4 and 6 sources respectively. Furthermore, a Virtex4-based BF system consumes more than an order less energy compared to the aforementioned GPP-based approaches.

References

[1]
Acoustic Camera. 2013. http://www.acoustic-camera.com.
[2]
Analog Devices Inc. 2004. SHARC processor adsp-21262. http://www.analog.com/en/evaluation/21262-ezlite/eb.html.
[3]
Beracoechea, J., Torres-Guijarro, S., Garcia, L., and Casajus-Quiros, F. 2006. On building immersive audio applications using robust adaptive beamforming and joint audio-video source localization. EURASIP J. Appl. Signal Process. 2006, 196.
[4]
Berkhout, A., De Vries, D., and Vogel, P. 1993. Acoustic control by wave field synthesis. J. Acoust. Soc. Amer. 93, 2764--2778.
[5]
Buchner, H., Spors, S., Kellermann, W., and Rabenstein, R. 2002. Full-duplex communication systems using loudspeaker arrays and microphone arrays. In Proceedings of the IEEE International Conference on Multimedia and Expo. 509--512.
[6]
Cedrick, R. 2005. Documentation of the microphone array mark III. http://www.nist.gov/smartspace/downloads/Microphone_Array_Mark_III.pdf.
[7]
Cutler, R., Rui, Y., Gupta, A., Cadiz, J., Tashev, I., He, L., Colburn, A., Zhang Z., Liu, Z., and Silverberg, S. 2002. Distributed meetings: A meeting capture and broadcasting system. In Proceedings of the International Conference on Multimedia. 503--512.
[8]
Farina, A., Glasgal, R., Armelloni, E., and Torger, A. 2001. Ambiophonic principles for the recording and reproduction of surround. In Proceedings of the 19th AES International Conference.
[9]
Fiala, M., Green, D., and Roth, G. 2004. A panoramic video and acoustic beamforming sensor for videoconferencing. In Proceedings of the IEEE International Conference on Haptic, Audio and Visual Environments and their Applications. 47--52.
[10]
Fillinger, A., Diduch, L., Hamchi, I., Degre, S., and Stanford, V. 2007. NIST smart data flow system II: Speaker localization. In Proceedings of the 6th International Conference on Information Processing in Sensor Networks. 549--550.
[11]
Theile, G. 2001. Multichannel natural music recording based on pychoacoustics principles. In Proceedings of the 19th AES International Conference.
[12]
Intel Corporation. 2013. http://ark.intel.com/products/35641.
[13]
Intel Corporation. 2013. http://ark.intel.com/products/36500.
[14]
Intel Corporation. 2013. http://ark.intel.com/products/49020.
[15]
Kapralos, B., Jenkin, M., and Milios, E. 2003. Audio-visual localization of multiple speakers in a video teleconferencing setting. Int. J. Imaging Syst. Technol. 13, 1, 95--105.
[16]
Kyriakakis, C. 1998. Fundamental and technological limitations of immersive audio systems. Proc. IEEE 86, 941--951.
[17]
Mei, G., Xu, R., Lao, D., and Kwan, C. 2006. Real-time speaker verification with a microphone array. In Proceedings of the International Conference on Pervasive Systems and Computing.
[18]
Mihov, S.G., Gleghorn, T., and Tashev, I. 2008. Enhanced sound capture system for small devices. In Proceedings of the International Conference of Information, Communication and Energy Systems.
[19]
Mit Csail: Mit Project Oxygen. 2004. http://oxygen.lcs.mit.edu/.
[20]
Mouchtaris, A., Reveliotis, P., and Kyriakakis, C. 2000. Inverse of filter design for immersive audio rendering over loudspeakers. IEEE Trans. Multimedia 2, 77--87.
[21]
Nilsen, C.-I. C. and Hafizovic, I. 2009. Digital beamforming using a GPU. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. 609--612.
[22]
Polycom Inc. 2009. Polycom CX5000 unified conference station. http://www.polycom.com/products-services/products-for-microsoft/lync-optimized/cx5000-unified-conference-station.html.
[23]
Sallberg, B., Swartling, M., Grbic, N., and Ingvar C. 2006. Real-time implementation of a blind beamformer for subband speech enhancement using kurtosis maximization. In Proceedings of the International Workshop on Acoustic Echo and Noise Control. 485--489.
[24]
Snow, W. 1955. Basic principles of stereophonic sound. IRE Trans. Audio 2, 42--53.
[25]
Squarehead Technology. 2013. Audio scope zoom audio. http://www.sqhead.com/.
[26]
Teutsch, H., Spors, S., Herbordt, W., Kellermann, W., and Rabenstein, R. 2003. An integrated real-time system for immersive audio applications. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. 67--70.
[27]
Texas Instruments Inc. 2002. TMS320C62x/C67x power consumption summary. http://www.ti.com/lit/an/spra486c/spra486c.pdf.
[28]
Theodoropoulos, D., Kuzmanov, G., and Gaydadjiev, G. 2009. A reconfigurable beamformer for audio applications. In Proceedings of the IEEE Symposium on Application Specific Processors. 80--87.
[29]
Theodoropoulos, D., Kuzmanov, G., and Gaydadjiev, G. 2010. Minimalistic architecture for reconfigurable audio beamforming. In Proceedings of the International Conference on Field-Programmable Technology. 503--506.
[30]
Veen, B. V. and Buckley, K. 1988. Beamforming: A versatile approach to spatial filtering. IEEE ASSP Mag. 5, 4--24.
[31]
Wall, K. and Lockwood, G. R. 2005. Modern implementation of a realtime 3d beamformer and scan converter system. In Proceedings of the IEEE Ultrasonics Symposium. 1400--1403.
[32]
Weinstein, E., Steele, K., Agarwal, A., and Glass, J. 2004. LOUD: A 1020-node modular microphone array and beamformer for intelligent computing spaces. MIT/LCS Tech. memo MIT-LCS-TM-642.
[33]
Xilinx Inc. 2010a. The simple microblaze microcontroller concept. http://xilinx.eetrend.com/files-eetrend-xilinx/article/201105/1839-3354-xapp1141.pdf.
[34]
Xilinx Inc. 2010b. XPower estimator user guide. http://www.xilinx.com/support/documentation/sw_manuals/xilinx12_4/ise_n_power_user_guide.htm.
[35]
Xilinx Inc. 2007. Implementing a real-time beamformer on an fpga platform. XCell J. 36--40. http://www.techonline.com/electrical-engineers/education-training/tech-papers/4137944/Implementing-a-Real-Time-Beamformer-on-an-FPGA-Platform.
[36]
Yermeche, Z., Sallberg, B., Grbic, N., and Claesson, I. 2007. Real-time implementation of a subband beamforming algorithm for dual microphone speech enhancement. In Proceedings of the IEEE International Symposium on Circuits and Systems. 353--356.
[37]
Yiu, C. K., Ho, C. H., Lu, Y., Shi, X., and Luk, W. 2008. Reconfigurable acceleration of microphone array algorithms for speech enhancement. In Proceedings of the International Conference on Application-Specific Systems, Architectures and Processors. 203--208.

Index Terms

  1. Custom architecture for multicore audio beamforming systems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Embedded Computing Systems
      ACM Transactions on Embedded Computing Systems  Volume 13, Issue 2
      Special issue on application-specific processors
      September 2013
      254 pages
      ISSN:1539-9087
      EISSN:1558-3465
      DOI:10.1145/2514641
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Journal Family

      Publication History

      Published: 30 September 2013
      Accepted: 01 May 2012
      Revised: 01 February 2012
      Received: 01 March 2011
      Published in TECS Volume 13, Issue 2

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Immersive audio
      2. audio beamforming
      3. embedded processors
      4. reconfigurable computing
      5. reconfigurable processors

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 256
        Total Downloads
      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Feb 2025

      Other Metrics

      Citations

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media