Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Distributed Combined Acoustic Echo Cancellation and Noise Reduction in Wireless Acoustic Sensor and Actuator Networks

Published: 05 January 2022 Publication History

Abstract

The paper presents distributed algorithms for combined acoustic echo cancellation (AEC) and noise reduction (NR) in a wireless acoustic sensor and actuator network (WASAN) where each node may have multiple microphones and multiple loudspeakers, and where the desired signal is a speech signal. A centralized integrated AEC and NR algorithm, i.e., multichannel Wiener filter (MWF), is used as starting point where echo signals are viewed as background noise signals and loudspeaker signals are used as additional input signals to the algorithm. By including prior knowledge (PK), namely that the loudspeaker signals do not contain any desired signal component, an alternative centralized cascade algorithm (PK-MWF) is obtained with an AEC stage first followed by an MWF-based NR stage which has a lower computational complexity. Distributed algorithms can then be obtained from the MWF and PK-MWF algorithm, i.e., the generalized eigenvalue decomposition (GEVD)-based distributed adaptive node-specific signal estimation (DANSE) and PK-GEVD-DANSE algorithm, respectively. In the former, each node performs a reduced dimensional integrated AEC and NR algorithm and broadcasts only 1 fused signal (instead of all its signals) to the other nodes. In the PK-GEVD-DANSE algorithm, each node performs a reduced dimensional cascade AEC and NR algorithm and broadcasts only 2 fused signals (instead of all its signals) to the other nodes. The distributed algorithms achieve the same performance, upon convergence, as the corresponding centralized integrated (MWF) and centralized cascade (PK-MWF) algorithm. It is observed, however, that the communication cost in the PK-GEVD-DANSE algorithm can also be reduced, where each node then broadcasts only 1 fused signal (instead of 2 signals) to the other nodes. The resulting algorithm, referred to as the pruned PK-GEVD-DANSE (pPK-GEVD-DANSE) algorithm, then effectively combines the lowest possible communication cost (as low as in the GEVD-DANSE algorithm) with a lowest possible computational complexity in each node (further reduced from the PK-GEVD-DANSE computational complexity), within the class of algorithms considered in this paper.

References

[1]
W. Herbordt, W. Kellermann, and S. Nakamura, “Joint optimization of acoustic echo cancellation and adaptive beamforming,” in Topics in Acoustic Echo and Noise Control, E. Hänsler and G. Schmidt, Eds. Berlin, Germany: Springer, 2006, pp. 19–50.
[2]
J. Benesty, J. R. Jensen, M. G. Christensen, and J. Chen, Speech Enhancement: A Signal Subspace Perspective. New York, NY, USA: Elsevier, 2014.
[3]
E. Böhmler, J. Freudenberger, and S. Stenzel, “Combined echo and noise reduction for distributed microphones,” in Proc. Joint Workshop Hands-Free Speech Commun. Microphone Arrays. Edinburgh, U.K., 2011, pp. 98–103.
[4]
S. Gustafsson, R. Martin, and P. Vary, “Combined acoustic echo control and noise reduction for hands-free telephony,” Signal Process., vol. 64, no. 1, pp. 21–32, 1998.
[5]
M. M. Sondhi, “An adaptive echo canceller,” Bell Syst. Tech. J., vol. 46, no. 3, pp. 497–511, 1967.
[6]
M. M. Sondhi, “The history of echo cancellation,” IEEE Signal Process. Mag., vol. 23, no. 5, pp. 95–102, Sep. 2006.
[7]
J. Benesty, T. Gänsler, D. R. Morgan, M. M. Sondhi, and S. L. Gay, Advances in Network and Acoustic Echo Cancellation. Berlin, Germany: Springer, 2001.
[8]
W. Kellermann, “Strategies for combining acoustic echo cancellation and adaptive beamforming microphone arrays,” Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, pp. 219–222, 1997.
[9]
G. Enzner and P. Vary, “Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones,” Signal Process., vol. 86, no. 6, pp. 1140–1156, 2006.
[10]
K. Sridhar et al., “ICASSP 2021 acoustic echo cancellation challenge: Datasets, testing framework, and results,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2021, pp. 151–155.
[11]
A. Fazel, M. El-Khamy, and J. Lee, “CAD-AEC: Context-aware deep acoustic echo cancellation,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Barcelona, Spain, 2020, pp. 6919–6923.
[12]
G. Rombouts and M. Moonen, “An integrated approach to acoustic noise and echo cancellation,” Signal Process., vol. 85, no. 4, pp. 849–871, 2005.
[13]
W. Herbordt and W. Kellermann, “Frequency-domain integration of acoustic echo cancellation and a generalized sidelobe canceller with improved robustness,” Eur. Trans. Telecommun., vol. 13, no. 2, pp. 123–132, 2002.
[14]
S. J. Park, C. G. Cho, C. Lee, and D. H. Youn, “Integrated echo and noise canceler for hands-free applications,” IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 49, no. 3, pp. 188–195, Mar. 2002.
[15]
A. Cohen, A. Barnov, S. Markovich-Golan, and P. Kroon, “Joint beamforming and echo cancellation combining QRD based multichannel AEC and MVDR for reducing noise and non-linear echo,” in Proc. 26th Eur. Signal Process. Conf., Rome, Italy, 2018, pp. 6–10.
[16]
W. Kellermann, “Joint design of acoustic echo cancellation and adaptive beamforming for microphone arrays,” in Proc. Int. Workshop Acoust. Echo Noise Control, 1997, pp. 81–84.
[17]
P. Meyer, S. Elshamy, and T. Fingscheidt, “A multichannel Kalman-based Wiener filter approach for speaker interference reduction in meetings,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Barcelona, Spain, 2020, pp. 451–455.
[18]
K. Nathwani, “Joint acoustic echo and noise cancellation using spectral domain Kalman filtering in double-talk scenario,” in Proc. Int. Workshop Acoust. Signal Enhancement, Tokyo, Japan, 2018, pp. 1–330.
[19]
H. Zhang, K. Tan, and D. Wang, “Deep learning for joint acoustic echo and noise cancellation with nonlinear distortions,” in Proc. 20th Annu. Conf. Int. Speech Commun. Assoc., Graz, Austria, 2019, pp. 4255–4259.
[20]
H. Seo, M. Lee, and J.-H. Chang, “Integrated acoustic echo and background noise suppression based on stacked deep neural networks,” Appl. Acoust., vol. 133, pp. 194–201, 2018.
[21]
I. F. Akyildiz, T. Melodia, and K. R. Chowdhury, “A survey on wireless multimedia sensor networks,” Comput. Netw., vol. 51, no. 4, pp. 921–960, 2007.
[22]
Y. Zeng and R. C. Hendriks, “Distributed delay and sum beamformer for speech enhancement via randomized gossip,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 22, no. 1, pp. 260–273, Jan. 2014.
[23]
R. Heusdens, G. Zhang, R. C. Hendriks, Y. Zeng, and W. B. Kleijn, “Distributed MVDR beamforming for (wireless) microphone networks using message passing,” in Proc. Int. Workshop Acoust. Signal Enhancement, Aachen, Germany, 2012, pp. 1–4.
[24]
A. Bertrand and M. Moonen, “Distributed adaptive node-specific signal estimation in fully connected sensor networks—Part II: Simultaneous and asynchronous node updating,” IEEE Trans. Signal Process., vol. 58, no. 10, pp. 5292–5306, Oct. 2010.
[25]
E. Ceolini and S. C. Liu, “Combining deep neural networks and beamforming for real-time multi-channel speech enhancement using a wireless acoustic sensor network,” in Proc. Int. Workshop Mach. Learn. Signal Process., Pittsburgh, PA, USA, 2019, pp. 1–6.
[26]
S. Ruiz, T. van Waterschoot, and M. Moonen, “Distributed combined acoustic echo cancellation and noise reduction using GEVD-based distributed adaptive node specific signal estimation with prior knowledge,” in Proc. 28th Eur. Signal Process. Conf., Amsterdam, The Netherlands, 2021, pp. 206–210.
[27]
R. Van Rompaey and M. Moonen, “Distributed adaptive node-specific signal estimation in a wireless sensor network with partial prior knowledge of the desired source steering vector,” in Proc. 27th Eur. Signal Process. Conf., A Coruña, Spain, 2019, pp. 1–5.
[28]
M. H. Bahari, A. Bertrand, and M. Moonen, “Blind sampling rate offset estimation for wireless acoustic sensor networks through weighted least-squares coherence drift estimation,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 25, no. 3, pp. 674–686, Mar. 2017.
[29]
S. Markovich-Golan, S. Gannot, and I. Cohen, “Blind sampling rate offset estimation and compensation in wireless acoustic sensor networks with application to beamforming,” in Proc. Int. Workshop Acoust. Signal Enhancement, 2012, pp. 1–4.
[30]
A. Chinaev, P. Thüne, and G. Enzner, “A Double-cross-correlation processor for blind sampling rate offset estimation in acoustic sensor networks,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2019, pp. 641–645.
[31]
R. Serizel, M. Moonen, B. Van Dijk, and J. Wouters, “Low-rank approximation based multichannel Wiener filter algorithms for noise reduction with application in cochlear implants,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 22, no. 4, pp. 785–799, Apr. 2014.
[32]
J. Benesty, J. Chen, Y. A. Huang, and S. Doclo, “Study of the Wiener filter for noise reduction,” in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, Eds., Berlin, Germany: Springer, 2005, pp. 9–41.
[33]
A. Bertrand and M. Moonen, “Robust distributed noise reduction in hearing aids with external acoustic sensor nodes,” EURASIP J. Adv. Signal Process., vol. 2009, pp. 1–14, 2009.
[34]
F. Jabloun and B. Champagne, “Signal subspace techniques for speech enhancement,” in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, Eds., Berlin, Germany: Springer, 2005, pp. 135–159.
[35]
S. Doclo, S. Gannot, M. Moonen, and A. Spriet, “Acoustic beamforming for hearing aid applications,” in Handbook on Array Processing and Sensor Networks, S. Haykin and K. R. Liu, Eds., New York, NY, USA: Wiley, 2010, pp. 269–302.
[36]
J. Szurley, A. Bertrand, and M. Moonen, “Improved tracking performance for distributed node-specific signal enhancement in wireless acoustic sensor networks,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Vancouver, Canada, 2013, pp. 336–340.
[37]
S. Haykin, Adaptive Filter Theory. Englewood Cliffs, NJ, USA: Prentice-Hall, 1996.
[38]
G. Rombouts and M. Moonen, “QRD-based unconstrained optimal filtering for acoustic noise reduction,” Signal Process., vol. 83, no. 9, pp. 1889–1904, 2003.
[39]
E. De Sena, N. Antonello, M. Moonen, and T. van Waterschoot, “On the modeling of rectangular geometries in room acoustic simulations,” IEEE/ACM Trans. Audio Speech Lang. Process., vol. 23, no. 4, pp. 774–786, Apr. 2015.
[40]
V. Berisha, H. Kwon, and A. Spanias, “Real-time implementation of a distributed voice activity detector,” in Proc. 4th IEEE Workshop Sensor Array Multichannel Process., Waltham, MA, USA, 2006, pp. 659–662.
[41]
S. Maraboina, D. Kolossa, P. Bora, and R. Orglmeister, “Multi-speaker voice activity detection using ICA and beampattern analysis,” in Proc. 14th Eur. Signal Process. Conf., Florence, Italy, 2006, pp. 1–5.
[42]
Y. Zhao, J. K. Nielsen, J. Chen, and M. G. Christensen, “Model-based distributed node clustering and multi-speaker speech presence probability estimation in wireless acoustic sensor networks,” J. Acoust. Soc. Amer., vol. 147, no. 6, pp. 4189–4201, 2020.
[43]
C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, “A short-time objective intelligibility measure for time-frequency weighted noisy speech,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2010, pp. 4214–4217.
[44]
ITU-T Rec. P.862, “Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs,” in Proc. Int. Telecommun. Union, Geneva, Switzerland, 2001.

Cited By

View all
  • (2023)Cascade algorithms for combined acoustic feedback cancelation and noise reductionEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-023-00296-52023:1Online publication date: 21-Sep-2023

Index Terms

  1. Distributed Combined Acoustic Echo Cancellation and Noise Reduction in Wireless Acoustic Sensor and Actuator Networks
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image IEEE/ACM Transactions on Audio, Speech and Language Processing
          IEEE/ACM Transactions on Audio, Speech and Language Processing  Volume 30, Issue
          2022
          3239 pages
          ISSN:2329-9290
          EISSN:2329-9304
          Issue’s Table of Contents

          Publisher

          IEEE Press

          Publication History

          Published: 05 January 2022
          Published in TASLP Volume 30

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)6
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 16 Oct 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2023)Cascade algorithms for combined acoustic feedback cancelation and noise reductionEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-023-00296-52023:1Online publication date: 21-Sep-2023

          View Options

          Get Access

          Login options

          Full Access

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media