An Audio-Visual Method for Room Boundary Estimation and Material Recognition

In applications such as virtual and augmented reality, plausible and coherent audio-visual reproduction requires a detailed understanding of the acoustics of the reference scene, which in turn requires knowledge of the scene geometry and the materials of its surfaces. In this paper, we present an audio-visual approach to acoustic scene understanding. We propose a novel material recognition algorithm that exploits the information carried by acoustic signals, using acoustic absorption coefficients as features. The training dataset was constructed by combining data available in the literature with additional labeled data that we recorded in a small room with a short reverberation time (RT60). The model is validated with classic machine learning methods on data recorded in five rooms of different sizes and RT60s. The estimated materials are then used to label the room boundaries reconstructed by a vision-based method. Results show 89% agreement between the estimated and reference room volumes, and 80% agreement between the estimated and reference materials.
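As a rough illustration of the classification step, the sketch below applies a k-nearest-neighbour vote (kNN is listed among the author tags) to absorption-coefficient feature vectors. It is a minimal sketch under stated assumptions, not the authors' implementation: the six octave bands, the material labels, the choice of k = 3, the Euclidean distance, and every numeric value are hypothetical placeholders chosen only to make the example run.

```python
import math
from collections import Counter

# Hypothetical training set: one absorption coefficient per octave band
# (assumed bands: 125 Hz, 250 Hz, 500 Hz, 1 kHz, 2 kHz, 4 kHz).
# All values are illustrative placeholders, NOT measurements from the paper.
TRAINING = [
    ((0.01, 0.01, 0.02, 0.02, 0.02, 0.03), "concrete"),
    ((0.02, 0.02, 0.03, 0.03, 0.03, 0.04), "concrete"),
    ((0.05, 0.10, 0.25, 0.45, 0.60, 0.65), "carpet"),
    ((0.08, 0.12, 0.30, 0.50, 0.65, 0.70), "carpet"),
    ((0.30, 0.25, 0.18, 0.12, 0.08, 0.05), "glass"),
    ((0.35, 0.25, 0.18, 0.12, 0.07, 0.04), "glass"),
]


def knn_predict(features, training=TRAINING, k=3):
    """Return the majority label among the k training vectors closest to `features`."""
    # Rank training samples by Euclidean distance in absorption-coefficient space.
    ranked = sorted(training, key=lambda item: math.dist(features, item[0]))
    # Count the labels of the k nearest samples and return the most common one.
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    # A hypothetical estimated absorption profile for one room boundary.
    estimated = (0.06, 0.11, 0.28, 0.47, 0.62, 0.68)
    print(knn_predict(estimated))
```

In a pipeline like the one described, the predicted label for each boundary would then be attached to the corresponding surface of the reconstructed room geometry.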





Author Tags

  1. acoustic absorption coefficient
  2. audio-visual
  3. knn
  4. material recognition
  5. room boundary estimation




