
Characterizing the Effect of Audio Degradation on Privacy Perception and Inference Performance in Audio-Based Human Activity Recognition

Published: 05 October 2020
DOI: 10.1145/3379503.3403551

Abstract

Audio has been increasingly adopted as a sensing modality in a variety of human-centered mobile applications and in smart assistants in the home. Although acoustic features can capture complex semantic information about human activities and context, continuous audio recording often poses significant privacy concerns. An intuitive way to reduce privacy concerns is to degrade audio quality such that speech and other relevant acoustic markers become unintelligible, but this often comes at the cost of activity recognition performance. In this paper, we employ a mixed-methods approach to characterize this balance. We first conduct an online survey with 266 participants to capture their perception of privacy, qualitatively and quantitatively, under degraded audio. Given our finding that privacy concerns can be significantly reduced at high levels of audio degradation, we then investigate how intentional degradation of audio frames affects recognition of the target classes while maintaining effective privacy mitigation. Our results indicate that degrading audio frames has minimal effect on audio recognition using frame-level features. Degradation can hurt the performance of audio recognition using segment-level features to some extent, though such features may still yield superior recognition performance. Given the differing requirements for privacy mitigation and recognition performance across sensing purposes, these trade-offs need to be balanced in actual implementations.
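The paper's exact degradation procedure is described in the full text; as a rough, hedged illustration of the idea of intentionally degrading audio frames, the Python sketch below downsamples and re-quantizes each short frame so that speech becomes largely unintelligible while coarse acoustic energy is preserved. The function name, parameter values, and the zero-order-hold scheme are illustrative assumptions, not the authors' method.

```python
# Illustrative sketch only -- NOT the method from the paper. Assumes a mono
# float signal in [-1, 1]. Each short frame is degraded by zero-order-hold
# downsampling plus coarse re-quantization, removing the fine detail that
# makes speech intelligible while keeping coarse spectral energy.
import numpy as np

def degrade_frames(signal, sr=16000, frame_ms=25, keep_ratio=0.25, bits=8):
    """Degrade audio frame by frame; lower keep_ratio/bits = heavier degradation."""
    frame_len = int(sr * frame_ms / 1000)
    k = max(1, round(1.0 / keep_ratio))   # keep every k-th sample per frame
    levels = 2 ** (bits - 1)              # quantization levels per polarity
    out = np.asarray(signal, dtype=np.float32).copy()
    for start in range(0, len(out) - frame_len + 1, frame_len):
        frame = out[start:start + frame_len]
        held = np.repeat(frame[::k], k)[:frame_len]          # zero-order hold
        out[start:start + frame_len] = np.round(held * levels) / levels
    return out

# Example usage (requires the soundfile package; "clip.wav" is a placeholder):
# import soundfile as sf
# x, sr = sf.read("clip.wav")
# sf.write("clip_degraded.wav", degrade_frames(x, sr), sr)
```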

Supplementary Material

a32-liang-supplement (a32-liang-supplement.zip)
Sample audio clips with and without degradation; the degradation levels are indicated in the filenames. The clips have the same quality as those presented to the Mechanical Turk participants in the online survey.




Published In

MobileHCI '20: 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services
October 2020, 418 pages
ISBN: 9781450375160
DOI: 10.1145/3379503
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery, New York, NY, United States


Author Tags

  1. Activity Recognition
  2. Audio Processing
  3. Mobile Sensing
  4. Privacy

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MobileHCI '20

Acceptance Rates

Overall Acceptance Rate 202 of 906 submissions, 22%



Cited By

  • (2024) Audio- and Video-Based Human Activity Recognition Systems in Healthcare. IEEE Access, 12, 8230–8245. DOI: 10.1109/ACCESS.2024.3353138
  • (2023) A Dataset for Foreground Speech Analysis With Smartwatches In Everyday Home Environments. 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), 1–5. DOI: 10.1109/ICASSPW59220.2023.10192949
  • (2023) AI-Based Acoustic Monitoring: Challenges and Approaches for Data-Driven Innovations Based on Audiovisual Analysis (original title: "KI-basiertes akustisches Monitoring: Herausforderungen und Lösungsansätze für datengetriebene Innovationen auf Basis audiovisueller Analyse"). In Entrepreneurship der Zukunft, 85–115. DOI: 10.1007/978-3-658-42060-4_4
  • (2022) SAMoSA. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 6(3), 1–19. DOI: 10.1145/3550284
  • (2022) Deceiving Audio Design in Augmented Environments: A Systematic Review of Audio Effects in Augmented Reality. 2022 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 36–43. DOI: 10.1109/ISMAR-Adjunct57072.2022.00018
  • (2022) Source Domain Selection for Cross-House Human Activity Recognition with Ambient Sensors. 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), 754–759. DOI: 10.1109/ICMLA55696.2022.00126
  • (2021) Theophany. Proceedings of the 29th ACM International Conference on Multimedia, 2056–2064. DOI: 10.1145/3474085.3475507
