Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

PPG-Hear: A Practical Eavesdropping Attack with Photoplethysmography Sensors

Published: 15 May 2024 Publication History

Editorial Notes

The authors have requested minor, non-substantive changes to the Version of Record and, in accordance with ACM policies, a Corrected Version of Record was published on July 10, 2024. For reference purposes, the VoR may still be accessed via the Supplemental Material section on this page.

Abstract

Photoplethysmography (PPG) sensors have become integral components of wearable and portable health devices in the current technological landscape. These sensors offer easy access to heart rate and blood oxygenation, facilitating continuous long-term health monitoring in clinic and non-clinic environments. While people understand that health-related information provided by PPG is private, no existing research has demonstrated that PPG sensors are dangerous devices capable of obtaining sensitive information other than health-related data. This work introduces PPG-Hear, a novel side-channel attack that exploits PPG sensors to intercept nearby audio information covertly. Specifically, PPG-Hear exploits low-frequency PPG measurements to discern and reconstruct human speech emitted from proximate speakers. This technology allows attackers to eavesdrop on sensitive conversations (e.g., audio passwords, business decisions, or intellectual properties) without being noticed. To achieve this non-trivial attack on commodity PPG-enabled devices, we employ differentiation and filtering techniques to mitigate the impact of temperature drift and user-specific gestures. We develop the first PPG-based speech reconstructor, which can identify speech patterns in the PPG spectrogram and establish the correlation between PPG and speech spectrograms. By integrating a MiniRocket-based classifier with a PixelGAN model, PPG-Hear can reconstruct human speech using low-sampling-rate PPG measurements. Through an array of real-world experiments, encompassing common eavesdropping scenarios such as surrounding speakers and the device's own speakers, we show that PPG-Hear can achieve remarkable accuracy of 90% for recognizing human speech, surpassing the current state-of-the-art side-channel eavesdropping attacks using motion sensors operating at equivalent sampling rates (i.e., 50Hz to 500Hz).

Supplemental Material

PDF File - Version of Record
VoR for "PPG-Hear: A Practical Eavesdropping Attack with Photoplethysmography Sensors" by Su et al., Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, Volume 8, Issue 2 (IMWUT 8:2).

References

[1]
2017. Android Wear: How to get raw PPG data? https://stackoverflow.com/questions/47444302/android-wear-how-to-get-raw-ppg-data (Accessed on 2/2/2024).
[2]
2021. Wearable Device Data Breach. https://healthitsecurity.com/news/61m-fitbit-apple-users-had-data-exposed-in-wearable-device-data-breach (Accessed on 8/1/2023).
[3]
2023. AudioMNIST Dataset. https://github.com/soerenab/AudioMNIST (Accessed on 8/1/2023).
[4]
2023. Samsung Privilege Health SDK. https://developer.samsung.com/health/privileged-partner/guide/data-specifications.html#Setting-Events-for-Multiple-Tracker-Types (Accessed on 7/10/2023).
[5]
2024. HarmonyOS Developer. https://developer.huawei.com/consumer/cn/doc/ecosystem-Guides/03_03_03_02_03_data-0000001180474833 (Accessed on 2/2/2024).
[6]
Ludovic Alhaïtz, Thomas Brunet, Christophe Aristégui, Olivier Poncelet, and Diego Baresch. 2023. Confined Phase Singularities Reveal the Spin-to-Orbital Angular Momentum Conversion of Sound Waves. Physical Review Letters 131, 11 (2023), 114001--114007. https://doi.org/10.1103/PhysRevLett.131.114001
[7]
S Abhishek Anand and Nitesh Saxena. 2018. Speechless: Analyzing the Threat to Speech Privacy from Smartphone Motion Sensors. In IEEE Symposium on Security & Privacy (SP). 1000--1017. https://doi.org/10.1109/SP.2018.00004
[8]
S Abhishek Anand, Chen Wang, Jian Liu, Nitesh Saxena, and Yingying Chen. 2021. Spearphone: A Lightweight Speech Privacy Exploit via Accelerometer-Sensed Reverberations from Smartphone Loudspeakers. In Proceedings of the 14th ACM Conference on Security and Privacy in Wireless and Mobile Networks. Association for Computing Machinery, 288--299. https://doi.org/10.1145/3448300.3468499
[9]
Zhongjie Ba, Tianhang Zheng, Xinyu Zhang, Zhan Qin, Baochun Li, Xue Liu, and Kui Ren. 2020. Learning-based Practical Smartphone Eavesdropping with Built-in Accelerometer. In Symposium on Network and Distributed System Security. 1--18. https://api.semanticscholar.org/CorpusID:211268160
[10]
Amitava Biswas. 1998. Novel brick-wall filters based on the auditory system. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP'98, Seattle, Washington, USA, May 12-15, 1998. IEEE, 3637--3640. https://doi.org/10.1109/ICASSP.1998.679670
[11]
Denisse Castaneda, Aibhlin Esparza, Mohammad Ghamari, Cinna Soltanpur, and Homer Nazeran. 2018. A review on wearable photoplethysmography sensors and their potential future applications in health care. International Journal of Biosensors & Bioelectronics 4, 4 (2018), 195--202. https://doi.org/10.15406/ijbsbe.2018.04.00125
[12]
Yunzhong Chen, Jiadi Yu, Linghe Kong, Hao Kong, Yanmin Zhu, and Yi-Chao Chen. 2023. RF-Mic: Live Voice Eavesdropping via Capturing Subtle Facial Speech Dynamics Leveraging RFID. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 7, 2 (2023), 25 pages. https://doi.org/10.1145/3596259
[13]
Dean Cvetkovic, Elif Derya Übeyli, and Irena Cosic. 2008. Wavelet transform feature extraction from human PPG, ECG, and EEG signal responses to ELF PEMF exposures: A pilot study. Digital signal processing 18, 5 (2008), 861--874. https://doi.org/10.1016/j.dsp.2007.05.009
[14]
Abe Davis, Michael Rubinstein, Neal Wadhwa, Gautham J. Mysore, Frédo Durand, and William T. Freeman. 2014. The Visual Microphone: Passive Recovery of Sound from Video. ACM Transactions on Graphics 33, 4 (2014), 1--10. https://doi.org/10.1145/2601097.2601119
[15]
Angus Dempster, Daniel F. Schmidt, and Geoffrey I. Webb. 2021. MiniRocket: A Very Fast (Almost) Deterministic Transform for Time Series Classification. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (Virtual Event, Singapore) (KDD '21). Association for Computing Machinery, New York, NY, USA, 248--257. https://doi.org/10.1145/3447548.3467231
[16]
D. Griffin and Jae Lim. 1984. Signal estimation from modified short-time Fourier transform. IEEE Transactions on Acoustics, Speech, and Signal Processing 32, 2 (1984), 236--243. https://doi.org/10.1109/TASSP.1984.1164317
[17]
Pengfei Hu, Hui Zhuang, Panneer Selvam Santhalingam, Riccardo Spolaor, Parth Pathak, Guoming Zhang, and Xiuzhen Cheng. 2022. Accear: Accelerometer acoustic eavesdropping with unconstrained vocabulary. In IEEE Symposium on Security & Privacy (SP). 1757--1773. https://doi.org/10.1109/SP46214.2022.9833716
[18]
Vladimir Iashin and Esa Rahtu. 2021. Taming Visually Guided Sound Generation. In British Machine Vision Conference. https://api.semanticscholar.org/CorpusID:239015904
[19]
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. Image-to-Image Translation with Conditional Adversarial Networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, Los Alamitos, CA, USA, 5967--5976. https://doi.org/10.1109/CVPR.2017.632
[20]
Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brébisson, Yoshua Bengio, and Aaron C Courville. 2019. MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 32--43. https://proceedings.neurips.cc/paper_files/paper/2019/file/6804c9bca0a615bdb9374d00a9fcba59-Paper.pdf
[21]
Panicos A Kyriacou and John Allen. 2021. Photoplethysmography: Technology, Signal Analysis and Applications. Academic Press. https://doi.org/10.1016/C2020-0-00098-8
[22]
Yichong Leng, Xu Tan, Sheng Zhao, Frank Soong, Xiang-Yang Li, and Tao Qin. 2021. MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. IEEE, 391--395. https://doi.org/10.1109/ICASSP39728.2021.9413877
[23]
Jesus Lázaro, Spyridon Kontaxis, Raquel Bailón, Pablo Laguna, and Eduardo Gil. 2018. Respiratory Rate Derived from Pulse Photoplethysmographic Signal by Pulse Decomposition Analysis. In 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 5282--5285. https://doi.org/10.1109/embc.2018.8513188
[24]
Elisa Mejía-Mejía, Karthik Budidha, Tomas Ysehak Abay, James M. May, and Panayiotis A. Kyriacou. 2020. Heart Rate Variability (HRV) and Pulse Rate Variability (PRV) for the Assessment of Autonomic Responses. Frontiers in physiology 11 (2020), 779. https://doi.org/10.3389/fphys.2020.00779
[25]
Yan Michalevsky, Dan Boneh, and Gabi Nakibly. 2014. Gyrophone: Recognizing Speech from Gyroscope Signals. In 23rd USENIX Security Symposium (USENIX Security 14). USENIX Association, San Diego, CA, 1053--1067. https://www.usenix.org/conference/usenixsecurity14/technical-sessions/presentation/michalevsky
[26]
Enric Monte-Moreno. 2011. Non-invasive estimate of blood glucose and blood pressure from a photoplethysmograph by means of machine learning techniques. Artificial intelligence in medicine 53, 2 (2011), 127--138. https://doi.org/10.1016/j.artmed.2011.05.001
[27]
Ralph P Muscatell. 1984. Laser microphone. Journal of the Acoustical Society of America 76, 4 (1984), 1284--1284. https://doi.org/10.1121/1.391370
[28]
Ben Nassi, Yaron Pirutin, Raz Swisa, Adi Shamir, Yuval Elovici, and Boris Zadov. 2022. Lamphone: Passive Sound Recovery from a Desk Lamp's Light Bulb Vibrations. In 31st USENIX Security Symposium (USENIX Security 22). USENIX Association, Boston, MA, 4401--4417. https://www.usenix.org/conference/usenixsecurity22/presentation/nassi
[29]
Fatemeh Sarhaddi, Kianoosh Kazemi, Iman Azimi, Rui Cao, Hannakaisa Niela-Vilén, Anna Axelin, Pasi Liljeberg, and Amir M Rahmani. 2022. A comprehensive accuracy assessment of Samsung smartwatch heart rate and heart rate variability. PloS one 17, 12 (2022), 1--19. https://doi.org/10.1371/journal.pone.0268361
[30]
Yuchen Su, Guoqing Jiang, Yicong Du, Yuefeng Chen, Hongbo Liu, Yanzhi Ren, Huan Dai, Yan Wang, Shuai Li, and Yingying Chen. 2023. P2Auth: Two-Factor Authentication Leveraging PIN and Keystroke-Induced PPG Measurements. In IEEE 43rd International Conference on Distributed Computing Systems (ICDCS). 726--737. https://doi.org/10.1109/ICDCS57875.2023.00074
[31]
Akira Tamamori, Tomoki Hayashi, Kazuhiro Kobayashi, K. Takeda, and Tomoki Toda. 2017. Speaker-Dependent WaveNet Vocoder. In Conference of the International Speech Communication Association (Interspeech). https://api.semanticscholar.org/CorpusID:31460711
[32]
Yu-Chih Tung and Kang G. Shin. 2016. Expansion of Human-Phone Interface By Sensing Structure-Borne Sound Propagation. In Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services (Singapore, Singapore) (MobiSys '16). 277--289. https://doi.org/10.1145/2906388.2906394
[33]
Goran Udovičić, Jurica Đerek, Mladen Russo, and Marjan Sikora. 2017. Wearable Emotion Recognition System based on GSR and PPG Signals. In Proceedings of the 2nd International Workshop on Multimedia for Personal Health and Health Care (Mountain View, California, USA). Association for Computing Machinery, 53--59. https://doi.org/10.1145/3132635.3132641
[34]
Zhibo Wang, Tengda Zhao, Jinxin Ma, Hongkai Chen, Kaixin Liu, Huajie Shao, Qian Wang, and Ju Ren. 2022. Hear Sign Language: A Real-Time End-to-End Sign Language Recognition System. IEEE Transactions on Mobile Computing 21, 7 (2022), 2398--2410. https://doi.org/10.1109/TMC.2020.3038303
[35]
Pete Warden. 2018. Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition. ArXiv abs/1804.03209 (2018). https://api.semanticscholar.org/CorpusID:4719239
[36]
Chen Yan, Guoming Zhang, Xiaoyu Ji, Tianchen Zhang, Taimin Zhang, and Wenyuan Xu. 2021. The Feasibility of Injecting Inaudible Voice Commands to Voice Assistants. IEEE Transactions on Dependable and Secure Computing 18, 3 (2021), 1108--1124. https://doi.org/10.1109/TDSC.2019.2906165
[37]
Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. 2018. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, 586--595. https://github.com/richzhang/PerceptualSimilarity
[38]
Zhilin Zhang. 2015. Photoplethysmography-based heart rate monitoring in physical activities via joint sparse spectrum reconstruction. IEEE Transactions on Biomedical Engineering 62, 8 (2015), 1902--1910. https://doi.org/10.1109/TBME.2015.2406332
[39]
Tianming Zhao, Jian Liu, Yan Wang, Hongbo Liu, and Yingying Chen. 2018. PPG-based finger-level gesture recognition leveraging wearables. In IEEE International Conference on Computer Communications (INFOCOM). 1457--1465. https://doi.org/10.1109/INFOCOM.2018.8486006
[40]
Tianming Zhao, Yan Wang, Jian Liu, and Yingying Chen. 2018. Your heart won't lie: PPG-based continuous authentication on wrist-worn wearable devices. In Proceedings of the 24th Annual International Conference on Mobile Computing and Networking (MobiCom '18). Association for Computing Machinery, New York, NY, USA, 783--785. https://doi.org/10.1145/3241539.3267748

Cited By

View all
  • (2024)RF-GymCare: Introducing Respiratory Prior for RF Sensing in Gym EnvironmentsProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36785688:3(1-28)Online publication date: 9-Sep-2024
  • (2024)BSENSE: In-vehicle Child Detection and Vital Sign Monitoring with a Single mmWave Radar and Synthetic ReflectorsProceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems10.1145/3666025.3699352(478-492)Online publication date: 4-Nov-2024
  • (2024)Pushing the Limits of Acoustic Spatial Perception via Incident Angle EncodingProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36595838:2(1-28)Online publication date: 15-May-2024

Index Terms

  1. PPG-Hear: A Practical Eavesdropping Attack with Photoplethysmography Sensors

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies  Volume 8, Issue 2
    June 2024
    1330 pages
    EISSN:2474-9567
    DOI:10.1145/3665317
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 May 2024
    Published in IMWUT Volume 8, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Eavesdropping attack
    2. PPG
    3. Side-channel

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)336
    • Downloads (Last 6 weeks)30
    Reflects downloads up to 11 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)RF-GymCare: Introducing Respiratory Prior for RF Sensing in Gym EnvironmentsProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36785688:3(1-28)Online publication date: 9-Sep-2024
    • (2024)BSENSE: In-vehicle Child Detection and Vital Sign Monitoring with a Single mmWave Radar and Synthetic ReflectorsProceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems10.1145/3666025.3699352(478-492)Online publication date: 4-Nov-2024
    • (2024)Pushing the Limits of Acoustic Spatial Perception via Incident Angle EncodingProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36595838:2(1-28)Online publication date: 15-May-2024

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media