Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3300061.3300125acmconferencesArticle/Chapter ViewAbstractPublication PagesmobicomConference Proceedingsconference-collections
research-article

PDVocal: Towards Privacy-preserving Parkinson's Disease Detection using Non-speech Body Sounds

Published: 05 August 2019 Publication History

Abstract

Parkinson's disease (PD) is a chronic neurodegenerative disorder resulting from the progressive loss of dopaminergic nerve cells. People with PD usually demonstrate deficits in performing basic daily activities, and the relevant annual social cost can reach about $25 billion in the United States. Early detection of PD plays an important role in symptom relief and improvement in the performance of activities in daily life (ADL), which eventually reduces societal and economic burden. However, conventional PD detection methods are inconvenient in daily life (e.g., requiring users to wear sensors). To overcome this challenge, we propose and identify the non-speech body sounds as the new PD biomarker, and utilize the data in smartphone usage to realize the passive PD detection in daily life without interrupting the user. Specifically, we present PDVocal, an end-to-end smartphone-based privacy-preserving system towards early PD detection. PDVocal can passively recognize the PD digital biomarkers in the voice data during daily phone conversation. At the user end, PDVocal filters the audio stream and only extracts the non-speech body sounds (e.g., breathing, clearing throat and swallowing) which contain no privacy-sensitive content. At the cloud end, PDVocal analyzes the body sounds of interest and assesses the health condition using a customized residual network. For the sake of reliability in real-world PD detection, we investigate the method of the performance optimizer including an opportunistic learning knob and a long-term tracking protocol. We evaluate our proposed PDVocal on a collected dataset from 890 participants and real-life conversations from publicly available data sources. Results indicate that non-speech body sounds are a promising digital biomarker for privacy-preserving PD detection in daily life.

References

[1]
D. Hirtz, D. Thurman, K. Gwinn-Hardy, M. Mohamed, A. Chaudhuri, and R. Zalutsky, “How common are the “common” neurologic disorders?” Neurology, vol. 68, no. 5, pp. 326--337, 2007.
[2]
E. R. Dorsey and B. R. Bloem, “The parkinson pandemic-a call to action,” JAMA neurology, vol. 75, no. 1, pp. 9--10, 2018.
[3]
J. Jankovic, “Parkinson's disease: clinical features and diagnosis,” Journal of Neurology, Neurosurgery & Psychiatry, vol. 79, no. 4, pp. 368--376, 2008.
[4]
W. Poewe, “The natural history of parkinson's disease,” Journal of neurology, vol. 253, no. 7, pp. vii2--vii6, 2006.
[5]
K. R. Chaudhuri, D. G. Healy, and A. H. Schapira, “Non-motor symptoms of parkinson's disease: diagnosis and management,” The Lancet Neurology, vol. 5, no. 3, pp. 235--245, 2006.
[6]
M. M. Goldenberg, “Medical management of parkinson's disease,” Pharmacy and Therapeutics, vol. 33, no. 10, p. 590, 2008.
[7]
R. L. Brey, “Muhammad ali's message: Keep moving forward,” Neurology Now, vol. 2, no. 2, p. 8, 2006.
[8]
M. H. Polymeropoulos, C. Lavedan, E. Leroy, S. E. Ide, A. Dehejia, A. Dutra, B. Pike, H. Root, J. Rubenstein, R. Boyer, et al., “Mutation in the α-synuclein gene identified in families with parkinson's disease,” science, vol. 276, no. 5321, pp. 2045--2047, 1997.
[9]
S. Fahn, “Description of parkinson's disease as a clinical syndrome,” Annals of the New York Academy of Sciences, vol. 991, no. 1, pp. 1--14, 2003.
[10]
S. Arora, V. Venkataraman, A. Zhan, S. Donohue, K. Biglan, E. Dorsey, and M. Little, “Detecting and monitoring the symptoms of parkinson's disease using smartphones: A pilot study,” Parkinsonism & related disorders, vol. 21, no. 6, pp. 650--653, 2015.
[11]
M. A. Little, P. E. McSharry, E. J. Hunter, J. Spielman, L. O. Ramig, et al., “Suitability of dysphonia measurements for telemonitoring of parkinson's disease,” IEEE transactions on biomedical engineering, vol. 56, no. 4, pp. 1015--1022, 2009.
[12]
B. E. Sakar, M. E. Isenkul, C. O. Sakar, A. Sertbas, F. Gurgen, S. Delil, H. Apaydin, and O. Kursun, “Collection and analysis of a parkinson speech dataset with multiple types of sound recordings,” IEEE Journal of Biomedical and Health Informatics, vol. 17, no. 4, pp. 828--834, 2013.
[13]
B. M. Bot, C. Suver, E. C. Neto, M. Kellen, A. Klein, C. Bare, M. Doerr, A. Pratap, J. Wilbanks, E. R. Dorsey, et al., “The mpower study, parkinson disease mobile data collected using researchkit,” Scientific data, vol. 3, p. 160011, 2016.
[14]
M. Little, “Ubiquitous, inexpensive non-invasive technologies for objective detection and monitoring of parkinson's symptoms,” 2013.
[15]
F. L. Pagan, “Improving outcomes through early diagnosis of parkinson's disease.” The American journal of managed care, vol. 18, no. 7 Suppl, pp. S176--82, 2012.
[16]
A. K. Ho, R. Iansek, C. Marigliani, J. L. Bradshaw, and S. Gates, “Speech impairment in a large sample of patients with parkinson's disease,” Behavioural neurology, vol. 11, no. 3, pp. 131--137, 1999.
[17]
J. A. Logemann, H. B. Fisher, B. Boshes, and E. R. Blonsky, “Frequency and cooccurrence of vocal tract dysfunctions in the speech of a large sample of parkinson patients,” Journal of Speech and Hearing Disorders, vol. 43, no. 1, pp. 47--57, 1978.
[18]
J. R. Duffy, Motor Speech Disorders-E-Book: Substrates, Differential Diagnosis, and Management. hskip 1em plus 0.5em minus 0.4emrelax Elsevier Health Sciences, 2013.
[19]
B. Harel, M. Cannizzaro, and P. J. Snyder, “Variability in fundamental frequency during speech in prodromal and incipient parkinson's disease: A longitudinal case study,” Brain and cognition, vol. 56, no. 1, pp. 24--29, 2004.
[20]
L. Jeancolas, H. Benali, B.-E. Benkelfat, G. Mangone, J.-C. Corvol, M. Vidailhet, S. Lehericy, and D. Petrovska-Delacrétaz, “Automatic detection of early stages of parkinson's disease through acoustic voice analysis with mel-frequency cepstral coefficients,” in Advanced Technologies for Signal and Image Processing (ATSIP), 2017 International Conference on. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2017, pp. 1--6.
[21]
T. Rahman, A. T. Adams, M. Zhang, E. Cherry, B. Zhou, H. Peng, and T. Choudhury, “Bodybeat: a mobile system for sensing non-speech body sounds.” in MobiSys, vol. 14, 2014, pp. 2--13.
[22]
S. Reichert, R. Gass, C. Brandt, and E. Andrès, “Analysis of respiratory sounds: state of the art,” Clinical medicine. Circulatory, respiratory and pulmonary medicine, vol. 2, pp. CCRPM--S530, 2008.
[23]
L. Pepa, F. Verdini, M. Capecci, and M. Ceravolo, “Smartphone based freezing of gait detection for parkinsonian patients,” in Consumer Electronics (ICCE), 2015 IEEE International Conference on. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2015, pp. 212--215.
[24]
B. Takavc, A. Català, D. R. Mart'in, N. Van Der Aa, W. Chen, and M. Rauterberg, “Position and orientation tracking in a ubiquitous monitoring system for parkinson disease patients with freezing of gait symptom,” JMIR mHealth and uHealth, vol. 1, no. 2, 2013.
[25]
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, et al., “Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups,” IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82--97, 2012.
[26]
L. Deng, G. Hinton, and B. Kingsbury, “New types of deep neural network learning for speech recognition and related applications: An overview,” in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2013, pp. 8599--8603.
[27]
L. Deng, “Achievements and challenges of deep learning-from speech analysis and recognition to language and multimodal processing,” in Fifteenth Annual Conference of the International Speech Communication Association, 2014.
[28]
A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, “Mobilenets: Efficient convolutional neural networks for mobile vision applications,” arXiv preprint arXiv:1704.04861, 2017.
[29]
K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” arXiv preprint arXiv:1409.1556, 2014.
[30]
J. Kaiser and R. Schafer, “On the use of the i 0-sinh window for spectrum analysis,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 1, pp. 105--107, 1980.
[31]
Y. LeCun, C. Cortes, and C. Burges, “Mnist handwritten digit database,” AT&T Labs {Online}. Available: http://yann. lecun. com/exdb/mnist, vol. 2, 2010.
[32]
A. Krizhevsky, V. Nair, and G. Hinton, “The cifar-10 dataset,” online: http://www. cs. toronto. edu/kriz/cifar. html, 2014.
[33]
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet: A large-scale hierarchical image database,” in Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. hskip 1em plus 0.5em minus 0.4emrelax Ieee, 2009, pp. 248--255.
[34]
K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770--778.
[35]
G. A. Kang, J. M. Bronstein, D. L. Masterman, M. Redelings, J. A. Crum, and B. Ritz, “Clinical characteristics in early parkinson's disease in a central california population-based study,” Movement disorders: official journal of the Movement Disorder Society, vol. 20, no. 9, pp. 1133--1142, 2005.
[36]
A. Tsanas, M. A. Little, P. E. McSharry, and L. O. Ramig, “Accurate telemonitoring of parkinson's disease progression by noninvasive speech tests,” IEEE transactions on Biomedical Engineering, vol. 57, no. 4, pp. 884--893, 2010.
[37]
A. Tsanas, M. A. Little, P. E. McSharry, J. Spielman, and L. O. Ramig, “Novel speech signal processing algorithms for high-accuracy classification of parkinson's disease,” IEEE Transactions on biomedical engineering, vol. 59, no. 5, pp. 1264--1271, 2012.
[38]
M. Shahbakhi, D. T. Far, E. Tahami, et al., “Speech analysis for diagnosis of parkinson's disease using genetic algorithm and support vector machine,” Journal of Biomedical Science and Engineering, vol. 7, no. 4, pp. 147--156, 2014.
[39]
I. Rustempasic and M. Can, “Diagnosis of parkinson's disease using fuzzy c-means clustering and pattern recognition,” Southeast Europe Journal of Soft Computing, vol. 2, no. 1, 2013.
[40]
L. Giancardo, A. Sanchez-Ferro, T. Arroyo-Gallego, I. Butterworth, C. S. Mendoza, P. Montero, M. Matarazzo, J. A. Obeso, M. L. Gray, and R. S. J. Estépar, “Computer keyboard interaction as an indicator of early parkinson's disease,” Scientific reports, vol. 6, p. 34468, 2016.
[41]
C. R. Pereira, S. A. Weber, C. Hook, G. H. Rosa, and J. P. Papa, “Deep learning-aided parkinson's disease diagnosis from handwritten dynamics,” in Graphics, Patterns and Images (SIBGRAPI), 2016 29th SIBGRAPI Conference on. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2016, pp. 340--346.
[42]
B. M. Eskofier, S. I. Lee, J.-F. Daneault, F. N. Golabchi, G. Ferreira-Carvalho, G. Vergara-Diaz, S. Sapienza, G. Costante, J. Klucken, T. Kautz, et al., “Recent machine learning advancements in sensor-based mobility analysis: Deep learning for parkinson's disease assessment,” in Engineering in Medicine and Biology Society (EMBC), 2016 IEEE 38th Annual International Conference of the. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2016, pp. 655--658.
[43]
K. Sha, G. Zhan, W. Shi, M. Lumley, C. Wiholm, and B. Arnetz, “Spa: a smart phone assisted chronic illness self-management system with participatory sensing,” in Proceedings of the 2nd International Workshop on Systems and Networking Support for Health Care and Assisted Living Environments. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2008, p. 5.
[44]
T. Denning, A. Andrew, R. Chaudhri, C. Hartung, J. Lester, G. Borriello, and G. Duncan, “Balance: towards a usable pervasive wellness application with accurate activity inference,” in Proceedings of the 10th workshop on Mobile Computing Systems and Applications. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2009, p. 5.
[45]
N. Oliver and F. Flores-Mangas, “Healthgear: Automatic sleep apnea detection and monitoring with a mobile phone.” JCM, vol. 2, no. 2, pp. 1--9, 2007.
[46]
Z. Jin, J. Oresko, S. Huang, and A. C. Cheng, “Hearttogo: a personalized medicine technology for cardiovascular disease prevention and detection,” in Life Science Systems and Applications Workshop, 2009. LiSSA 2009. IEEE/NIH. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2009, pp. 80--83.
[47]
O. Akinbode, O. Longe, and B. Amosa, “Mobile-phone based patient compliance system for chronic illness care in nigeria,” Journal of Computer Science & Technology, vol. 12, 2012.
[48]
H. Lu, D. Frauendorfer, M. Rabbi, M. S. Mast, G. T. Chittaranjan, A. T. Campbell, D. Gatica-Perez, and T. Choudhury, “Stresssense: Detecting stress in unconstrained acoustic environments using smartphones,” in Proceedings of the 2012 ACM Conference on Ubiquitous Computing. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2012, pp. 351--360.
[49]
N. Bui, A. Nguyen, P. Nguyen, H. Truong, A. Ashok, T. Dinh, R. Deterding, and T. Vu, “Pho2: Smartphone based blood oxygen level measurement systems using near-ir and red wave-guided light,” in Proceedings of the 15th ACM Conference on Embedded Network Sensor Systems. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2017, p. 26.
[50]
A. A. Farhan, C. Yue, R. Morillo, S. Ware, J. Lu, J. Bi, J. Kamath, A. Russell, A. Bamis, and B. Wang, “Behavior vs. introspection: refining prediction of clinical depression via smartphone sensing data.” in Wireless Health, 2016, pp. 30--37.
[51]
C. Gao, F. Kong, and J. Tan, “Healthaware: Tackling obesity with health aware smart phone systems,” in Robotics and Biomimetics (ROBIO), 2009 IEEE International Conference on. hskip 1em plus 0.5em minus 0.4emrelax Ieee, 2009, pp. 1549--1554.
[52]
K. Yatani and K. N. Truong, “Bodyscope: a wearable acoustic sensor for activity recognition,” in Proceedings of the 2012 ACM Conference on Ubiquitous Computing. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2012, pp. 341--350.
[53]
T. Hao, G. Xing, and G. Zhou, “isleep: unobtrusive sleep quality monitoring using smartphones,” in Proceedings of the 11th ACM Conference on Embedded Networked Sensor Systems. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2013, p. 4.
[54]
E. C. Larson, T. Lee, S. Liu, M. Rosenfeld, and S. N. Patel, “Accurate and privacy preserving cough sensing using a low-cost microphone,” in Proceedings of the 13th international conference on Ubiquitous computing. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2011, pp. 375--384.
[55]
O. Amft, H. Junker, and G. Troster, “Detection of eating and drinking arm gestures using inertial body-worn sensors,” in Wearable computers, 2005. proceedings. ninth ieee international symposium on. hskip 1em plus 0.5em minus 0.4emrelax IEEE, 2005, pp. 160--163.
[56]
S. Nirjon, R. F. Dickerson, Q. Li, P. Asare, J. A. Stankovic, D. Hong, B. Zhang, X. Jiang, G. Shen, and F. Zhao, “Musicalheart: A hearty way of listening to music,” in Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2012, pp. 43--56.
[57]
J. Chauhan, Y. Hu, S. Seneviratne, A. Misra, A. Seneviratne, and Y. Lee, “Breathprint: Breathing acoustics-based user authentication,” in Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services. hskip 1em plus 0.5em minus 0.4emrelax ACM, 2017, pp. 278--291.

Cited By

View all
  • (2024)Artificial Intelligence of Things: A SurveyACM Transactions on Sensor Networks10.1145/369063921:1(1-75)Online publication date: 30-Aug-2024
  • (2024)Privacy-preserving human activity sensing: A surveyHigh-Confidence Computing10.1016/j.hcc.2024.1002044:1(100204)Online publication date: Mar-2024
  • (2024)Exploring Machine Learning Models for Federated Learning: A Review of Approaches, Performance, and LimitationsDynamics of Disasters10.1007/978-3-031-74006-0_4(87-121)Online publication date: 25-Sep-2024
  • Show More Cited By

Index Terms

  1. PDVocal: Towards Privacy-preserving Parkinson's Disease Detection using Non-speech Body Sounds

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MobiCom '19: The 25th Annual International Conference on Mobile Computing and Networking
    August 2019
    1017 pages
    ISBN:9781450361699
    DOI:10.1145/3300061
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 August 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. acoustic sensing
    2. mobile health
    3. parkinson's disease

    Qualifiers

    • Research-article

    Conference

    MobiCom '19
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 440 of 2,972 submissions, 15%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)70
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 24 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Artificial Intelligence of Things: A SurveyACM Transactions on Sensor Networks10.1145/369063921:1(1-75)Online publication date: 30-Aug-2024
    • (2024)Privacy-preserving human activity sensing: A surveyHigh-Confidence Computing10.1016/j.hcc.2024.1002044:1(100204)Online publication date: Mar-2024
    • (2024)Exploring Machine Learning Models for Federated Learning: A Review of Approaches, Performance, and LimitationsDynamics of Disasters10.1007/978-3-031-74006-0_4(87-121)Online publication date: 25-Sep-2024
    • (2023)Wearable sensors and features for diagnosis of neurodegenerative diseases: A systematic reviewDIGITAL HEALTH10.1177/205520762311735699Online publication date: 16-May-2023
    • (2023)PDAssess: A Privacy-preserving Free-speech based Parkinson's Disease Daily Assessment SystemProceedings of the 21st ACM Conference on Embedded Networked Sensor Systems10.1145/3625687.3625805(251-264)Online publication date: 12-Nov-2023
    • (2023)DAPPERProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35962567:2(1-27)Online publication date: 12-Jun-2023
    • (2023)PDGes: An Interpretable Detection Model for Parkinson’s Disease Using SmartphonesACM Transactions on Sensor Networks10.1145/358531419:4(1-21)Online publication date: 20-Apr-2023
    • (2023)Paralinguistic Privacy Protection at the EdgeACM Transactions on Privacy and Security10.1145/357016126:2(1-27)Online publication date: 13-Apr-2023
    • (2023)Akte-Liquid: Acoustic-based Liquid Identification with SmartphonesACM Transactions on Sensor Networks10.1145/355164019:1(1-24)Online publication date: 21-Feb-2023
    • (2023)Towards Fairer and More Efficient Federated Learning via Multidimensional Personalized Edge Models2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191956(1-8)Online publication date: 18-Jun-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    EPUB

    View this article in ePub.

    ePub

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media