Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Understanding the performance of machine learning models from data- to patient-level

Published: 11 December 2024 Publication History

Abstract

Machine Learning (ML) models have the potential to support decision-making in healthcare by grasping complex patterns within data. However, decisions in this domain are sensitive and require active involvement of domain specialists with deep knowledge of the data. To address this task, clinicians need to understand how predictions are generated so they can provide feedback for model refinement. There is usually a gap in the communication between data scientists and domain specialists that needs to be addressed. Specifically, many ML studies are only concerned with presenting average accuracies over an entire dataset, losing valuable insights that can be obtained at a more fine-grained patient-level analysis of classification performance. In this article, we present a case study aimed at explaining the factors that contribute to specific predictions for individual patients. Our approach takes a data-centric perspective, focusing on the structure of the data and its correlation with ML model performance. We utilize the concept of Instance Hardness, which measures the level of difficulty an instance poses in being correctly classified. By selecting the hardest and easiest to classify instances, we analyze and contrast the distributions of specific input features and extract meta-features to describe each instance. Furthermore, we individually examine certain instances, offering valuable insights into why they offer challenges for classification, enabling a better understanding of both the successes and failures of the ML models. This opens up the possibility for discussions between data scientists and domain specialists, supporting collaborative decision-making.

References

[1]
José L. M. Arruda, Ricardo B. C. Prudêncio, and Ana C. Lorena. 2020. Measuring instance hardness using data complexity measures. In Brazilian Conference on Intelligent Systems. Springer, 483–497.
[2]
André Calero Valdez, Martina Ziefle, Katrien Verbert, Alexander Felfernig, and Andreas Holzinger. 2016. Recommender systems for health informatics: State-of-the-art and future perspectives. In Machine Learning for Health Informatics. Springer, 391–414.
[3]
Angelos Chatzimparmpas, Fernando V. Paulovich, and Andreas Kerren. 2022. HardVis: Visual analytics to handle instance hardness using undersampling and oversampling techniques. In Computer graphics forum (Print), Vol. 42. John Wiley & Sons, 135–154.
[4]
Guang Chen, DI Wu, Wei Guo, Yong Cao, Da Huang, Hongwu Wang, Tao Wang, Xiaoyun Zhang, Huilong Chen, Haijing Yu, et al. 2020. Clinical and immunological features of severe and moderate coronavirus disease 2019. The Journal of Clinical Investigation 130, 5 (2020), 2620–2629.
[5]
Alexander Decruyenaere, Philippe Decruyenaere, Patrick Peeters, Frank Vermassen, Tom Dhaene, and Ivo Couckuyt. 2015. Prediction of delayed graft function after kidney transplantation: Comparison between logistic regression and machine learning methods. BMC Medical Informatics and Decision Making 15 (2015), 1–10.
[6]
Menglu Gao, Qianying Wang, Jianhao Wei, Zhaoqin Zhu, and Haicong Li. 2020. Severe Coronavirus disease 2019 pneumonia patients showed signs of aggravated renal impairment. Journal of Clinical Laboratory Analysis 34, 10 (2020), e23535.
[7]
Andreas Holzinger. 2016. Interactive machine learning for health informatics: When do we need the human-in-the-loop?Brain Informatics 3, 2 (2016), 119–131.
[8]
Andrew Houston, Georgina Cosma, Phillipa Turner, and Alexander Bennett. 2021. Predicting surgical outcomes for chronic exertional compartment syndrome using a machine learning framework with embedded trust by interrogation strategies. Scientific Reports 11, 1 (2021), 1–15.
[9]
Grey Leonard, Charles South, Courtney Balentine, Matthew Porembka, John Mansour, Sam Wang, Adam Yopp, Patricio Polanco, Herbert Zeh, and Mathew Augustine. 2022. Machine learning improves prediction over logistic regression on resected colon cancer patients. Journal of Surgical Research 275 (2022), 181–193.
[10]
Jing Li, Yinghua Zhang, Fang Wang, Bing Liu, Hui Li, Guodong Tang, Zhigang Chang, Aihua Liu, Chunyi Fu, Jing Gao, et al. 2020. Sex differences in clinical findings among patients with coronavirus disease 2019 (COVID-19) and severe condition. MedRxiv (2020), 2020–02.
[11]
Camila Castro Moreno, Pedro Yuri Arbs Paiva, Gustavo H. Nunes, and Ana Carolina Lorena. 2021. Contrasting the profiles of easy and hard observations in a dataset. In Proc. NeurIPS DCAI Workshop.
[12]
Shreya Mukherjee and Kalipada Pahan. 2021. Is COVID-19 gender-sensitive? Journal of Neuroimmune Pharmacology 16 (2021), 38–47.
[13]
Fesih Ok, Omer Erdogan, Emrullah Durmus, Serkan Carkci, and Aggul Canik. 2021. Predictive values of blood urea nitrogen/creatinine ratio and other routine blood parameters on disease severity and survival of COVID-19 patients. Journal of Medical Virology 93, 2 (2021), 786–793.
[14]
Pedro Yuri Arbs Paiva, Camila Castro Moreno, Kate Smith-Miles, Maria Gabriela Valeriano, and Ana Carolina Lorena. 2022. Relating instance hardness to classification performance in a dataset: A visual approach. Machine Learning (2022), 1–39.
[15]
Markus Plass, Michaela Kargl, Patrick Nitsche, Emilian Jungwirth, Andreas Holzinger, and Heimo Müller. 2022. Understanding and explaining diagnostic paths: Toward augmented decision making. IEEE Computer Graphics and Application 42, 6 (2022), 47–57.
[16]
Nabeel Seedat, Jonathan Crabbé, Ioana Bica, and Mihaela van der Schaar. 2022. Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data. Advances in Neural Information Processing Systems 35 (2022), 23660–23674.
[17]
Milad Sharifpour, Srikant Rangaraju, Michael Liu, Darwish Alabyad, Fadi B. Nahab, Christina M. Creel-Bulos, Craig S. Jabaley, and Emory COVID-19 Quality & Clinical Research Collaborative. 2020. C-Reactive protein as a prognostic indicator in hospitalized patients with COVID-19. PloS One 15, 11 (2020), e0242400.
[18]
Anil Shrestha, Gaurav Jung Shah, Sagar Neupane, and Richa Shrestha. 2022. C-Reactive protein as a prognostic marker in hospitalized patients with COVID-19. Journal of Nepalgunj Medical College 20, 1 (2022), 70–73.
[19]
Michael R. Smith, Tony Martinez, and Christophe Giraud-Carrier. 2014. An instance level analysis of data complexity. Machine Learning 95, 2 (2014), 225–256.
[20]
Herdiantri Sufriyana, Atina Husnayain, Ya-Lin Chen, Chao-Yang Kuo, Onkar Singh, Tso-Yang Yeh, Yu-Wei Wu, Emily Chia-Yu Su, et al. 2020. Comparison of multivariable logistic regression and other machine learning algorithms for prognostic prediction studies in pregnancy care: Systematic review and meta-analysis. JMIR Medical Informatics 8, 11 (2020), e16503.
[21]
Li Tan, Qi Wang, Duanyang Zhang, Jinya Ding, Qianchuan Huang, Yi-Quan Tang, Qiongshu Wang, and Hongming Miao. 2020. Lymphopenia predicts disease severity of COVID-19: A descriptive and predictive study. Signal Transduction and Targeted Therapy 5, 1 (2020), 1–3.
[22]
Matthew Zirui Tay, Chek Meng Poh, Laurent Rénia, Paul A. MacAry, and Lisa F. P. Ng. 2020. The trinity of COVID-19: Immunity, inflammation and intervention. Nature Reviews Immunology 20, 6 (2020), 363–374.
[23]
Sana Tonekaboni, Shalmali Joshi, Melissa D. McCradden, and Anna Goldenberg. 2019. What clinicians want: Contextualizing explainable machine learning for clinical end use. In Machine Learning for Healthcare Conference. PMLR, 359–380.
[24]
Maria Gabriela Valeriano, Carlos R. V. Kiffer, Giane Higino, Paloma Zanão, Dulce A. Barbosa, Patrícia A. Moreira, Paulo Caleb J. L. Santos, Renato Grinbaum, and Ana Carolina Lorena. 2022. Let the data speak: Analysing data from multiple health centers of the São Paulo metropolitan area for COVID-19 clinical deterioration prediction. In Proceedings of the 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid’22). IEEE, 948–951.
[25]
Maria Gabriela Valeriano, Carlos Roberto Veiga Kiffer, and Ana Carolina Lorena. 2022. Supporting decision making in health scenarios with machine learning models. In Anais do Simposio Brasileiro de Pesquisa Operacional (Juiz de Fora). Anais eletronicos. Campinas, Galoa.

Cited By

View all
  • (2024)Improving models performance in a data-centric approach applied to the healthcare domainAnais do XII Symposium on Knowledge Discovery, Mining and Learning (KDMiLe 2024)10.5753/kdmile.2024.244519(57-64)Online publication date: 17-Nov-2024

Index Terms

  1. Understanding the performance of machine learning models from data- to patient-level

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Journal of Data and Information Quality
    Journal of Data and Information Quality  Volume 16, Issue 4
    December 2024
    122 pages
    EISSN:1936-1963
    DOI:10.1145/3613719
    • Editor:
    • Felix Naumann
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 December 2024
    Online AM: 13 September 2024
    Accepted: 19 May 2024
    Revised: 26 January 2024
    Received: 30 May 2023
    Published in JDIQ Volume 16, Issue 4

    Check for updates

    Author Tags

    1. Machine learning
    2. Instance Hardness
    3. Data-centric
    4. Healthcare

    Qualifiers

    • Research-article

    Funding Sources

    • Fundação de Amparo à Pesquisa do Estado de São Paulo
    • Coordination for the Improvement of Higher Education Personnel – Brazil (CAPES)

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)137
    • Downloads (Last 6 weeks)45
    Reflects downloads up to 12 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Improving models performance in a data-centric approach applied to the healthcare domainAnais do XII Symposium on Knowledge Discovery, Mining and Learning (KDMiLe 2024)10.5753/kdmile.2024.244519(57-64)Online publication date: 17-Nov-2024

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media