Understanding the performance of machine learning models from data- to patient-level

Authors:

Maria Gabriela Valeriano,

Ana Matran-Fernandez,

Ana Carolina Lorena

ACM Journal of Data and Information Quality, Volume 16, Issue 4

Article No.: 23, Pages 1 - 19

https://doi.org/10.1145/3687267

Published: 11 December 2024

Abstract

Machine Learning (ML) models have the potential to support decision-making in healthcare by capturing complex patterns within data. However, decisions in this domain are sensitive and require the active involvement of domain specialists with deep knowledge of the data. To contribute effectively, clinicians need to understand how predictions are generated so they can provide feedback for model refinement. There is usually a communication gap between data scientists and domain specialists that needs to be addressed. Specifically, many ML studies report only average accuracies over an entire dataset, losing valuable insights that can be obtained from a more fine-grained, patient-level analysis of classification performance. In this article, we present a case study aimed at explaining the factors that contribute to specific predictions for individual patients. Our approach takes a data-centric perspective, focusing on the structure of the data and its correlation with ML model performance. We use the concept of Instance Hardness, which measures how difficult an instance is to classify correctly. By selecting the hardest and easiest instances to classify, we analyze and contrast the distributions of specific input features and extract meta-features to describe each instance. Furthermore, we individually examine certain instances, offering insights into why they are challenging to classify and enabling a better understanding of both the successes and failures of the ML models. This opens up the possibility for discussions between data scientists and domain specialists, supporting collaborative decision-making.
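
To make the idea of Instance Hardness concrete, the following is a minimal sketch and not the estimator used in the article, which the abstract does not specify. It assumes one hardness measure from the literature, k-Disagreeing Neighbors (kDN): the fraction of an instance's k nearest neighbors that carry a different label. The toy data, the function names `kdn_hardness`, `split_easy_hard`, and `feature_mean`, and the choice of k are all hypothetical and serve only as illustration.

```python
import math


def kdn_hardness(X, y, k=3):
    """Illustrative kDN score: share of each instance's k nearest
    neighbors (Euclidean distance) whose label differs from its own."""
    scores = []
    for i, xi in enumerate(X):
        # Distance from instance i to every other instance, nearest first.
        dists = sorted(
            (math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i
        )
        neighbors = [j for _, j in dists[:k]]
        disagree = sum(1 for j in neighbors if y[j] != y[i])
        scores.append(disagree / len(neighbors))
    return scores


def split_easy_hard(scores, n=2):
    """Return indices of the n easiest and n hardest instances."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    return order[:n], order[-n:]


def feature_mean(X, indices, feature):
    """Mean of one input feature over a group of instances."""
    return sum(X[i][feature] for i in indices) / len(indices)


# Hypothetical toy data (assumption): two well-separated clusters, plus one
# class-1 instance (index 8) placed inside the class-0 cluster.
X = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1),
    (1.0, 1.0), (1.1, 1.0), (1.0, 1.1), (1.1, 1.1),
    (0.05, 0.05),
]
y = [0, 0, 0, 0, 1, 1, 1, 1, 1]

scores = kdn_hardness(X, y, k=3)
easy, hard = split_easy_hard(scores, n=2)

# Contrast one feature between the easiest and hardest groups.
print("hardness scores:", [round(s, 2) for s in scores])
print("easy:", easy, "mean feature 0:", feature_mean(X, easy, 0))
print("hard:", hard, "mean feature 0:", feature_mean(X, hard, 0))
```

In this sketch the misplaced instance receives the highest score because all of its nearest neighbors belong to the other class. Per-instance scores of this kind are what allow an analysis to move from a dataset-level average to a patient-level view.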

Index Terms

Understanding the performance of machine learning models from data- to patient-level
1. Computing methodologies
  1. Machine learning

Published In

Journal of Data and Information Quality Volume 16, Issue 4

December 2024

122 pages

EISSN:1936-1963

DOI:10.1145/3613719

Editor:
Felix Naumann
Hasso Plattner Institute, Germany

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 December 2024

Online AM: 13 September 2024

Accepted: 19 May 2024

Revised: 26 January 2024

Received: 30 May 2023

Published in JDIQ Volume 16, Issue 4

Qualifiers

Research-article

Funding Sources

Fundação de Amparo à Pesquisa do Estado de São Paulo
Coordination for the Improvement of Higher Education Personnel – Brazil (CAPES)
