Designing clinically translatable artificial intelligence systems for high-dimensional medical imaging

Shad, Rohan; Cunningham, John P.; Ashley, Euan A.; Langlotz, Curtis P.; Hiesinger, William

doi:10.1038/s42256-021-00399-8

Perspective
Published: 16 November 2021

Designing clinically translatable artificial intelligence systems for high-dimensional medical imaging

Nature Machine Intelligence volumeÂ 3,Â pages 929â935 (2021)Cite this article

12k Accesses
35 Altmetric
Metrics details

Subjects

Abstract

The National Institutes of Health in 2018 identified key focus areas for the future of artificial intelligence in medical imaging, creating a foundational roadmap for research in image acquisition, algorithms, data standardization and translatable clinical decision support systems. Among the key issues raised in the report, data availability, the need for novel computing architectures and explainable artificial intelligence algorithms are still relevant, despite the tremendous progress made over the past few years alone. Furthermore, translational goals of data sharing, validation of performance for regulatory approval, generalizability and mitigation of unintended bias must be accounted for early in the development process. In this Perspective, we explore challenges unique to high-dimensional clinical imaging data, in addition to highlighting some of the technical and ethical considerations involved in developing machine learning systems that better represent the high-dimensional nature of many imaging modalities. Furthermore, we argue that methods that attempt to address explainability, uncertainty and bias should be treated as core components of any clinical machine learning system.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Cloud-based collaborative annotation workflows.**

**Fig. 2: Quantifying uncertainty in machine learning outputs.**

**Fig. 3: Misleading nature of post-hoc model explanations.**

Machine learning for medical imaging: methodological failures and recommendations for the future

Article Open access 12 April 2022

Artificial intelligence and machine learning in cancer imaging

Article Open access 27 October 2022

Predicting cancer outcomes with radiomics and artificial intelligence in radiology

Article 18 October 2021

References

Rajpurkar, P. et al. Deep learning for chest radiograph diagnosis: a retrospective comparison of the CheXNeXt algorithm to practicing radiologists. PLoS Med. 15, e1002686 (2018).
ArticleÂ Google ScholarÂ
Rajpurkar, P. et al. AppendiXNet: deep learning for diagnosis of appendicitis from a small dataset of CT exams using video pretraining. Sci. Rep. 10, 3958 (2020).
ArticleÂ Google ScholarÂ
Huang, S.-C. et al. PENetâa scalable deep-learning model for automated diagnosis of pulmonary embolism using volumetric CT imaging. npj Digit. Med. 3, 61 (2020).
ArticleÂ Google ScholarÂ
Ouyang, D. et al. Video-based AI for beat-to-beat assessment of cardiac function. Nature https://doi.org/10.1038/s41586-020-2145-8 (2020).
Ghorbani, A. et al. Deep learning interpretation of echocardiograms. npj Digit. Med. 3, 10 (2020).
ArticleÂ Google ScholarÂ
Poplin, R. et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat. Biomed. Eng. 2, 158â164 (2018).
ArticleÂ Google ScholarÂ
McKinney, S. M. et al. International evaluation of an AI system for breast cancer screening. Nature 577, 89â94 (2020).
ArticleÂ Google ScholarÂ
Yim, J. et al. Predicting conversion to wet age-related macular degeneration using deep learning. Nat. Med. 26, 892â899 (2020).
ArticleÂ Google ScholarÂ
Beede, E. et al. A human-centered evaluation of a deep learning system deployed in clinics for the detection of diabetic retinopathy. In Proc. 2020 CHI Conference on Human Factors in Computing Systems 1â12 (ACM, 2020); https://doi.org/10.1145/3313831.3376718
Allen, B. et al. A road map for translational research on artificial intelligence in medical imaging: from the 2018 National Institutes of Health/RSNA/ACR/The Academy Workshop. J. Am. Coll. Radiol. 16, 1179â1189 (2019).
ArticleÂ Google ScholarÂ
Paszke, A. et al. PyTorch: an imperative style, high-performance deep learning library. Preprint at https://arxiv.org/abs/1912.01703 (2019).
Abadi, M. et al. TensorFlow: large-scale machine learning on heterogeneous distributed systems. Preprint at https://arxiv.org/abs/1603.04467v2 (2016).
Langlotz, C. P. et al. A roadmap for foundational research on artificial intelligence in medical imaging: from the 2018 NIH/RSNA/ACR/The Academy Workshop. Radiology 291, 781â791 (2019).
ArticleÂ Google ScholarÂ
Ulloa Cerna, A. E. et al. Deep-learning-assisted analysis of echocardiographic videos improves predictions of all-cause mortality. Nat. Biomed. Eng. https://doi.org/10.1038/s41551-020-00667-9 (2021).
Raghunath, S. et al. Prediction of mortality from 12-lead electrocardiogram voltage data using a deep neural network. Nat. Med. 26, 886â891 (2020).
ArticleÂ Google ScholarÂ
Oren, O., Gersh, B. J. & Bhatt, D. L. Artificial intelligence in medical imaging: switching from radiographic pathological data to clinically meaningful endpoints. Lancet Digit. Health 2, e486âe488 (2020).
ArticleÂ Google ScholarÂ
Mildenberger, P., Eichelberg, M. & Martin, E. Introduction to the DICOM standard. Eur. Radiol. 12, 920â927 (2002).
ArticleÂ Google ScholarÂ
Mesterhazy, J., Olson, G. & Datta, S. High performance on-demand de-identification of a petabyte-scale medical imaging data lake. Preprint at https://arxiv.org/abs/2008.01827 (2020).
Mason, D. et al. pydicom/pydicom: pydicom 2.1.0. Zenodo https://doi.org/10.5281/ZENODO.4197955 (2020).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357â362 (2020).
ArticleÂ Google ScholarÂ
Rubin, D. L. et al. Automated tracking of quantitative assessments of tumor burden in clinical trials. Transl. Oncol. 7, 23â35 (2014).
ArticleÂ Google ScholarÂ
Kaissis, G. A., Makowski, M. R., RÃ¼ckert, D. & Braren, R. F. Secure, privacy-preserving and federated machine learning in medical imaging. Nat. Mach. Intell. 2, 305â311 (2020).
ArticleÂ Google ScholarÂ
Chang, K. et al. Distributed deep learning networks among institutions for medical imaging. J. Am. Med. Inform. Assoc. 25, 945â954 (2018).
ArticleÂ Google ScholarÂ
Balachandar, N., Chang, K., Kalpathy-Cramer, J. & Rubin, D. L. Accounting for data variability in multi-institutional distributed deep learning for medical imaging. J. Am. Med. Inform. Assoc. 27, 700â708 (2020).
ArticleÂ Google ScholarÂ
Xu, Y. et al. A collaborative online AI engine for CT-based COVID-19 diagnosis. Preprint at medRxiv https://doi.org/10.1101/2020.05.10.20096073 (2020).
Kaissis, G. et al. End-to-end privacy preserving deep learning on multi-institutional medical imaging. Nat. Mach. Intell. 3, 473â484 (2021).
ArticleÂ Google ScholarÂ
Warnat-Herresthal, S. et al. Swarm learning for decentralized and confidential clinical machine learning. Nature 594, 265â270 (2021).
ArticleÂ Google ScholarÂ
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 84â90 (2017).
ArticleÂ Google ScholarÂ
Anwar, S., Barnes, N. & Petersson, L. A systematic evaluation: fine-grained CNN vs. traditional CNN classifiers. Preprint at https://arxiv.org/abs/2003.11154 (2020).
He, K., Zhang, X., Ren, S. & Sun, J. Identity mappings in deep residual networks. Preprint at https://arxiv.org/abs/1603.05027 (2016).
Hara, K., Kataoka, H. & Satoh, Y. Learning spatio-temporal features with 3D residual networks for action recognition. Preprint at https://arxiv.org/abs/1708.07632 (2017).
Tan, M. & Le, Q. V. EfficientNet: rethinking model scaling for convolutional neural networks. Preprint at https://arxiv.org/abs/1905.11946 (2019).
Carreira, J. & Zisserman, A. Quo vadis, action recognition? A new model and the kinetics dataset. Preprint at https://arxiv.org/abs/1705.07750 (2018).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. Preprint at http://arxiv.org/abs/1409.1556 (2014).
Marcel, S. & Rodriguez, Y. Torchvision the machine-vision package of torch. In Proc. International Conference on Multimedia - MM â10 1485 (ACM, 2010); https://doi.org/10.1145/1873951.1874254
Zhang, J. et al. Fully automated echocardiogram interpretation in clinical practice. Circulation 138, 1623â1635 (2018).
ArticleÂ Google ScholarÂ
Taleb, A. et al. 3D self-supervised methods for medical imaging. Preprint at https://arxiv.org/abs/2006.03829v3 (2020).
Shad, R. et al. Predicting post-operative right ventricular failure using video-based deep learning. Nat. Commun. 12, 5192 (2021).
ArticleÂ Google ScholarÂ
Carreira, J., Noland, E., Banki-Horvath, A., Hillier, C. & Zisserman, A. A short note about Kinetics-600. Preprint at https://arxiv.org/abs/1808.01340 (2018).
Raghu, M., Zhang, C., Kleinberg, J. & Bengio, S. Transfusion: understanding transfer learning for medical imaging. Preprint at https://arxiv.org/abs/1902.07208 (2019).
Zhang, Y., Jiang, H., Miura, Y., Manning, C. D. & Langlotz, C. P. Contrastive learning of medical visual representations from paired images and text. Preprint at https://arxiv.org/abs/2010.00747 (2020).
Real, E., Aggarwal, A., Huang, Y. & Le, Q. V. Regularized evolution for image classifier architecture search. Preprint at https://arxiv.org/abs/1802.01548 (2019).
Piergiovanni, A., Angelova, A., Toshev, A. & Ryoo, M. Evolving space-time neural architectures for videos. In 2019 IEEE/CVF International Conf. Computer Vision (ICCV) 1793â1802 (IEEE, 2019); https://doi.org/10.1109/ICCV.2019.00188
Yamashita, R., Long, J., Saleem, A., Rubin, D. L. & Shen, J. Deep learning predicts postsurgical recurrence of hepatocellular carcinoma from digital histopathologic images. Sci. Rep. 11, 2047 (2021).
ArticleÂ Google ScholarÂ
Mobadersany, P. et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl Acad. Sci. USA 115, E2970âE2979 (2018).
ArticleÂ Google ScholarÂ
Kvamme, H., Borgan, Ã. & Scheel, I. Time-to-event prediction with neural networks and Cox regression. Preprint at https://arxiv.org/abs/1907.00825 (2019).
Sensoy, M., Kaplan, L. & Kandemir, M. Evidential deep learning to quantify classification uncertainty. Preprint at https://arxiv.org/abs/1806.01768 (2018).
Callaway, E. âIt will change everythingâ: DeepMindâs AI makes gigantic leap in solving protein structures. Nature 588, 203â204 (2020).
ArticleÂ Google ScholarÂ
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature https://doi.org/10.1038/s41586-021-03819-2 (2021).
Abdar, M. et al. A review of uncertainty quantification in deep learning: Techniques, applications and challenges. Inform. Fusion 76, 243â297 (2021).
ArticleÂ Google ScholarÂ
Goddard, K., Roudsari, A. & Wyatt, J. C. Automation bias: a systematic review of frequency, effect mediators, and mitigators. J. Am. Med. Inform. Assoc. 19, 121â127 (2012).
ArticleÂ Google ScholarÂ
Bach, S. et al. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS ONE 10, e0130140 (2015).
ArticleÂ Google ScholarÂ
Selvaraju, R. R. et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. Int. J. Comput. Vis. 128, 336â359 (2020).
ArticleÂ Google ScholarÂ
Adebayo, J. et al. Sanity checks for saliency maps. Preprint at https://arxiv.org/abs/1810.03292 (2020).
Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 1, 206â215 (2019).
ArticleÂ Google ScholarÂ
Arun, N. et al. Assessing the (un)trustworthiness of saliency maps for localizing abnormalities in medical imaging. Preprint at https://arxiv.org/abs/2008.02766 (2020).
Hughes, J. W. et al. Deep learning prediction of biomarkers from echocardiogram videos. Preprint at medRxiv https://doi.org/10.1101/2021.02.03.21251080 (2021).
DeGrave, A. J., Janizek, J. D. & Lee, S.-I. AI for radiographic COVID-19 detection selects shortcuts over signal. Nat. Mach. Intell. https://doi.org/10.1038/s42256-021-00338-7 (2021).
Pierson, E., Cutler, D. M., Leskovec, J., Mullainathan, S. & Obermeyer, Z. An algorithmic approach to reducing unexplained pain disparities in underserved populations. Nat. Med. 27, 136â140 (2021).
ArticleÂ Google ScholarÂ
Obermeyer, Z., Powers, B., Vogeli, C. & Mullainathan, S. Dissecting racial bias in an algorithm used to manage the health of populations. Science 366, 447â453 (2019).
ArticleÂ Google ScholarÂ
Chen, I. Y. et al. Ethical machine learning in health care. Preprint at https://arxiv.org/abs/2009.10576 (2020).
Huang, S.-C., Pareek, A., Seyyedi, S., Banerjee, I. & Lungren, M. P. Fusion of medical imaging and electronic health records using deep learning: a systematic review and implementation guidelines. npj Digit. Med. 3, 136 (2020).
ArticleÂ Google ScholarÂ
TomaÅ¡ev, N. et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature 572, 116â119 (2019).
ArticleÂ Google ScholarÂ
Esteva, A. et al. Deep learning-enabled medical computer vision. npj Digit. Med. 4, 5 (2021).
ArticleÂ Google ScholarÂ
Shrikumar, A., Greenside, P. & Kundaje, A. Learning important features through propagating activation differences. Preprint at https://arxiv.org/abs/1704.02685 (2019).
Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2, 56â67 (2020).
ArticleÂ Google ScholarÂ
Pfohl, S. R., Foryciarz, A. & Shah, N. H. An empirical characterization of fair machine learning for clinical risk prediction. J. Biomed. Inform. 113, 103621 (2021).
ArticleÂ Google ScholarÂ
Agarwal, A., Beygelzimer, A., DudÃk, M., Langford, J. & Wallach, H. A Reductions approach to fair classification. Preprint at https://arxiv.org/abs/1803.02453 (2018).
Shapley, L. S. A value for n-person games. Contrib. Theory Games 2, 307â317 (1953).
MathSciNetÂ MATHÂ Google ScholarÂ

Download references

Acknowledgements

R.S. was supported in part by the American Heart Association Postdoctoral Fellowship Award (grant number 834986).

Author information

Authors and Affiliations

Department of Cardiothoracic Surgery, Stanford University, Palo Alto, CA, USA
Rohan ShadÂ &Â William Hiesinger
Department of Statistics, Columbia University, New York, NY, USA
John P. Cunningham
Department of Cardiovascular Medicine, Genetics, and Biomedical Data Science, Stanford University, Stanford, CA, USA
Euan A. Ashley
Center for Artificial Intelligence in Medicine and Imaging, Stanford University, Stanford, CA, USA
Euan A. Ashley,Â Curtis P. LanglotzÂ &Â William Hiesinger
Department of Radiology and Biomedical Informatics, Stanford University, Stanford, CA, USA
Curtis P. Langlotz

Authors

Rohan Shad
View author publications
You can also search for this author in PubMedÂ Google Scholar
John P. Cunningham
View author publications
You can also search for this author in PubMedÂ Google Scholar
Euan A. Ashley
View author publications
You can also search for this author in PubMedÂ Google Scholar
Curtis P. Langlotz
View author publications
You can also search for this author in PubMedÂ Google Scholar
William Hiesinger
View author publications
You can also search for this author in PubMedÂ Google Scholar

Corresponding author

Correspondence to William Hiesinger.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Machine Intelligence thanks Pearse Keane, Yipeng Hu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shad, R., Cunningham, J.P., Ashley, E.A. et al. Designing clinically translatable artificial intelligence systems for high-dimensional medical imaging. Nat Mach Intell 3, 929â935 (2021). https://doi.org/10.1038/s42256-021-00399-8

Download citation

Received: 23 March 2021
Accepted: 07 September 2021
Published: 16 November 2021
Issue Date: November 2021
DOI: https://doi.org/10.1038/s42256-021-00399-8