Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Weakly Semi-supervised phenotyping using Electronic Health records

Published: 01 October 2022 Publication History

Graphical abstract

Display Omitted

Highlights

WSS-DL yields high phenotyping performance despite using very small numbers of labels.
WSS-DL combines the strengths of deep learning and weakly supervised learning.
Unlike weakly supervised learning algorithms fail for episodic phenotypes that cannot be consistently detected via ICD and NLP data.
WSS-DL has the potential in aiding doctors to phenotype rare diseases as well as conditions susceptible to misdiagnosis.

Abstract

Objective

Electronic Health Record (EHR) based phenotyping is a crucial yet challenging problem in the biomedical field. Though clinicians typically determine patient-level diagnoses via manual chart review, the sheer volume and heterogeneity of EHR data renders such tasks challenging, time-consuming, and prohibitively expensive, thus leading to a scarcity of clinical annotations in EHRs. Weakly supervised learning algorithms have been successfully applied to various EHR phenotyping problems, due to their ability to leverage information from large quantities of unlabeled samples to better inform predictions based on a far smaller number of patients. However, most weakly supervised methods are subject to the challenge to choose the right cutoff value to generate an optimal classifier. Furthermore, since they only utilize the most informative features (i.e., main ICD and NLP counts) they may fail for episodic phenotypes that cannot be consistently detected via ICD and NLP data. In this paper, we propose a label-efficient, weakly semi-supervised deep learning algorithm for EHR phenotyping (WSS-DL), which overcomes the limitations above.

Materials and Methods

WSS-DL classifies patient-level disease status through a series of learning stages: 1) generating silver standard labels, 2) deriving enhanced-silver-standard labels by fitting a weakly supervised deep learning model to data with silver standard labels as outcomes and high dimensional EHR features as input, and 3) obtaining the final prediction score and classifier by fitting a supervised learning model to data with a minimal number of gold standard labels as the outcome, and the enhanced-silver-standard labels and a minimal set of most informative EHR features as input. To assess the generalizability of WSS-DL across different phenotypes and medical institutions, we apply WSS-DL to classify a total of 17 diseases, including both acute and chronic conditions, using EHR data from three healthcare systems. Additionally, we determine the minimum quantity of training labels required by WSS-DL to outperform existing supervised and semi-supervised phenotyping methods.

Results

The proposed method, in combining the strengths of deep learning and weakly semi-supervised learning, successfully leverages the crucial phenotyping information contained in EHR features from unlabeled samples. Indeed, the deep learning model’s ability to handle high-dimensional EHR features allows it to generate strong phenotype status predictions from silver standard labels. These predictions, in turn, provide highly effective features in the final logistic regression stage, leading to high phenotyping accuracy in notably small subsets of labeled data (e.g. n = 40 labeled samples).

Conclusion

Our method’s high performance in EHR datasets with very small numbers of labels indicates its potential value in aiding doctors to diagnose rare diseases as well as conditions susceptible to misdiagnosis.

References

[1]
E.P. Ambinder, Electronic Health Records, J. Oncol. Pract. 1 (2) (2005) 57–63.
[2]
K.P. Liao, T. Cai, G.K. Savova, S.N. Murphy, E.W. Karlson, A.N. Ananthakrishnan, V.S. Gainer, S.Y. Shaw, Z. Xia, P. Szolovits, S. Churchill, I. Kohane, Development of phenotype algorithms using electronic medical records and incorporating natural language processing, BMJ 350 (apr24 11) (2015).
[3]
R.J. Carroll, A.E. Eyler, J.C. Denny, Intelligent use and clinical benefits of electronic health records in rheumatoid arthritis, Expert Rev. Clin. Immunol. 11 (3) (2015) 329–337.
[4]
R.L. Figueroa, Q. Zeng-Treitler, S. Kandula, L.H. Ngo, Predicting sample size required for classification performance, BMC Med. Inform. Decis. Mak. 12 (2012) 8.
[5]
Y. Cheng, F. Wang, P. Zhang, J. Hu, Risk Prediction with Electronic Health Records: A Deep Learning Approach, in: Proceedings of the 2016 SIAM International Conference on Data Mining, 2016,.
[6]
K.B. Wagholikar, H. Estiri, M. Murphy, S.N. Murphy, J. Wren, Polar labeling: silver standard algorithm for training disease classifiers, Bioinformatics 36 (10) (2020) 3200–3206.
[7]
Y. Halpern, S. Horng, Y. Choi, D. Sontag, Electronic medical record phenotyping using the anchor and learn framework, J. Am. Med. Inform. Assoc. 23 (2016) 731–740.
[8]
V. Agarwal et al., Learning statistical models of phenotypes using noisy labeled training data. J. Am. Med. Inform. Assoc. 23, 1166–1173 (2016).
[9]
S. Yu et al., Enabling phenotypic big data with PheNorm. J. Am. Med. Inform. Assoc. 25, 54–60 (2018).
[10]
K.P. Liao, et al., High-throughput multimodal automated phenotyping (MAP) with application to PheWAS, J. Am. Med. Inform. Assoc. 26 (2019) 1255–1262.
[11]
Y. Wang, et al., A clinical text classification paradigm using weak supervision and deep representation, BMC Med. Inform. Decis. Mak. 19 (2019) 1–13.
[12]
Y. Ahuja, et al., sureLDA: A multidisease automated phenotyping method for the electronic health record, J. Am. Med. Inform. Assoc. 27 (2020) 1235–1243.
[13]
T. Ferté, S. Cossin, T. Schaeverbeke, T. Barnetche, V. Jouhet, B.P. Hejblum, PheVis algorithm, J. Biomed. Inform. 117 (2021).
[14]
A. Krizhevsky, I. Sutskever, G.E. Hinton, ImageNet classification with deep convolutional neural networks, Commun. ACM 60 (6) (2017) 84–90.
[15]
L. Oakden-Rayner, Exploring Large-scale Public Medical Image Datasets, Acad. Radiol. 27 (1) (2020) 106–112.
[16]
Y.-H. Hu, W.-C. Lin, C.-F. Tsai, S.-W. Ke, C.-W. Chen, An efficient data preprocessing approach for large scale medical data mining, Technol. Health Care 23 (2015) 153–160.
[17]
K. Yan, X. Wang, L. Lu, R.M. Summers, DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning. J. Med. Imag. (Bellingham) 5, 036501 (2018).
[18]
A.N. Jagannatha, H. Yu, Bidirectional RNN for Medical Event Detection in Electronic Health Records, Proc. Conf. 2016 (2016) 473–482.
[19]
I. Nogues, et al. Automatic Lymph Node Cluster Segmentation Using Holistically-Nested Neural Networks and Structured Optimization in CT Images. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016 388–397 (2016)
[20]
B.K. Beaulieu-Jones, C.S. Greene, & Pooled Resource Open-Access ALS Clinical Trials Consortium. Semi-supervised learning of the electronic health record for phenotype stratification, J. Biomed Inform. 64 (2016) 168–178.
[21]
Y. Zhang, et al., High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP), Nat. Protoc. 14 (2019) 3426–3444.
[22]
C. Benesch, D.M. Witter, A.L. Wilder, P.W. Duncan, G.P. Samsa, D.B. Matchar, Inaccuracy of the International Classification of Diseases (ICD-9-CM) in identifying the diagnosis of ischemic cerebrovascular disease, Neurology. 49 (3) (1997) 660–664.
[23]
E. Birman-Deych, A.D. Waterman, Y. Yan, D.S. Nilasena, M.J. Radford, B.F. Gage, Accuracy of ICD-9-CM codes for identifying cardiovascular and stroke risk factors, Med. Care. 43 (5) (2005) 480–485.
[24]
R.H. White, M. Garcia, B. Sadeghi, D.J. Tancredi, P. Zrelak, J. Cuny, P. Sama, H. Gammon, S. Schmaltz, P.S. Romano, Evaluation of the predictive value of ICD-9-CM coded administrative data for venous thromboembolism in the United States, Thromb. Res. 126 (1) (2010) 61–67.
[25]
C. Zhan, J. Battles, Y.-P. Chiang, et al., The validity of ICD-9-CM codes in identifying postoperative deep vein thrombosis and pulmonary embolism, Jt Comm. J. Qual. Patient Saf. 33 (2007) 326–331.
[26]
S.K. Tedeschi, D.H. Solomon, K.P. Liao, Pseudogout among Patients Fulfilling a Billing Code Algorithm for Calcium Pyrophosphate Deposition Disease, Rheumatol. Int. 38 (6) (2018) 1083–1088.
[27]
Tianrun Cai, Zeling He, Chuan Hong, Yichi Zhang, Yuk-Lam Ho, Jacqueline Honerlaw, Alon Geva, Vidul Ayakulangara Panickan, Amanda King, David R Gagnon, Michael Gaziano, Kelly Cho, Katherine Liao Tianxi Cai, 2022. Scalable Relevance Ranking Algorithm via Semantic Similarity Assessment Improves Efficiency of Medical Chart Review (manuscript under revision).
[28]
S. Yu, T. Cai, T. Cai, T., 2013. NILE: fast natural language processing for electronic health records. arXiv preprint arXiv:1311.6063.

Index Terms

  1. Weakly Semi-supervised phenotyping using Electronic Health records
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Journal of Biomedical Informatics
    Journal of Biomedical Informatics  Volume 134, Issue C
    Oct 2022
    331 pages

    Publisher

    Elsevier Science

    San Diego, CA, United States

    Publication History

    Published: 01 October 2022

    Author Tags

    1. Deep learning
    2. EHR phenotyping
    3. Label efficient
    4. Silver-standard labels
    5. Weakly supervised

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media