Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3503181.3503202acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccseConference Proceedingsconference-collections
research-article

A Survey on Knowledge Enhanced EHR Data Mining

Published: 01 March 2022 Publication History

Abstract

EHR contains detailed information about a large number of patients. In the past ten years, EHR-related research has involved various fields, and research in various fields is inseparable from the acquisition of knowledge in EHR. The rapid development of various fields in recent years has brought new challenges to the acquisition of knowledge in EHR. At the same time, the knowledge enhancement technology that has emerged in recent years has effectively improved this problem. Therefore, more and more researches personnel applied knowledge enhancement technology to EHR. In this article, we summarized the literature in this area. We first summarized the knowledge types of EHR, and then made a sub-statement on knowledge extraction and modeling. Next, the correct representation method of knowledge is given, and finally, the specific application in each field is summarized. We hope this article can continue to promote the development of knowledge enhancement technology in EHR.

References

[1]
D. Arya, D. K. Gupta, S. Rudinac, and M. Worring. 2020. HyperSAGE: Generalizing Inductive Representation Learning on Hypergraphs. (2020).
[2]
Junjie Bai, Biao Gong, Yining Zhao, Fuqiang Lei, Chenggang Yan, and Yue Gao. 2021. Multi-Scale Representation Learning on Hypergraph for 3D Shape Retrieval and Recognition. IEEE Transactions on Image Processing(2021).
[3]
Syed Khairul Bashar, Md Billal Hossain, Eric Ding, Allan J Walkey, David D McManus, and Ki H Chon. 2020. Atrial fibrillation detection during sepsis: study on MIMIC III ICU data. IEEE Journal of Biomedical and Health Informatics 24, 11(2020), 3124–3135.
[4]
Seda Bilaloglu, Vincent J Major, Himanshu Grover, Isabel Metzger, and Yindalon Aphinyanaphongs. 2021. Expanding the Reach of Structured EHR Data with Clinical Notes. In The International FLAIRS Conference Proceedings, Vol. 34.
[5]
A. G. Bready, M. Simons, and S. Mo Ss. 2017. The Diagnosis of Celiac Disease in the EHR: Garbage in, Garbage out?: 1201. Official journal of the American College of Gastroenterology | ACG 112 (2017).
[6]
Andrius Budrionis, Magda Miara, Piotr Miara, Szymon Wilk, and Johan Gustav Bellika. 2021. Benchmarking PySyft Federated Learning Framework on MIMIC-III Dataset. IEEE Access 9(2021), 116869–116878.
[7]
Lw. C. Chan, Y. Liu, C. R. Shyu, and I. F. F. Benzie. 2011. A SNOMED supported ontological vector model for subclinical disorder detection using EHR similarity. Engineering Applications of Artificial Intelligence 24, 8(2011), 1398–1409.
[8]
Mingxuan Che, Kui Yao, Chao Che, Zhangwei Cao, and Fanchen Kong. 2021. Knowledge-Graph-Based Drug Repositioning against COVID-19 by Graph Convolutional Network with Attention Mechanism. Future Internet 13, 1 (2021), 13.
[9]
I. Y. Chen, M. Agrawal, S. Horng, and D. Sontag. 2019. Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph.
[10]
C. Chronaki, A. Shahin, and R. Mark. 2015. Designing reliable cohorts of cardiac patients across MIMIC and eICU. In Computing in Cardiology Conference.
[11]
Spiros Denaxas, Helen Parkinson, Natalie Fitzpatrick, Cathie Sudlow, and Harry Hemingway. 2019. Analyzing the heterogeneity of rule-based EHR phenotyping algorithms in CALIBER and the UK Biobank. (2019).
[12]
Z. Di and C. Weng. 2011. Combining knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction. Journal of Biomedical Informatics 44, 5 (2011), 859–868.
[13]
C. Ehrentraut, K. Sundström, and H. Dalianis. 2015. Exploration of known and unknown early symptoms of cervical cancer and development of a symptom spectrum - Outline of a data and text mining based approach. (2015).
[14]
J. A. Fries. 2016. Brundlefly at SemEval-2016 Task 12: Recurrent Neural Networks vs. Joint Inference for Clinical Temporal Information Extraction. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016).
[15]
M. Gupta, Tlt Phan, T. Bunnell, and R. Beheshti. 2019. Obesity Prediction with EHR Data: A deep learning approach with interpretable elements. (2019).
[16]
Guthrie, S., Birkhead, Michael, Klompas, Nirav, R., and Shah. 2015. Uses of Electronic Health Records for Public Health Surveillance to Advance Public Health. Annual Review of Public Health(2015).
[17]
S. Hirano and S. Tsumoto. 2014. Mining Typical Order Sequences from EHR for Building Clinical Pathways. Springer International Publishing(2014).
[18]
A. Jagannatha and H. Yu. 2016. Bidirectional Recurrent Neural Networks for Medical Event Detection in Electronic Health Records. (2016).
[19]
A. Jagannatha and H. Yu. 2016. Structured prediction models for RNN based sequence labeling in clinical text. In Proc Conf Empir Methods Nat Lang Process.
[20]
X. Jing, S. Kay, T. Marley, and N. R. Hardiker. 2014. Integration of an OWL-DL Knowledge Base with an EHR Prototype and Providing Customized Information. Journal of Medical Systems 38, 9 (2014).
[21]
Aew Johnson, T. J. Pollard, L. A. Celi, and R. G. Mark. 2017. Analyzing the eICU Collaborative Research Database. In the 8th ACM International Conference.
[22]
Eunji Jun, Ahmad Wisnu Mulyadi, Jaehun Choi, and Heung-Il Suk. 2020. Uncertainty-gated stochastic sequential model for ehr mortality prediction. IEEE Transactions on Neural Networks and Learning Systems (2020).
[23]
R. J. Kate, N. Pearce, D Mazumdar, and V. Nilakantan. 2019. Continual Prediction from EHR Data for Inpatient Acute Kidney Injury. (2019).
[24]
Vipina K. Keloth, Shuxin Zhou, Luke Lindemann, Gai Elhanan, and Yehoshua Perl. 2020. Mining Concepts for a COVID Interface Terminology for Annotation of EHRs. In 2020 IEEE International Conference on Big Data (Big Data).
[25]
Jie Wei Kong, Yonghui Xu, and Han Yu. 2019. Deep Transfer Learning For Abnormality Detection. In Proceedings of the 4th International Conference on Crowd Science and Engineering. 233–237.
[26]
Alexander L Kostrinsky-Thomas, Fuki M Hisama, and Thomas H Payne. 2021. Searching the PDF Haystack: Automated Knowledge Discovery in Scanned EHR Documents. Applied Clinical Informatics 12, 02 (2021), 245–250.
[27]
Z. Liang, J. Liu, A. Ou, H. Zhang, Z. Li, and J. Huang. 2018. Deep generative learning for automated EHR diagnosis of traditional Chinese medicine.Computer Methods Programs in Biomedicine(2018), S0169260717303826.
[28]
C. Y. Liu, W. B. Sun, W. H. Chao, and W. X. Che. 2013. Convolution Neural Network for Relation Extraction. Springer Berlin Heidelberg(2013).
[29]
Xinbo Lv, Yi Guan, Jinfeng Yang, and Jiawei Wu. 2016. Clinical Relation Extraction with Deep Learning. International Journal of Hybrid Information Technology 9, 7 (2016), 237–248.
[30]
Scott Alexander Malec, Elmer Victor Bernstam, Peng Wei, Trevor Cohen, and Richard David Boyce. 2020. Using computable knowledge mined from the literature to elucidate confounders for EHR-based pharmacovigilance. (2020).
[31]
D. J. Mcinerney, B. Dabiri, A. S. Touret, G. Young, Vdm Jan-Willem, and B. C. Wallace. 2020. Query-Focused EHR Summarization to Aid Imaging Diagnosis. (2020).
[32]
C. Meng, L. Trinh, N. Xu, and Y. Liu. 2021. MIMIC-IF: Interpretability and Fairness Evaluation of Deep Learning Models on MIMIC-IV Dataset. (2021).
[33]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and their Compositionality. Advances in neural information processing systems 26 (2013).
[34]
Chérubin Mugisha and Incheon Paik. 2020. Pneumonia Outcome Prediction Using Structured And Unstructured Data From EHR. In 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 2640–2646.
[35]
Zhale Nowroozilarki, Arash Pakbin, James Royalty, Donald K. K. Lee, and Bobak J. Mortazavi. 2021. Real-time Mortality Prediction Using MIMIC-IV ICU Data Via Boosted Nonparametric Hazards. In 2021 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI).
[36]
M. Panahiazar, V. Taslimitehrani, N. L. Pereira, and J. Pathak. 2015. Using EHRs for Heart Failure Therapy Recommendation Using Multidimensional Patient Similarity Analytics. other 210(2015).
[37]
R. Pavani. 2018. Implementing a Social Knowledge Networking (SKN) system to enable meaningful use of an EHR medication reconciliation system. Risk Management Healthcare Policy 11 (2018), 45–53.
[38]
Suresh Pokharel, Guido Zuccon, and Yu Li. 2020. Representing EHRs with Temporal Tree and Sequential Pattern Mining for Similarity Computing. Advanced Data Mining and Applications.
[39]
Atefeh Rajabalizadeh, Shaodong Wang, Mahdi Javadi, Nima Safaei, Mojtaba Talafidaryani, Qing Li, Wenli Zhang, and Mahdi Moqri. 2020. In-depth Evaluation of APACHE Scoring System Using eICU Database. (2020).
[40]
P. Rangachari, K. C. Dellsperger, and R. K. Rethemeyer. 2019. A Health System’s Pilot Experience with Using Social Knowledge Networking (SKN) Technology to Enable Meaningful Use of EHR Medication Reconciliation Technology. other 3, 22 (2019).
[41]
Laila Rasmy, Jie Zhu, Zhiheng Li, Xin Hao, and Degui Zhi. 2019. Simple Recurrent Neural Networks is all we need for clinical events predictions using EHR data. In Medinfo 2019.
[42]
M. A. Schwertner, S. J. Rigo, D. A. Araujo, A. B. Silva, and B. Eskofier. 2019. Fostering Natural Language Question Answering Over Knowledge Bases in Oncology EHR. In 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS).
[43]
T. Searle, Z. Ibrahim, and R. Dobson. 2020. Experimental Evaluation and Development of a Silver-Standard for the MIMIC-III Clinical Coding Dataset. (2020).
[44]
Yong Shang, Yu Tian, Min Zhou, Tianshu Zhou, and Jingsong Li. 2021. EHR-Oriented Knowledge Graph System: Toward Efficient Utilization of Non-Used Information Buried in Routine Clinical Practice. IEEE Journal of Biomedical and Health Informatics PP, 99(2021), 1–1.
[45]
S. Sheikhalishahi, V. Balaraman, and V. Osmani. 2019. Benchmarking machine learning models on multi-centre eICU critical care dataset. (2019).
[46]
Hannes Ulrich, Paul Behrend, Joshua Wiedekopf, Cora Drenkhahn, Ann-Kristin Kock-Schoppenhauer, and Josef Ingenerf. 2021. Hands on the Medical Informatics Initiative Core Data Set—Lessons Learned from Converting the MIMIC-IV. In German Medical Data Sciences 2021: Digital Medicine: Recognize–Understand–Heal. IOS Press, 119–126.
[47]
B. Westra, S. Johnson, S. Ali, K. Bavuso, C. Cruz, S. Collins, M. Furukawa, M. Hook, A. Laflamme, and K. Lytle. 2018. Validation and Refinement of a Pain Information Model from EHR Flowsheet Data. Applied Clinical Informatics 09, 01 (2018), 185–198.
[48]
T. Wu, Y. Wang, Y. Wang, E. Zhao, Y. Yuan, and Z. Yang. 2019. Representation Learning of EHR Data via Graph-Based Medical Entity Embedding. (2019).
[49]
A Xm, A Ti, B Es, C Ska, D Kk, E Rk, and C Koa. 2021. EHR2CCAS: A framework for mapping EHR to disease knowledge presenting causal chain of disorders – chronic kidney disease example - ScienceDirect. Journal of Biomedical Informatics 115 (2021).
[50]
Yonghui Xu, Shengjie Sun, Huiguo Zhang, Chang’an Yi, Yuan Miao, Dong Yang, Xiaonan Meng, Yi Hu, Ke Wang, Huaqing Min, 2021. Time-aware graph embedding: A temporal smoothness and task-oriented approach. ACM Transactions on Knowledge Discovery from Data (TKDD) 16, 3(2021), 1–23.
[51]
Yonghui Xu, Han Yu, Yuguang Yan, Yang Liu, 2020. Multi-component transfer metric learning for handling unrelated source domain samples. Knowledge-Based Systems 203 (2020), 106132.
[52]
P. Yadav, M. Steinbach, V. Kumar, and G. Simon. 2017. Mining Electronic Health Records: A Survey. Acm Computing Surveys 50, 6 (2017).
[53]
Dapeng Yan, Hui Cao, Tao Wang, Ruilin Chen, and Shuangsi Xue. 2021. Graph-based Knowledge Acquisition with Convolutional Networks for Distribution Network Patrol Robots. IEEE Transactions on Artificial Intelligence (2021).
[54]
S. Yang, Y. Liu, Y. Xu, C. Miao, and J. Zhang. 2020. Contextualized Graph Attention Network for Recommendation with Item Knowledge Graph. (2020).
[55]
D. Zeng, K. Liu, S. Lai, G. Zhou, and J. Zhao. 2014. Relation classification via convolutional deep neural network. (2014).
[56]
D. Zhang and D. Wang. 2015. Relation Classification via Recurrent Neural Network. Computer Science (2015).
[57]
Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, and Bo Xu. 2016. Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).

Cited By

View all
  • (2023)Integrating domain knowledge for biomedical text analysis into deep learningJournal of Biomedical Informatics10.1016/j.jbi.2023.104418143:COnline publication date: 1-Jul-2023

Index Terms

  1. A Survey on Knowledge Enhanced EHR Data Mining
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Other conferences
          ICCSE '21: 5th International Conference on Crowd Science and Engineering
          October 2021
          182 pages
          ISBN:9781450395540
          DOI:10.1145/3503181
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 01 March 2022

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Data Mining
          2. EHR
          3. Knowledge Enhancement
          4. Knowledge Graph

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Funding Sources

          • Fundamental Research Funds of Shandong University

          Conference

          ICCSE '21

          Acceptance Rates

          Overall Acceptance Rate 92 of 247 submissions, 37%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)40
          • Downloads (Last 6 weeks)1
          Reflects downloads up to 30 Aug 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2023)Integrating domain knowledge for biomedical text analysis into deep learningJournal of Biomedical Informatics10.1016/j.jbi.2023.104418143:COnline publication date: 1-Jul-2023

          View Options

          Get Access

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media