Authors
Wencheng Sun, Zhiping Cai, Yangyang Li, Fang Liu, Shengqun Fang, Guoyan Wang
Publication date
2018
Source
Journal of healthcare engineering
Volume
2018
Issue
1
Pages
4302425
Publisher
Hindawi
Description
Currently, medical institutes generally use EMR to record patient’s condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large‐scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of …
Total citations
20182019202020212022202320249284164716230
Scholar articles
W Sun, Z Cai, Y Li, F Liu, S Fang, G Wang - Journal of healthcare engineering, 2018