Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Effect of a Process Mining based Pre-processing Step in Prediction of the Critical Health Outcomes

[Uncaptioned image] Negin Ashrafi
Department of Industrial and Systems Engineering
University of Southern California
Los Angeles, CA 90089
ashrafin@usc.edu
&[Uncaptioned image] Armin Abdollahi
Department of Electrical Engineering
University of Southern California
Los Angeles, CA 90089
arminabd@usc.edu
\AND[Uncaptioned image] Greg Placencia
Department of Industrial and Manufacturing Engineering
California State Polytechnic University-Pomona
Pomona, CA 91768
gvplacencia@cpp.edu
&[Uncaptioned image] Maryam Pishgar
Department of Industrial and Systems Engineering
University of Southern California
Los Angeles, CA 90089
pishgar@usc.edu
Abstract

Predicting critical health outcomes such as patient mortality and hospital readmission is essential for improving survivability. However, healthcare datasets have many concurrences that create complexities, leading to poor predictions. Consequently, pre-processing the data is crucial to improve its quality. In this study, we use an existing pre-processing algorithm, concatenation, to improve data quality by decreasing the complexity of datasets. Sixteen healthcare datasets were extracted from two databases - MIMIC III and University of Illinois Hospital - converted to the event logs, they were then fed into the concatenation algorithm. The pre-processed event logs were then fed to the Split Miner (SM) algorithm to produce a process model. Process model quality was evaluated before and after concatenation using the following metrics: fitness, precision, F-Measure, and complexity. The pre-processed event logs were also used as inputs to the Decay Replay Mining (DREAM) algorithm to predict critical outcomes. We compared predicted results before and after applying the concatenation algorithm using Area Under the Curve (AUC) and Confidence Intervals (CI). Results indicated that the concatenation algorithm improved the quality of the process models and predictions of the critical health outcomes.

Keywords Critical health outcomes  \cdot concatenation algorithm  \cdot deep learning  \cdot predictions  \cdot process mining

1 Introduction

Predicting critical health outcomes such as patient mortality and hospital readmission is essential for improving survivability. However, healthcare datasets have many concurrences that create complexities, leading to poor predictions. Consequently, pre-processing the data is crucial to improve its quality. In this study, we use an existing pre-processing algorithm, concatenation, to improve data quality by decreasing the complexity of datasets.

2 Methods

Sixteen healthcare datasets were extracted from two databases - MIMIC III and University of Illinois Hospital - converted to the event logs, they were then fed into the concatenation algorithm. The pre-processed event logs were then fed to the Split Miner (SM) algorithm to produce a process model. Process model quality was evaluated before and after concatenation using the following metrics: fitness, precision, F-Measure, and complexity. The pre-processed event logs were also used as inputs to the Decay Replay Mining (DREAM) algorithm to predict critical outcomes.

Refer to caption
Figure 1: An example figure.

3 Results

We compared predicted results before and after applying the concatenation algorithm using Area Under the Curve (AUC) and Confidence Intervals (CI). Results indicated that the concatenation algorithm improved the quality of the process models and predictions of the critical health outcomes.

Table 1: An example table
Metric Value
Fitness 0.95
Precision 0.85
S(τ)=F(τ)C(τ)M(τ)𝑆𝜏direct-sum𝐹𝜏𝐶𝜏𝑀𝜏S(\tau)=F(\tau)\oplus C(\tau)\oplus M(\tau)italic_S ( italic_τ ) = italic_F ( italic_τ ) ⊕ italic_C ( italic_τ ) ⊕ italic_M ( italic_τ ) (1)

4 Conclusion

The concatenation algorithm improved the quality of the process models and predictions of the critical health outcomes.

References

  • (1) A. Author, B. Author, and C. Author, "Title of the paper," Journal Name, vol. xx, no. xx, pp. xx-xx, Year.