An improved Random Forest based on Feature Selection and Feature weighting for case retrieval in CBR system Application to medical data
: The medical diagnostic process works very similarly to the Case Based Reasoning (CBR) cycle scheme. CBR is a problem solving approach based on the reuse of past experiences called cases. To improve the performance of the retrieval phase, a Random Forest (RF) model is proposed, in this respect we used this algorithm in three different ways (three different algorithms): Classic Random Forest (CRF) algorithm, Random Forest with Feature Selection (RF_FS) algorithm where we selected the most important attributes and deleted the less important ones and Weighted Random Forest (WRF) algorithm where we weighted the most important attributes by giving them more weight. We did this by multiplying the entropy with the weight corresponding to each attribute.We tested our three algorithms CRF, RF_FS and WRF with CBR on data from 11 medical databases and compared the results they produced. We found that WRF and RF_FS give better results than CRF. The experiemental results show the performance and robustess of the proposed approach.