Abstract
Classification on imbalanced datasets remains a challenge in the machine learning community, because the class-imbalanced problem degrades the performance of many classifiers. This study introduces a two-stage intelligent data preprocessing approach to tackle the class-imbalanced problem. Modifying the penalty parameter of the support vector machine (SVM) moves the discriminating boundary toward the majority class, so that some majority class examples are misclassified as minority class examples. In other words, each majority class example that is misclassified effectively adds one example to the minority class. Executing the SVM as a preprocessor can therefore be used to overcome the class-imbalanced problem. Subsequently, the modified dataset is passed to a random forest to counter the curse of dimensionality. Finally, the preprocessed data are fed into a rule-based classifier to generate comprehensive decision rules. The empirical results indicate that the presented architecture is a promising alternative for the class-imbalanced problem.
References
Ashfaq RAR, Wang XZ, Huang JZ, Abbas H, He YL (2016) Fuzziness based semi-supervised learning approach for intrusion detection system. Inf Sci. doi:10.1016/j.ins.2016.04.019
Bang S, Kang J, Jhun M, Kim E (2016) Hierarchically penalized support vector machine with grouped variables. Int J Mach Learn Cyber. doi:10.1007/s13042-016-0494-2
Barakat N, Diederich J (2005) Eclectic rule-extraction from support vector machines. Int J Comput Intel 2:59–62
Bazzazi AA, Osanloo M, Karimi B (2011) Deriving preference order of open pit mines equipment through MADM methods: application of modified VIKOR method. Expert Syst Appl 38:2550–2556
Borkar P, Sarode MV, Malik LG (2016) Modality of adaptive neuro-fuzzy classifier for acoustic signal-based traffic density state estimation employing linguistic hedges for feature selection. Int J Fuzzy Syst 18:379–394
Breiman L (2001) Random forests. Mach Learn 45:5–32
Chang CW, Wu CR, Lin CT, Chen HC (2007) An application of AHP and sensitivity analysis for selecting the best slicing machine. Comput Ind Eng 52:296–307
Chen X, Fan K, Liu W, Zhang X, Xue M (2015) Discriminative structure discovery via dimensionality reduction for facial image manifold. Neural Comput Appl 26:373–381
Das SP, Padhy S (2015) A novel hybrid model using teaching–learning-based optimization and a support vector machine for commodity futures index forecasting. Int J Mach Learn Cyber. doi:10.1007/s13042-015-0359-0
Eichberger J, Guerdjikova A (2010) Case-based belief formation under ambiguity. Math Soc Sci 60:161–177
Farquad MAH, Bose I (2012) Preprocessing unbalanced data using support vector machine. Decis Support Syst 53:226–233
Feng L, Li T, Ruan D, Gou S (2011) A vague-rough set approach for uncertain knowledge acquisition. Knowl-Based Syst 24:837–843
Feng HM, Wang XZ (2015) Performance improvement of classifier fusion for batch samples based on upper integral. Neural Netw 63:87–93
Friedman M (1974) Explanation and scientific understanding. J Philos 71:5–19
Gaganis C (2009) Classification techniques for the identification of falsified financial statements: a comparative analysis. Intel Syst Account Financ Manag 16:207–229
García S, Fernández A, Herrera F (2009) Enhancing the effectiveness and interpretability of decision tree and rule induction classifiers with evolutionary training set selection over imbalanced problems. Appl Soft Comput 9:1304–1314
Gao M, Hong X, Chen S, Harris CJ (2011) A combined SMOTE and PSO based RBF classifier for two-class imbalanced problems. Neurocomputing 74:3456–3466
Gao X, Fan L, Xu H (2015) Multiple rank multi-linear kernel support vector machine for matrix data classification. Int J Mach Learn Cyber. doi:10.1007/s13042-015-0383-0
Gallant SI (1988) Connectionist expert systems. Commun ACM 31:152–169
Genuer R, Poggi JM, Tuleau-Malot C (2010) Variable selection using random forests. Pattern Recogn Lett 31:2225–2236
Gonzalez-Abril L, Cuberos FJ, Velasco F, Ortega JA (2009) Ameva: an autonomous discretization algorithm. Expert Syst Appl 36:5327–5332
Goode S, Lacey D (2011) Detecting complex account fraud in the enterprise: the role of technical and non-technical controls. Decis Support Syst 50:702–714
Grzymala-Busse JW, Stefanowski J, Wilk S (2005) A comparison of two approaches to data mining from imbalanced data. J Intell Manuf 16:565–573
He Y, Liu NK, Hu Y, Wang X (2015) OWA operator based link prediction ensemble for social network. Expert Syst Appl 42:21–50
He YL, Wang XZ, Huang JZ (2016) Fuzzy nonlinear regression analysis using a random weight network. Inf Sci 364–365:222–240
Kang X, Miao D (2016) A variable precision rough set model based on the granularity of tolerance relation. Knowl Based Syst 102:103–115
Kim HS, Sohn SY (2010) Support vector machines for default prediction of SMEs based on technology credit. Eur J Oper Res 201:838–846
Kwak N, Choi CH (2002) Input feature selection for classification problems. IEEE Trans Neural Netw 13:143–159
Liu Y, Yu X, Huang JX, An A (2011) Combining integrated sampling with SVM ensembles for learning from imbalanced datasets. Inf Process Manag 47:617–631
Ling CX, Sheng VS, Yang Q (2006) Test strategies for cost-sensitive decision tree. IEEE Trans Knowl Data Eng 18:1055–1067
Lin SJ (2016) Hybrid kernelized fuzzy clustering and multiple attributes decision analysis for corporate risk management. Int J Fuzzy Syst. doi:10.1007/s40815-016-0196-7
Lin SJ, Hsu MF (2016) Incorporated risk metrics and hybrid AI techniques for risk management. Neural Comput Appl. doi:10.1007/s00521-016-2253-4
Lin SJ, Chen TF (2016) Multi-agent architecture for corporate operating performance assessment. Neural Process Lett 43:115–132
Liu NK, He YL, Lim HY, Wang XZ (2014) Domain ontology graph model and its application in Chinese text classification. Neural Comput Appl 24:779–798
Mirza B, Lin Z, Toh KA (2013) Weighted online sequential extreme learning machine for class imbalance learning. Neural Process Lett 38:465–486
Nebot V, Berlanga R (2012) Finding association rules in semantic web data. Knowl Based Syst 25:51–62
Opricovic S (1998) Multicriteria optimization of civil engineering systems. Faculty of Civil Engineering, Belgra
Opricovic S, Tzeng GH (2002) Multicriteria planning of post-earthquake sustainable reconstruction. Comput Aided Civil Inf 17:211–220
Opricovic S, Tzeng GH (2004) Compromise solution by MCDM methods: a comparative analysis of VIKOR and TOPSIS. Eur J Oper Res 156:445–455
Orriols-Puig A, Bernadó-Mansilla E (2009) Evolutionary rule-based systems for imbalanced data sets. Soft Comput 13:213–225
Paelinck JHP (1976) Qualitative multiple criteria analysis, environment protection and multiregional development. Region Sci Assoc 36:56–59
Pawlak Z (1982) Rough sets. Int J Comput Inf Sci 11:341–356
Peng Y, Wang G, Kou G, Shi Y (2011) An empirical study of classification algorithm evaluation for financial risk prediction. Appl Soft Comput 11:2906–2915
Rokach L (2010) Ensemble-based classifiers. Artif Intell Rev 33:1–39
Sestito S, Dillon T (1992) Automated knowledge acquisition of rules with continuously valued attributes. In: Proceedings of 12th international conference on expert systems and their applications (AVIGNON'92), Avignon, France, pp 645–656
Sun A, Lim EP, Liu Y (2009) On strategies for imbalanced text classification using SVM: a comparative study. Decis Support Syst 48:191–201
Tan A, Wu W, Li J, Lin G (2016) Evidence-theory-based numerical characterization of multigranulation rough sets in incomplete information systems. Fuzzy Set Syst 294:18–35
Tavana M, Mavi RK, Santos-Arteaga FJ, Doust ER (2016) An extended VIKOR method using stochastic data and subjective judgments. Comput Ind Eng 97:240–247
Wang XZ, Ashfaq RAR, Fu AM (2015) Fuzziness based sample categorization for classifier performance improvement. J Intell Fuzzy Syst 29:1185–1196
Wang XZ (2015) Learning from big data with uncertainty. J Intell Fuzzy Syst 28:2329–2330
Wang L, Chen J, Fan M, Zhao X, Cui H, Cui H (2011) Feature selection and prediction of sub-health state using random forest. Energy Proc 13:5223–5228
Wang G, Ma J, Huang L, Xu K (2012) Two credit scoring models based on dual strategy ensemble trees. Knowl Based Syst 26:61–68
Wang Y (2013) Smooth nonparametric copula estimation with least squares support vector regression. Neural Process Lett 38:81–96
Wu S, Sun M, Yang J (2011) Stochastic neighbor projection on manifold for feature extraction. Neurocomputing 74:2780–2789
Zhao HX, Xing HJ, Wang XZ (2011) Two-stage dimensionality reduction approach based on 2DLDA and fuzzy rough sets technique. Neurocomputing 74:3722–3727
Acknowledgments
The author would like to thank the Ministry of Science and Technology of the Republic of China, Taiwan, for financially supporting this work under Contract No. 104-2410-H-034-023-MY2.
Appendix A VIseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR)
VIKOR was proposed by Opricovic [37] and Opricovic and Tzeng [38] for multi-criteria optimization of complex problems. Opricovic [37] indicated that VIKOR ranks alternatives in the presence of conflicting criteria by generating a multi-criteria ranking index, which is based on a specific measure of closeness to the ideal alternative. The VIKOR procedure is as follows [39].
Step 1 Calculate the best $g_i^{*}$ and the worst $g_i^{-}$ values of all criterion functions, $i = 1, \ldots, n$,
where $J$ denotes the number of alternatives, $n$ the number of criteria, and $g_{ij}$ the rating of the $i$-th criterion function for alternative $b_j$.
Step 2 Calculate the values of $X_j$ and $Y_j$, $j = 1, \ldots, J$,
where $w_i$ is the weight of the $i$-th criterion, and $X_j$ and $Y_j$ are the ranking measures.
Step 3 Calculate the value $Z_j$, $j = 1, \ldots, J$,
where $X^{*}$ is the solution with the maximum group utility, $Y^{*}$ is the solution with a minimum single regret of the opponent, and $v$ is the weight of the strategy of the majority of criteria. This compromise solution is stable within a decision-making process, which could be "voting by majority rule" (when $v > 0.5$ is needed), "by consensus" ($v \approx 0.5$), or "with veto" ($v < 0.5$) [39]. Following prior research [39, 43], the value of $v$ is set to 0.5.
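The display equations for Steps 1 to 3 are not reproduced in this text. As a hedged reference, the commonly used VIKOR formulation, written in this appendix's symbols, is sketched below. It assumes that every criterion is a benefit criterion (larger ratings are better); the symbols $X^{-}$ and $Y^{-}$ for the largest values of $X_j$ and $Y_j$ are introduced here and do not appear in the text above.

```latex
\begin{align}
g_i^{*} &= \max_{j} g_{ij}, &
g_i^{-} &= \min_{j} g_{ij}, \\
X_j &= \sum_{i=1}^{n} w_i \, \frac{g_i^{*} - g_{ij}}{g_i^{*} - g_i^{-}}, &
Y_j &= \max_{i} \left[ w_i \, \frac{g_i^{*} - g_{ij}}{g_i^{*} - g_i^{-}} \right], \\
Z_j &= v \, \frac{X_j - X^{*}}{X^{-} - X^{*}}
     + (1 - v) \, \frac{Y_j - Y^{*}}{Y^{-} - Y^{*}},
\end{align}
% Assumed definitions of the extreme values used in Z_j:
% X^{*} = \min_j X_j, \quad X^{-} = \max_j X_j, \quad
% Y^{*} = \min_j Y_j, \quad Y^{-} = \max_j Y_j.
```

With $v = 0.5$, group utility and individual regret contribute equally to $Z_j$.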
Step 4 Rank the alternatives in decreasing order. This yields three ranking lists, by $X$, $Y$ and $Z$.
Step 5 Propose the alternative $b'$, which is ranked best by $Z$, as the compromise solution if the following two conditions are satisfied [43]:
(a) $Z(b'') - Z(b') \geq 1/(J - 1)$
(b) Alternative $b'$ is ranked the best by $X$ and/or $Y$.
If only condition (b) is violated, alternatives $b'$ and $b''$ are both taken as compromise solutions, where $b''$ is ranked second by $Z$. If condition (a) is violated, alternatives $b', \ldots, b^{(M)}$ are taken as compromise solutions, where $b^{(M)}$ is ranked $M$-th by $Z$ and $M$ is the largest rank satisfying $Z(b^{(M)}) - Z(b') < 1/(J - 1)$.
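The five steps can be sketched in a few lines of Python. This is a minimal illustration rather than the author's implementation. It assumes benefit criteria, treats a smaller $Z$ as better (the usual VIKOR convention), and maps a zero range to 0 to avoid division by zero. The function names `vikor_scores` and `compromise_set` and the sample ratings are hypothetical.

```python
from typing import List, Sequence, Tuple


def _ratio(num: float, den: float) -> float:
    """Return num / den, or 0.0 when the range is zero (assumed convention)."""
    return num / den if den != 0 else 0.0


def vikor_scores(ratings: Sequence[Sequence[float]],
                 weights: Sequence[float],
                 v: float = 0.5) -> Tuple[List[float], List[float], List[float]]:
    """Steps 1-3: ratings[j][i] is g_ij for alternative j and criterion i."""
    n = len(weights)
    best = [max(row[i] for row in ratings) for i in range(n)]   # g_i^*
    worst = [min(row[i] for row in ratings) for i in range(n)]  # g_i^-
    X, Y = [], []
    for row in ratings:
        # Weighted, normalized distance from the best value on each criterion.
        d = [weights[i] * _ratio(best[i] - row[i], best[i] - worst[i])
             for i in range(n)]
        X.append(sum(d))  # group utility measure
        Y.append(max(d))  # individual regret measure
    x_star, x_minus = min(X), max(X)
    y_star, y_minus = min(Y), max(Y)
    Z = [v * _ratio(x - x_star, x_minus - x_star)
         + (1 - v) * _ratio(y - y_star, y_minus - y_star)
         for x, y in zip(X, Y)]
    return X, Y, Z


def compromise_set(X: Sequence[float], Y: Sequence[float],
                   Z: Sequence[float]) -> List[int]:
    """Steps 4-5: indices of the compromise solution(s); needs J >= 2."""
    J = len(Z)
    order = sorted(range(J), key=lambda j: Z[j])  # best (smallest Z) first
    first, second = order[0], order[1]
    threshold = 1.0 / (J - 1)
    cond_a = Z[second] - Z[first] >= threshold
    best_by_x = min(range(J), key=lambda j: X[j])
    best_by_y = min(range(J), key=lambda j: Y[j])
    cond_b = first in (best_by_x, best_by_y)
    if cond_a and cond_b:
        return [first]
    if cond_a:  # only condition (b) is violated
        return [first, second]
    # Condition (a) is violated: keep every alternative close to the best.
    return [j for j in order if Z[j] - Z[first] < threshold]


# Hypothetical data: three alternatives rated on two benefit criteria.
X, Y, Z = vikor_scores([[8, 6], [4, 9], [2, 2]], weights=[0.6, 0.4])
print(compromise_set(X, Y, Z))  # [0, 1]
```

In this example the gap in $Z$ between the two best alternatives is below $1/(J - 1) = 0.5$, so condition (a) fails and both are returned as compromise solutions.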
Cite this article
Lin, SJ. Integrated artificial intelligence-based resizing strategy and multiple criteria decision making technique to form a management decision in an imbalanced environment. Int. J. Mach. Learn. & Cyber. 8, 1981–1992 (2017). https://doi.org/10.1007/s13042-016-0574-3
DOI: https://doi.org/10.1007/s13042-016-0574-3