Abstract
We propose a new method for building a classifier ensemble, based on subgroup discovery techniques in data mining. We apply subgroup discovery techniques to a labeled training dataset to discover interesting subsets, characterized by a conjuctive logical expression (rule), where such subset has an unusually high dominance of one class. Treating these rules as base classifiers, we propose several simple ensemble methods to construct a single classifier. Another novel aspect of the paper is that it applies these ensemble methods, along with standard anomaly detection and classification, to automatically identify high potential (HIPO) employees - an important problem in management. HIPO employees are critical for future-proofing the organization in the face of attrition, economic uncertainties and business challenges. Current HR processes for HIPO identification are manual and suffer from subjectivity, bias and disagreements. Proposed data-driven analytics algorithms address some of these issues. We show that the new ensemble methods perform better than other methods, including other ensemble methods on a real-life case-study dataset of a large multinational IT services company.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hewitt Associates: Getting to high potential: how organizations define and calibrate their critical talent. Hewitt Associates (2008). http://www.hewitt.com
Atzmüller, M., Puppe, F.: SD-Map – a fast algorithm for exhaustive subgroup discovery. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 6–17. Springer, Heidelberg (2006)
Atzmuller, M.: Knowledge intensive subgroup mining: techniques for automatic and interactive discovery. Aka Akademische Verlagsgsellschaft (2007)
Azzara, J.: Identifying High Potential Employees. PeopleTalentSolutions (2007)
Barnett, R.: Identifying High Potential Talent. MDA Leadership Consulting Inc. (2008)
Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Buckingham, M., Vosburgh, R.M.: The 21st century human resource function: it’s the talent, stupid!. Hum. Resour. Plann. 24(4), 17–23 (2001)
Bueno, C.M., Tubbs, S.L.: Identifying global leadership competencies: an exploratory study. J. Am. Acad. Bus. 5(1–2), 80–87 (2004)
Corporate Leadership Council: Realizing the Full Potential of Rising Talent (Volume I): A Quantitative Analysis of the Identification and Development of High Potential Employees. Corporate Executive Board (2005)
Dries, N., Pepermans, R.: Using emotional intelligence to identify high potential: a metacompetency perspective. Leadersh. Organ. Dev. J. 28(8), 749–770 (2007)
Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: Proceedings of the Thirteenth International Conference on Machine Learning (ICML), pp. 148–156 (1996)
Friedman, J., Fisher, I.: Bump hunting in high-dimensional data. Stat. Comput. 9, 123–143 (1999)
Gelens, J., Hofmans, J., Dries, N., Pepermans, R.: Talent management and organisational justice: employee reactions to high potential identification. Hum. Resour. Manag. J. 24(2), 159–175 (2014)
Gemberger, D., Lavrac, N.: Expert guided subgroup discovery: methodology and application. J. Artif. Intell. Res. 17, 501–527 (2002)
Jerusalim, R.S., Hausdorf, P.A.: Managers’ justice perceptions of high potential identification practices. J. Manag. Dev. 26(10), 933–950 (2007)
Kavsek, B., Lavrac, N., Jovanoski, V.: APRIORI-SD: adapting association rule learning to subgroup discovery. In: Berthold, M., Lenz, H.-J., Bradley, E., Kruse, R., Borgelt, C. (eds.) IDA 2003. LNCS, vol. 2810, pp. 230–241. Springer, Heidelberg (2003)
Klosgen, W.: Explora: a multipattern and multistrategy discovery assistant. In: Advances in Knowledge Discovery and Data Mining, pp. 249–271. MIT Press (1996)
Knorr, E.M., Ng, R.T., Tucakov, V.: Distance-based outliers: algorithms and applications. VLDB J. 8(3–4), 237–253 (2000)
Lavrac, N., Cestnik, B., Gemberger, D., Flach, P.: Subgroup discovery with CN2-SD. Mach. Learn. 57, 115–143 (2004)
Lavrac, N., Kavsek, B., Flach, P., Todorovski, L.: Subgroup discovery with CN2-SD. J. Mach. Learn. Res. 5, 153–188 (2004)
Lombardo, M.M., Eichinger, R.W.: Do rising stars avoid risk?: status-based labels and decision making. High Potentials High Learn. 39(4), 321–329 (2000)
Pepermans, R., Vloeberghs, D., Perkisas, B.: High potential identification policies: an empirical study among belgian companies. J. Manag. Dev. 22(8), 660–678 (2003)
Ramaswamy, S., Rastogi, R., Shim, K.: Efficient algorithms for mining outliers from large datasets. In: Proceedings of SIGMOD 2000, pp. 162–172 (2000)
Rogers, R.W., Smith, A.B.: Finding future perfect senior leaders: spotting executive potential. In: Development Dimensions International (2007)
Scheffer, T., Wrobel, S.: Finding the most interesting patterns in a database quickly by using sequential sampling. J. Mach. Learn. Res. 3, 833–862 (2002)
Scholtz, M.: Sampling based sequential subgroup mining. In: Proceedings of the 11th SIG KDD, pp. 265–274 (2005)
Spreitzer, G.M., McCall, M.W., Mahoney, J.D.: Early identification of international executive potential. J. Appl. Psychol. 82(1), 6–29 (1997)
Wells, S.J.: Who’s next: creating a formal program for developing new leaders can pay huge dividends, but many firms aren’t reaping those rewards. HR Mag. 48(11), 44–64 (2003)
Wrobel, S.: An algorithm for multi-relational discovery of subgroups. In: Komorowski, J., Żytkow, J.M. (eds.) PKDD 1997. LNCS, vol. 1263, pp. 78–87. Springer, Heidelberg (1997)
Zhi-Hua, Z.: Ensemble Methods: Foundations and Algorithms. Chapman and Hall/CRC, Boca Raton (2012)
Acknowledegments
The authors thank Dr. Ritu Anand, Preeti Gulati for their support and our team members for much help.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Palshikar, G.K., Sahu, K., Srivastava, R. (2016). Ensembles of Interesting Subgroups for Discovering High Potential Employees. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J., Wang, R. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science(), vol 9652. Springer, Cham. https://doi.org/10.1007/978-3-319-31750-2_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-31750-2_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31749-6
Online ISBN: 978-3-319-31750-2
eBook Packages: Computer ScienceComputer Science (R0)