Abstract
The ability to forecast job advertisement demands is vital to enhance the customer retention rate for recruitment companies. On top of that, it is uneconomical to cold call every individual on a regular basis for companies with a large pool of customers. This paper presents a novel approach in predicting the re-ordering demand of a potential group of SMEs customers in a large online recruitment company. Two feature selection techniques, namely Correlation-based Feature Selection (CFS) and Subset Consistency (SC) Feature Selection, were applied to predictive models in this study. The predictive models were compared with other similar models in the absence of feature selections. Results of various experiments show that those models using feature selections generally outperform those without feature selections. The results support the authors’ hypothesis that the predictive model can perform better and further ahead than similar methods that exclude feature selection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kim, Y.S., Street, W.N.: An Intelligent System for Customer Targeting: A Data Mining Approach. Decision Support Systems 37, 215–228 (2004)
Bella, A., Ferri, C., Hernández-Orallo, J., RamĂrez-Quintana, M.J.: Joint Cutoff Probabilistic Estimation Using Simulation: A Mailing Campaign Application. In: Yin, H., Tino, P., Corchado, E., Byrne, W., Yao, X. (eds.) IDEAL 2007. LNCS, vol. 4881, pp. 609–619. Springer, Heidelberg (2007)
Cox, L.A.: Data Mining and Causal Modeling of Customer Behaviors. Telecommunication Systems 21, 349–381 (2002)
Hung, S.Y., Yen, C., Wang, H.Y.: Applying Data Mining to Telecom Churn Management. Expert Systems with Applications 31, 515–524 (2006)
Au, W.H., Chan, K.C.C.: Mining Fuzzy Association Rules in A Bank-Account Database. IEEE Transactions on Fuzzy Systems 11, 238–248 (2003)
Kumar, D.A., Ravi, V.: Predicting Credit Card Customer Churn in Banks Using Data Mining. Int. J. Data Anal. Tech. Strateg. 1, 4–28 (2008)
Ling, C.X., Li, C.: Data Mining for Direct Marketing: Problems and Solutions. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 73–79 (1998)
Blum, A.L., Langley, P.: Selection of Relevant Features and Examples in Machine Learning. Artificial Intelligence 97, 245–271 (1997)
Kohavi, R., John, G.H.: Wrappers for Feature Subset Selection. Artificial Intelligence 97, 273–324 (1997)
Jain, A.K., Duin, R.P.W., Mao, J.: Statistical Pattern Recognition: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 4–37 (2000)
Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Elashoff, J.D., Elashoff, R.M., Goldman, G.E.: On the Choice of Variables in Classification Problems with Dichotomous Variables. Biometrika 54, 668–670 (1967)
Toussaint, G.: Note on Optimal Selection of Independent Binary-Valued Features for Pattern Recognition (Corresp.). IEEE Transactions on Information Theory 17, 618 (1971)
Tirelli, T., Pessani, D.: Importance of Feature Selection in Decision-Tree and Artificial-Neural-Network Ecological Applications. Alburnus Alburnus Alborella: A Practical Example. Ecological Informatics (in press, corrected proof, Available online November 30, 2010)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Holte, R.C.: Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Machine Learning 11, 63–91 (1993)
Liu, H., Motoda, H.: Computational Methods of Feature Selection. Chapman & Hall/CRC, Boca Raton (2007)
Chen, K., Liu, H.: Towards An Evolutionary Algorithm: Comparison of Two Feature Selection Algorithms. In: Proceedings of the Congress on Evolutionary Computation, vol. 2, pp. 1309–1313 (1999)
Liu, H., Setiono, R.: A Probabilistic Approach to Feature Selection - A Filter Solution. In: Proceedings of the Thirteenth International Conference on Machine Learning, pp. 319–327 (1996)
Hall, M.: Correlation-Based Feature Selection for Discrete and Numeric Class Machine Learning. In: Langley, P. (ed.) Proceedings of the Seventeenth International Conference on Machine Learning, pp. 359–366 (2000)
Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C., Wirth, R.: CRISP-DM 1.0: Step-by-Step Data Mining Guide. CRISP-DM consortium (2000)
Ngai, E.W.T., Xiu, L., Chau, D.C.K.: Application of Data Mining Techniques in Customer Relationship Management: A Literature Review and Classification. Expert Systems with Applications 36, 2592–2602 (2009)
Provost, F., Fawcett, T.: Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions. In: Heckerman, D., Mannila, H., Pregibon, D., Uthurusamy, R. (eds.) Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pp. 43–48. AAAI Press, California (1997)
Huang, J., Ling, C.X.: Using AUC and Accuracy in Evaluating Learning Algorithms. IEEE Transactions on Knowledge and Data Engineering 17, 299–310 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tan, DW., Sim, YW., Yeoh, W. (2011). Applying Feature Selection Methods to Improve the Predictive Model of a Direct Marketing Problem. In: Mohamad Zain, J., Wan Mohd, W.M.b., El-Qawasmeh, E. (eds) Software Engineering and Computer Systems. ICSECS 2011. Communications in Computer and Information Science, vol 179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22170-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-22170-5_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22169-9
Online ISBN: 978-3-642-22170-5
eBook Packages: Computer ScienceComputer Science (R0)