Abstract
The fast and accurate estimation of makespan is essential for the determination of the delivery date and the sustainable development of the enterprise. In this paper, a high-quality training dataset is constructed and an adaptive ensemble model is proposed to achieve fast and accurate makespan estimation. First, both the logistics features extracted by the Pearson correlation coefficient and the new meaningful nonlinear combination features dug out by gene expression programming are first involved in this paper for constructing a high-quality dataset. Secondly, an improved clustering with elbow criterion and a resampling operation are applied simultaneously to generate representative subsets; and correspondingly, several back propagation neural network (BPNN) with the architecture optimized by genetic algorithm are trained by these subsets respectively to generate effective diverse learners; and then, a K-nearest neighbor based dynamic weight combination strategy which is sensitive to current testing sample is proposed to make full use of the learner’s positive effects and avoid its negative effects. Finally, the results of effective experiments prove that both the newly involved features and the improvements in the proposed ensemble are effective. In addition, comparison experiments confirm that the proposed enhanced ensemble of BPNNs outperforms significantly the prevailing approaches, including single, ensemble and hybrid models. And hence, the proposed model can be utilized as a convenient and reliable tool to support customer order acceptance.
Similar content being viewed by others
References
Ahmadizar, F., & Shahmaleki, P. (2014). Group-shop scheduling with sequence-dependent set-up and transportation times. Applied Mathematical Modelling, 38(21–22), 5080–5091. https://doi.org/10.1016/j.apm.2014.03.035.
Chang, J., Kong, X., & Yin, L. (2015). A novel approach for product makespan prediction in production life cycle. The International Journal of Advanced Manufacturing Technology, 80(5–8), 1433–1448. https://doi.org/10.1007/s00170-015-7093-3.
Chen, K., Jiang, J., Zheng, F., & Chen, K. (2018). A novel data-driven approach for residential electricity consumption prediction based on ensemble learning. Energy, 150, 49–60. https://doi.org/10.1016/j.energy.2018.02.028.
Dai, M., Tang, D., Giret, A., & Salido, M. A. (2019). Multi-objective optimization for energy-efficient flexible job shop scheduling problem with transportation constraints. Robotics and Computer-Integrated Manufacturing, 59, 143–157. https://doi.org/10.1016/j.rcim.2019.04.006.
De Jong, A. W., Rubrico, J. I. U., Adachi, M., Nakamura, T., & Ota, J. (2019). A generalised makespan estimation for shop scheduling problems, using visual data and a convolutional neural network. International Journal of Computer Integrated Manufacturing, 32(6), 559–568. https://doi.org/10.1080/0951192x.2019.1599430.
Duan, C., Deng, C., Gharaei, A., Wu, J., & Wang, B. (2018). Selective maintenance scheduling under stochastic maintenance quality with multiple maintenance actions. International Journal of Production Research, 56(23), 7160–7178. https://doi.org/10.1080/00207543.2018.1436789.
Fransoo, J. C., de Kok, T. G., & Paulli, J. (1995). Makespan estimations in flexible manufacturing systems. Working Papers, Department of Mathematical Sciences, University of Aarhus.
Giri, B. C., & Bardhan, S. (2014). Coordinating a supply chain with backup supplier through buyback contract under supply disruption and uncertain demand. International Journal of Systems Science: Operations & Logistics, 1(4), 193–204. https://doi.org/10.1080/23302674.2014.951714.
Gong, H., Sun, Y., Shu, X., & Huang, B. (2018). Use of random forests regression for predicting IRI of asphalt pavements. Construction and Building Materials, 189, 890–897. https://doi.org/10.1016/j.conbuildmat.2018.09.017.
Han, Q., Liu, J., Shen, Z., Liu, J., & Gong, F. (2020). Vector partitioning quantization utilizing K-means clustering for physical layer secret key generation. Information Sciences, 512, 137–160. https://doi.org/10.1016/j.ins.2019.09.076.
Javed, K., Gouriveau, R., Li, X., & Zerhouni, N. (2016). Tool wear monitoring and prognostics challenges: A comparison of connectionist methods toward an adaptive ensemble model. Journal of Intelligent Manufacturing, 29(8), 1873–1890. https://doi.org/10.1007/s10845-016-1221-2.
Karimi, S., Ardalan, Z., Naderi, B., & Mohammadi, M. (2017). Scheduling flexible job-shops with transportation times: Mathematical models and a hybrid imperialist competitive algorithm. Applied Mathematical Modelling, 41, 667–682. https://doi.org/10.1016/j.apm.2016.09.022.
Karthiga, R., & Mangai, S. (2019). Feature selection using multi-objective modified genetic algorithm in multimodal biometric system. Journal of Medical Systems, 43(7), 214. https://doi.org/10.1007/s10916-019-1351-0.
Kei Fong, M. W., Ng, E. Y. K., Er Zi Jian, K., & Hong, T. J. (2019). SVR ensemble-based continuous blood pressure prediction using multi-channel photoplethysmogram. Computers in Biology and Medicine. https://doi.org/10.1016/j.compbiomed.2019.103392.
Kuo, C.-F. J., Hsu, C.-T. M., Liu, Z.-X., & Wu, H.-C. (2014). Automatic inspection system of LED chip using two-stages back-propagation neural network. Journal of Intelligent Manufacturing, 25(6), 1235–1243. https://doi.org/10.1007/s10845-012-0725-7.
Li, K., Chen, Y., Li, W., He, J., & Xue, Y. (2018). Improved gene expression programming to solve the inverse problem for ordinary differential equations. Swarm and Evolutionary Computation, 38, 231–239. https://doi.org/10.1016/j.swevo.2017.07.005.
Li, S., Li, Y., Liu, Y., & Xu, Y. (2007). A GA-based NN approach for makespan estimation. Applied Mathematics and Computation, 185(2), 1003–1014. https://doi.org/10.1016/j.amc.2006.07.024.
Lin, J., Chen, H., Li, S., Liu, Y., Li, X., & Yu, B. (2019). Accurate prediction of potential druggable proteins based on genetic algorithm and Bagging-SVM ensemble classifier. Artificial Intelligence in Medicine, 98, 35–47. https://doi.org/10.1016/j.artmed.2019.07.005.
Manzini, M., & Urgo, M. (2015). Makespan estimation of a production process affected by uncertainty: Application on MTO production of NC machine tools. Journal of Manufacturing Systems, 37, 1–16. https://doi.org/10.1016/j.jmsy.2015.10.001.
Mendes-Moreira, J., Soares, C., Jorge, A. M., & Sousa, J. F. D. (2012). Ensemble approaches for regression: A survey. ACM Computing Surveys, 45(1), 1–40. https://doi.org/10.1145/2379776.2379786.
Mirshekarian, S., & Šormaz, D. N. (2016). Correlation of job-shop scheduling problem features with scheduling efficiency. Expert Systems with Applications, 62, 131–147. https://doi.org/10.1016/j.eswa.2016.06.014.
Nie, L., Gao, L., Li, P., & Li, X. (2012). A GEP-based reactive scheduling policies constructing approach for dynamic flexible job shop scheduling problem with job release dates. Journal of Intelligent Manufacturing, 24(4), 763–774. https://doi.org/10.1007/s10845-012-0626-9.
Raaymakers, W. H. M., & Weijters, A. J. M. M. (2003). Makespan estimation in batch process industries: A comparison between regression analysis and neural networks. European Journal of Operational Research, 145, 14–30.
Rayal, R., Khanna, D., Sandhu, J. K., Hooda, N., & Rana, P. S. (2017). N-semble: neural network based ensemble approach. International Journal of Machine Learning and Cybernetics, 10(2), 337–345. https://doi.org/10.1007/s13042-017-0718-0.
Salo, F., Nassif, A. B., & Essex, A. (2019). Dimensionality reduction with IG-PCA and ensemble classifier for network intrusion detection. Computer Networks, 148, 164–175. https://doi.org/10.1016/j.comnet.2018.11.010.
Sayyadi, R., & Awasthi, A. (2016). A simulation-based optimisation approach for identifying key determinants for sustainable transportation planning. International Journal of Systems Science: Operations & Logistics, 5(2), 161–174. https://doi.org/10.1080/23302674.2016.1244301.
Sayyadi, R., & Awasthi, A. (2018). An integrated approach based on system dynamics and ANP for evaluating sustainable transportation policies. International Journal of Systems Science: Operations & Logistics. https://doi.org/10.1080/23302674.2018.1554168.
Shafaei, R., Rabiee, M., & Mirzaeyan, M. (2011). An adaptive neuro fuzzy inference system for makespan estimation in multiprocessor no-wait two stage flow shop. International Journal of Computer Integrated Manufacturing, 24(10), 888–899. https://doi.org/10.1080/0951192x.2011.597430.
Shah, N. H., Chaudhari, U., & Cárdenas-Barrón, L. E. (2018). Integrating credit and replenishment policies for deteriorating items under quadratic demand in a three echelon supply chain. International Journal of Systems Science: Operations & Logistics. https://doi.org/10.1080/23302674.2018.1487606.
Tasnim, S., Rahman, A., Oo, A. M. T., & Haque, M. E. (2017). Wind power prediction using cluster based ensemble regression. International Journal of Computational Intelligence and Applications, 16(4), 1700. https://doi.org/10.1142/s1469026817500262.
Wang, J., Yang, J., Zhang, J., Wang, X., & Zhang, W. (2018a). Big data driven cycle time parallel prediction for production planning in wafer manufacturing. Enterprise Information Systems, 12(6), 714–732. https://doi.org/10.1080/17517575.2018.1450998.
Wang, J., & Zhang, J. (2016). Big data analytics for forecasting cycle time in semiconductor wafer fabrication system. International Journal of Production Research, 54(23), 7231–7244. https://doi.org/10.1080/00207543.2016.1174789.
Wang, J., Zhang, J., & Wang, X. (2018b). A data driven cycle time prediction with feature selection in a semiconductor wafer fabrication system. IEEE Transactions on Semiconductor Manufacturing, 31(1), 173–182. https://doi.org/10.1109/tsm.2017.2788501.
Wenny, H. M., Raaymakers, J., Bertrand, J. M. W., & Fransoo, J. C. (2001). Makespan estimation in batch process industries using aggregate resource and job set characteristics. International Journal of Production Economics, 70, 145–161.
Wu, C., Chen, T., Jiang, R., Ning, L., & Jiang, Z. (2015). A novel approach to wavelet selection and tree kernel construction for diagnosis of rolling element bearing fault. Journal of Intelligent Manufacturing, 28(8), 1847–1858. https://doi.org/10.1007/s10845-015-1070-4.
Xiao, J. (2019). SVM and KNN ensemble learning for traffic incident detection. Physica A: Statistical Mechanics and Its Applications, 517, 29–35. https://doi.org/10.1016/j.physa.2018.10.060.
Yang, J., Zeng, X., Zhong, S., & Wu, S. (2013). Effective neural network ensemble approach for improving generalization performance. IEEE Transactions on Neural Networks and Learning Systems, 24(6), 878–887. https://doi.org/10.1109/TNNLS.2013.2246578.
Yang, W.-A., Zhou, W., Liao, W., & Guo, Y. (2014). Prediction of drill flank wear using ensemble of co-evolutionary particle swarm optimization based-selective neural network ensembles. Journal of Intelligent Manufacturing, 27(2), 343–361. https://doi.org/10.1007/s10845-013-0867-2.
Yu, A. J., & Seif, J. (2016). Minimizing tardiness and maintenance costs in flow shop scheduling by a lower-bound-based GA. Computers & Industrial Engineering, 97, 26–40. https://doi.org/10.1016/j.cie.2016.03.024.
Yu-Quan, Z., Ji-Shun, O., Geng, C., & Hai-Ping, Y. (2010). Dynamic weighting ensemble classifiers based on cross-validation. Neural Computing and Applications, 20(3), 309–317. https://doi.org/10.1007/s00521-010-0372-x.
Acknowledgements
The authors would like to thank the anonymous reviewers for their helpful comments and constructive suggestions. This work is supported by the National Natural Science Foundation of China (Nos. 51875421, 51875420).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Cheng, L., Tang, Q., Zhang, Z. et al. Data mining for fast and accurate makespan estimation in machining workshops. J Intell Manuf 32, 483–500 (2021). https://doi.org/10.1007/s10845-020-01585-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10845-020-01585-y