Abstract
The pervasive adoption of Artificial Intelligence (AI) and Machine Learning (ML) applications has exponentially increased the demand for efficient resource allocation, workload scheduling, and parallel computing capabilities in cloud environments. This research addresses the critical need for enhancing both the scalability and security of AI/ML workloads in cloud computing settings. The study emphasizes the optimization of resource allocation strategies to accommodate the diverse requirements of AI/ML workloads. Efficient resource allocation ensures that computational resources are utilized judiciously, avoiding bottlenecks and latency issues that could hinder the performance of AI/ML applications. The research explores advanced parallel computing techniques to harness the full possible cloud infrastructure, enhancing the speed and efficiency of AI/ML computations. The integration of robust security measures is crucial to safeguard sensitive data and models processed in the cloud. The research delves into secure multi-party computation and encryption techniques like the Hybrid Heft Pso Ga algorithm, Heuristic Function for Adaptive Batch Stream Scheduling Module (ABSS) and allocation of resources parallel computing and Kuhn–Munkres algorithm tailored for AI/ML workloads, ensuring confidentiality and integrity throughout the computation lifecycle. To validate the proposed methodologies, the research employs extensive simulations and real-world experiments. The proposed ABSS_SSMM method achieves the highest accuracy and throughput values of 98% and 94%, respectively. The contributions of this research extend to the broader cloud computing and AI/ML communities. By providing scalable and secure solutions, the study aims to empower cloud service providers, enterprises, and researchers to leverage AI/ML technologies with confidence. The findings are anticipated to inform the design and implementation of next-generation cloud platforms that seamlessly support the evolving landscape of AI/ML applications, fostering innovation and driving the adoption of intelligent technologies in diverse domains.
Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study.
References
Adil, M., Nabi, S., Aleem, M., Diaz, V.G., Lin, J.C.W.: CA-MLBS: content-aware machine learning-based load balancing scheduler in the cloud environment. Expert. Syst. 40(4), 13150 (2023)
Lu, Y. Bian, S. Chen, L. He, Y. Hui, Y. Lentz, M. Li, B. Liu, F. Li, J. Liu, Q., Liu, R.: Computing in the era of large generative models: from cloud-native to AI-native (2024). arXiv preprint arXiv:2401.12230
Mart, J., Oyetoro, A., Amah, U.: Best practices for running workloads in public cloud environments. Science Open Preprints (2023).
Khan, M.M.I., Nencioni, G.: Resource allocation in networking and computing systems: a security and dependability perspective. IEEE Access 11, 89433 (2023)
Hoefler, T. Copik, M. Beckman, P. Jones, A. Foster, I. Parashar, M. Reed, D. Troyer, M. Schulthess, T. Ernst, D., Dongarra, J.: XaaS: acceleration as a service to enable productive high-performance cloud computing (2024). arXiv preprint arXiv:2401.04552
Ali, S.A.: Desigining secure and robust e-commerce platform for public cloud. Asian Bull. Big Data Manag. 3(1), 164 (2023)
Mangalampalli, S., Karri, G.R., Mohanty, S.N., Ali, S., Khan, M.I., Abdullaev, S., Alqahtani, S.A.: Multi-objective Prioritized Task Scheduler using improved Asynchronous advantage actor critic (a3c) algorithm in multi cloud environment. IEEE Access 91, 407 (2024)
Alqahtani, A.: Multi-objective Prioritized Task Scheduler using improved Asynchronous advantage actor critic (a3c) algorithm in multi cloud environment (2024)
Cai, Q., Xiao, G., Lin, S., Yang, W., Li, K., Li, K.: ABSS: an adaptive batch-stream scheduling module for dynamic task parallelism on chiplet-based multi-chip systems. ACM Trans. Parallel Comput. 11, 1 (2024)
Wubben, J.: Distributed management and coordination of UAV swarms based on infrastructure less wireless networks (Doctoral dissertation, Universitesi Polytechnic de Valencia) (2023)
Pati, C.: Search using Grover’s Algorithm (2023)
Njeri, N.: Quantum computing algorithms for solving complex optimization problems. J. Adv. Technol. Syst. 1(1), 24–34 (2023)
Bhoumick, D. Mitra, D. Chowdhury, D.R., Nath, A.: A comprehensive study on implementation of grover’s search algorithm on quantum processors. Int. J. 11(5) (2023)
Tuli, S., Mirhakimi, F., Pallewatta, S., Zawad, S., Casale, G., Javadi, B., Yan, F., Buyya, R., Jennings, N.R.: AI augmented edge and fog computing: trends and challenges. J. Netw. Comput. Appl. 216, 103648 (2023)
GUPTA, A.: Transforming organizations through cloud computing (Doctoral dissertation) (2023)
Panesar, G.S., Chadha, R.: A hybrid optimization algorithm for efficient virtual machine migration and task scheduling using a cloud-based adaptive multi-agent deep deterministic policy gradient technique. Int. J. Intell. Syst. Appl. Eng. 12(6s), 30–45 (2024)
Singla, A., Malhotra, T.: Challenges and opportunities in scaling AI/ML pipelines. J. Sci. Technol. 5(1), 1–21 (2024)
Grzesik, P., Mrozek, D.: Combining machine learning and edge computing: opportunities, challenges, platforms, frameworks, and use cases. Electronics 13(3), 640 (2024)
Akindote, O.J., Adegbite, A.O., Dawodu, S.O., Omotosho, A., Anyanwu, A.: Innovation in data storage technologies: from cloud computing to edge computing. Comput. Sci. IT Res. J. 4(3), 273–299 (2023)
Theodoropoulos, T., Rosa, L., Benzaid, C., Gray, P., Marin, E., Makris, A., Cordeiro, L., Diego, F., Sorokin, P., Girolamo, M.D., Barone, P.: Security in cloud-native services: a survey. J. Cybersecur. Privacy 3(4), 758–793 (2023)
Donta, K.: Murturi, I.: Casamayor Pujol, V.: Sedlak, B. and Dustdar, S.: Exploring the potential of distributed computing continuum systems. Computers, 12(10), 198 (2023).
Taleb, T., Benzaïd, C., Addad, R.A., Samdanis, K.: AI/ML for beyond 5G systems: concepts, technology enablers & solutions. Comput. Netw. 237, 110044 (2023)
Kamdjou, H.M. Baudry, D. Havard, V., Ouchani, S.: Resource-constrained extended reality operated with digital twin in industrial Internet of Things. IEEE Open J. Commun. Soc. (2024)
Author information
Authors and Affiliations
Contributions
All authors contributed to the design and implementation of the research, to the analysis of the results and to the writing of the manuscript
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Priyadarshini, S., Sawant, T.N., Bhimrao Yadav, G. et al. Enhancing security and scalability by AI/ML workload optimization in the cloud. Cluster Comput 27, 13455–13469 (2024). https://doi.org/10.1007/s10586-024-04641-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-024-04641-x