Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Enhancing security and scalability by AI/ML workload optimization in the cloud

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

The pervasive adoption of Artificial Intelligence (AI) and Machine Learning (ML) applications has exponentially increased the demand for efficient resource allocation, workload scheduling, and parallel computing capabilities in cloud environments. This research addresses the critical need for enhancing both the scalability and security of AI/ML workloads in cloud computing settings. The study emphasizes the optimization of resource allocation strategies to accommodate the diverse requirements of AI/ML workloads. Efficient resource allocation ensures that computational resources are utilized judiciously, avoiding bottlenecks and latency issues that could hinder the performance of AI/ML applications. The research explores advanced parallel computing techniques to harness the full possible cloud infrastructure, enhancing the speed and efficiency of AI/ML computations. The integration of robust security measures is crucial to safeguard sensitive data and models processed in the cloud. The research delves into secure multi-party computation and encryption techniques like the Hybrid Heft Pso Ga algorithm, Heuristic Function for Adaptive Batch Stream Scheduling Module (ABSS) and allocation of resources parallel computing and Kuhn–Munkres algorithm tailored for AI/ML workloads, ensuring confidentiality and integrity throughout the computation lifecycle. To validate the proposed methodologies, the research employs extensive simulations and real-world experiments. The proposed ABSS_SSMM method achieves the highest accuracy and throughput values of 98% and 94%, respectively. The contributions of this research extend to the broader cloud computing and AI/ML communities. By providing scalable and secure solutions, the study aims to empower cloud service providers, enterprises, and researchers to leverage AI/ML technologies with confidence. The findings are anticipated to inform the design and implementation of next-generation cloud platforms that seamlessly support the evolving landscape of AI/ML applications, fostering innovation and driving the adoption of intelligent technologies in diverse domains.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Data availability

No datasets were generated or analysed during the current study.

References

  1. Adil, M., Nabi, S., Aleem, M., Diaz, V.G., Lin, J.C.W.: CA-MLBS: content-aware machine learning-based load balancing scheduler in the cloud environment. Expert. Syst. 40(4), 13150 (2023)

    Article  Google Scholar 

  2. Lu, Y. Bian, S. Chen, L. He, Y. Hui, Y. Lentz, M. Li, B. Liu, F. Li, J. Liu, Q., Liu, R.: Computing in the era of large generative models: from cloud-native to AI-native (2024). arXiv preprint arXiv:2401.12230

  3. Mart, J., Oyetoro, A., Amah, U.: Best practices for running workloads in public cloud environments. Science Open Preprints (2023).

  4. Khan, M.M.I., Nencioni, G.: Resource allocation in networking and computing systems: a security and dependability perspective. IEEE Access 11, 89433 (2023)

    Article  Google Scholar 

  5. Hoefler, T. Copik, M. Beckman, P. Jones, A. Foster, I. Parashar, M. Reed, D. Troyer, M. Schulthess, T. Ernst, D., Dongarra, J.: XaaS: acceleration as a service to enable productive high-performance cloud computing (2024). arXiv preprint arXiv:2401.04552

  6. Ali, S.A.: Desigining secure and robust e-commerce platform for public cloud. Asian Bull. Big Data Manag. 3(1), 164 (2023)

    Article  MathSciNet  Google Scholar 

  7. Mangalampalli, S., Karri, G.R., Mohanty, S.N., Ali, S., Khan, M.I., Abdullaev, S., Alqahtani, S.A.: Multi-objective Prioritized Task Scheduler using improved Asynchronous advantage actor critic (a3c) algorithm in multi cloud environment. IEEE Access 91, 407 (2024)

    Google Scholar 

  8. Alqahtani, A.: Multi-objective Prioritized Task Scheduler using improved Asynchronous advantage actor critic (a3c) algorithm in multi cloud environment (2024)

  9. Cai, Q., Xiao, G., Lin, S., Yang, W., Li, K., Li, K.: ABSS: an adaptive batch-stream scheduling module for dynamic task parallelism on chiplet-based multi-chip systems. ACM Trans. Parallel Comput. 11, 1 (2024)

    Article  MathSciNet  Google Scholar 

  10. Wubben, J.: Distributed management and coordination of UAV swarms based on infrastructure less wireless networks (Doctoral dissertation, Universitesi Polytechnic de Valencia) (2023)

  11. Pati, C.: Search using Grover’s Algorithm (2023)

  12. Njeri, N.: Quantum computing algorithms for solving complex optimization problems. J. Adv. Technol. Syst. 1(1), 24–34 (2023)

    Google Scholar 

  13. Bhoumick, D. Mitra, D. Chowdhury, D.R., Nath, A.: A comprehensive study on implementation of grover’s search algorithm on quantum processors. Int. J. 11(5) (2023)

  14. Tuli, S., Mirhakimi, F., Pallewatta, S., Zawad, S., Casale, G., Javadi, B., Yan, F., Buyya, R., Jennings, N.R.: AI augmented edge and fog computing: trends and challenges. J. Netw. Comput. Appl. 216, 103648 (2023)

    Article  Google Scholar 

  15. GUPTA, A.: Transforming organizations through cloud computing (Doctoral dissertation) (2023)

  16. Panesar, G.S., Chadha, R.: A hybrid optimization algorithm for efficient virtual machine migration and task scheduling using a cloud-based adaptive multi-agent deep deterministic policy gradient technique. Int. J. Intell. Syst. Appl. Eng. 12(6s), 30–45 (2024)

    Google Scholar 

  17. Singla, A., Malhotra, T.: Challenges and opportunities in scaling AI/ML pipelines. J. Sci. Technol. 5(1), 1–21 (2024)

    Google Scholar 

  18. Grzesik, P., Mrozek, D.: Combining machine learning and edge computing: opportunities, challenges, platforms, frameworks, and use cases. Electronics 13(3), 640 (2024)

    Article  Google Scholar 

  19. Akindote, O.J., Adegbite, A.O., Dawodu, S.O., Omotosho, A., Anyanwu, A.: Innovation in data storage technologies: from cloud computing to edge computing. Comput. Sci. IT Res. J. 4(3), 273–299 (2023)

    Article  Google Scholar 

  20. Theodoropoulos, T., Rosa, L., Benzaid, C., Gray, P., Marin, E., Makris, A., Cordeiro, L., Diego, F., Sorokin, P., Girolamo, M.D., Barone, P.: Security in cloud-native services: a survey. J. Cybersecur. Privacy 3(4), 758–793 (2023)

    Article  Google Scholar 

  21. Donta, K.: Murturi, I.: Casamayor Pujol, V.: Sedlak, B. and Dustdar, S.: Exploring the potential of distributed computing continuum systems. Computers, 12(10), 198 (2023).

  22. Taleb, T., Benzaïd, C., Addad, R.A., Samdanis, K.: AI/ML for beyond 5G systems: concepts, technology enablers & solutions. Comput. Netw. 237, 110044 (2023)

    Article  Google Scholar 

  23. Kamdjou, H.M. Baudry, D. Havard, V., Ouchani, S.: Resource-constrained extended reality operated with digital twin in industrial Internet of Things. IEEE Open J. Commun. Soc. (2024)

Download references

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the design and implementation of the research, to the analysis of the results and to the writing of the manuscript

Corresponding author

Correspondence to Sabina Priyadarshini.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Priyadarshini, S., Sawant, T.N., Bhimrao Yadav, G. et al. Enhancing security and scalability by AI/ML workload optimization in the cloud. Cluster Comput 27, 13455–13469 (2024). https://doi.org/10.1007/s10586-024-04641-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-024-04641-x

Keywords