Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

An Adaptable Approach to Fault Tolerance in Cloud Computing

Published: 31 March 2023 Publication History

Abstract

Existing fault tolerance approaches in the cloud are broadly based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper presents an adaptable fault tolerance method for determining which of the two approaches will be appropriate for the successful execution of a task in the given cloud conditions. The proposed method classifies the failure risk of host machines available for task execution based on their failure history. Subsequently, fuzzy logic is used to determine the appropriate fault tolerance approach by considering a host's failure risk, user-defined task's priority, and level of resource redundancy. Setting a task's priority provides a user with control to solicit a desired fault tolerance level while the availability of resources reflects a cloud provider's capability to offer fault tolerance. Simulation experiments have verified that the proactive selection of a fault-tolerance method increases the number of tasks that complete successfully.

References

[1]
AbdElfattah, EElkawkagy, MEl-Sisi, A. (2017), A reactive fault tolerance approach for cloud computing. 13th International Computer Engineering Conference (ICENCO). IEEE.
[2]
AgarwalK. K.KotakulaH. (2022). Replication Based Fault Tolerance Approach for Cloud. In International Conference on Distributed Computing and Internet Technology, (pp. 163-169). Springer. 10.1007/978-3-030-94876-4_11
[3]
Amin, Z., Singh, H., & Sethi, N. (2015). Review on fault tolerance techniques in cloud computing. International Journal of Computers and Applications, 116(18).
[4]
Amoon, M., El-Bahnasawy, N., Sadi, S., & Wagdi, M. (2019). On the design of reactive approach with flexible checkpoint interval to tolerate faults in cloud computing systems. Journal of Ambient Intelligence and Humanized Computing, 10(11), 4567–4577.
[5]
Amoon, M., El-Bahnasawy, N., Sadi, S., & Wagdi, M. (2019). On the design of reactive approach with flexible checkpoint interval to tolerate faults in cloud computing systems. Journal of Ambient Intelligence and Humanized Computing, 10(11), 4567–4577.
[6]
Attallah, S. M., Fayek, M. B., Nassar, S. M., & Hemayed, E. E. (2021). Proactive load balancing fault tolerance algorithm in cloud computing. Concurrency and Computation, 33(10), e6172.
[7]
Bui, D. M., & Lee, S. (2018). Early fault detection in IaaS cloud computing based on fuzzy logic and prediction technique. The Journal of Supercomputing, 74(11), 5730–5745.
[8]
Buyya, R., Yeo, C. S., Venugopal, S., Broberg, J., & Brandic, I. (2009). Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Generation Computer Systems, 25(6), 599–616.
[9]
Cheraghlou, M. N., Khademzadeh, A., & Haghparast, M. (2019). New fuzzy-based fault tolerance evaluation framework for cloud computing. Journal of Network and Systems Management, 27(4), 930–948.
[10]
Chinnathambi, S., Santhanam, A., Rajarathinam, J., & Senthilkumar, M. (2019). Scheduling and checkpointing optimization algorithm for Byzantine fault tolerance in cloud clusters. Cluster Computing, 22(6), 14637–14650.
[11]
Ejimogu, O. H., & Başaran, S. (2017). A systematic mapping study on soft computing techniques to the cloud environment. Procedia Computer Science, 120, 31–38.
[12]
Gupta, B. B., Agrawal, D. P., Yamaguchi, S., & Sheng, M. (2020). Soft computing techniques for big data and cloud computing. Springer.
[13]
Hasan, M., & Goraya, M. S. (2018). Fault tolerance in cloud computing environment: A systematic survey. Computers in Industry, 99, 156–172.
[14]
Java. (n.d.). Java Point. Java. https://www.javatpoint.com/k-nearest-neighbor-algorithm-for-machine-learning
[15]
Kumari, P., & Kaur, P. (2018). A survey of fault tolerance in cloud computing. Journal of King Saud University-Computer and Information Sciences.
[16]
Kumari, P., & Kaur, P. (2020). Topology-aware virtual machine replication for fault tolerance in cloud computing systems. Multiagent and Grid Systems, 16(2), 193–206.
[17]
Liu, J., Wang, S., Zhou, A., Kumar, S. A., Yang, F., & Buyya, R. (2016). Using a proactive fault-tolerance approach to enhance cloud service reliability. IEEE Transactions on Cloud Computing, 6(4), 1191–1202.
[18]
Menychtas, A., & Konstanteli, K. G. (2012). Fault detection and recovery mechanisms and techniques for service-oriented infrastructures. In Achieving real-time in distributed computing: from grids to clouds (pp. 259–274). IGI Global.
[19]
Mohammed, B., Modu, B., Maiyama, K. M., Ugail, H., Awan, I., & Kiran, M. (2018). Failure analysis modeling in an infrastructure as a service (IaaS) environment. Electronic Notes in Theoretical Computer Science, 340, 41–54.
[20]
Monil, M. A. H., & Rahman, R. M. (2016). VM consolidation approach is based on heuristics, fuzzy logic, and migration control. Journal of Cloud Computing, 5(1), 8.
[21]
Patra, P. K., Singh, H., Singh, R., Das, S., Dey, N., & Victoria, A. D. C. (2016). Replication and resubmission based adaptive decision for fault tolerance in real-time cloud computing: A new approach. International Journal of Service Science, Management, Engineering, and Technology, 7(2), 46–60.
[22]
Prakash, S., & Vyas, V. (2022). Analysis of Fault Tolerance Techniques in Virtual Machine Environment. In ICT Analysis and Applications, (pp. 121-131). Springer.
[23]
Rawat, A., Sushil, R., Agarwal, A., Sikander, A., & Bhadoria, R. S. (2021). A new adaptive fault tolerant framework in the cloud. Journal of the Institution of Electronics and Telecommunication Engineers, 1–3.
[24]
Ray, B., Saha, A., Khatua, S., & Roy, S. (2020). Proactive Fault-Tolerance Technique to Enhance Reliability of Cloud Service in Cloud Federation Environment. IEEE Transactions on Cloud Computing. IEEE.
[25]
Rezaeipanah, A., Mojarad, M., & Fakhari, A. (2022). Providing a new approach to increase fault tolerance in cloud computing using fuzzy logic. International Journal of Computers and Applications, 44(2), 1–9.
[26]
Rong, H., Wang, H. M., Liu, J., & Xian, M. (2016). Privacy-preserving k-nearest neighbor computation in multiple cloud environments. IEEE Access: Practical Innovations, Open Solutions, 4, 9589–9603.
[27]
Saikia, L. P., & Devi, Y. L. (2014). Fault tolerance techniques and algorithms in cloud computing. International Journal of Computer Science & Communication Networks, 4(1), 01-08.
[28]
Sathiyamoorthi, V., Keerthika, P., Suresh, P., Zhang, Z. J., Rao, A. P., & Logeswaran, K. (2021). Adaptive fault tolerant resource allocation scheme for cloud computing environments. Journal of Organizational and End User Computing, 33(5), 135–152.
[29]
Sharma, S. (2017). Enhance data security in cloud computing using machine learning and hybrid cryptography techniques. International journal of advanced research in computer science, 8(9).
[30]
Srivastava, N. P., & Srivastava, R. K. (2014). Soft Computing Approaches To Fault-Tolerant Systems. International Journal of Advanced Networking and Applications, 5(6), 2096.
[31]
Sun, J., Du, W., & Shi, N. (2018). A Survey of kNN Algorithm. Information Engineering and Applied Computing.
[32]
Tran, D., Tran, N., Nguyen, G., & Nguyen, B. M. (2017). A proactive cloud scaling model based on fuzzy time series and SLA awareness. Procedia Computer Science, 108, 365–374.
[33]
Wu, Y., Peng, G., Wang, H., & Zhang, H. (2019). A two-stage fault tolerance method for large-scale manufacturing network. IEEE Access: Practical Innovations, Open Solutions, 7, 81574–81592.
[34]
Zhao, J., Xiang, Y., Lan, T., Huang, H. H., & Subramaniam, S. (2016). Elastic reliability optimization through peer-to-peer checkpointing in cloud computing. IEEE Transactions on Parallel and Distributed Systems, 28(2), 491–502.
[35]
Zhou, A., Sun, Q., & Li, J. (2017). Enhancing reliability via checkpointing in cloud computing systems. China Communications, 14(7), 1–10.
[36]
Zhou, A., Wang, S., Cheng, B., Zheng, Z., Yang, F., Chang, R. N., & Buyya, R. (2016). Cloud service reliability enhancement via virtual machine placement optimization. IEEE Transactions on Services Computing, 10(6), 902–913.

Cited By

View all
  • (2023)Resource-Aware Least Busy (RALB) Strategy for Load Balancing in Containerized Cloud SystemsInternational Journal of Cloud Applications and Computing10.4018/IJCAC.32809413:1(1-14)Online publication date: 11-Aug-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image International Journal of Cloud Applications and Computing
International Journal of Cloud Applications and Computing  Volume 13, Issue 1
Jan 2023
322 pages
ISSN:2156-1834
EISSN:2156-1826
Issue’s Table of Contents

Publisher

IGI Global

United States

Publication History

Published: 31 March 2023

Author Tags

  1. Checkpointing
  2. Classification
  3. Cloud Computing
  4. Fault Tolerance
  5. Fuzzy Logic
  6. Replication

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Resource-Aware Least Busy (RALB) Strategy for Load Balancing in Containerized Cloud SystemsInternational Journal of Cloud Applications and Computing10.4018/IJCAC.32809413:1(1-14)Online publication date: 11-Aug-2023

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media