Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Job completion time on a virtualized server with software rejuvenation

Published: 13 January 2014 Publication History

Abstract

This article analyzes the completion time of a job running on a virtualized server subject to software aging and rejuvenation in a virtual machine monitor (VMM). A job running on the server may be interrupted by virtual machine (VM) failure, VMM failure or VMM rejuvenation. The job interruption is categorized as either preemptive-repeat (prt), in which case the interrupted job needs to restart from the beginning, or preemptive-resume (prs), in which case the job resumes execution from the point of interruption. Using a semi-Markov process (SMP) to model the server behavior, the steady-state server availability is computed and the theory developed in Kulkarni et al. [1987] is used to obtain the Laplace-Stieltjes transform (LST) of the job completion time. In the numerical experiments, we introduce four types of aging behavior of VMM. The effectiveness of VMM rejuvenation on job completion time is discussed in association with the type of interruption it causes and the VMM aging type. With our parameter settings, VMM rejuvenation with prs job interruption improves the performance of job execution regardless of the aging type, with performance degradation is taken into account.

References

[1]
Bao, Y., Sun, X., and Trivedi, K. S. 2005. A workload-based analysis of software aging and rejuvenation. IEEE Trans. Reliab. 54, 4, 541--548.
[2]
Chimento, P. F. and Trivedi, K. S. 1993. The completion time of programs on processors subject to failure and repair. IEEE Trans. Computer. 42, 10, 1184--1194.
[3]
Ciardo, G., Muppala, J. K., and Trivedi, K. S. 1989. SPNP: Stochastic Petri net package. In Proceedings of the International Workshop on Petri Nets and Performance Models. 142--151.
[4]
Durbin, F. 1974. Numerical inversion of Laplace transforms: an efficient improvement to Durbner and Abate's method. Comput. J. 17, 4, 371--376.
[5]
Dohi, T., Goseva-Popstojanova, K., and Trivedi, K. S. 2001. Estimating software rejuvenation schedule in high assurance systems. Comput. J. 44, 6, 473--485.
[6]
Garg, S., Huang, Y., Kintala, C., and Trivedi, K. S. 1996. Minimizing completion time of a program by checkpointing and rejuvenation. In Proceedings of the 1996 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 252--261.
[7]
Grottke, M., Li, L., Vaidyanathan, K., and Trivedi, K. S. 2006. Analysis of software aging in a web server. IEEE Trans. Reliab. 55, 3, 411--420.
[8]
Grottke, M., Nikora, A. P., and Trivedi, K. S. 2010. An empirical investigation of fault types in space mission system software. In Proceedings of the 2010 IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'10). 447--456.
[9]
Huang, Y., Kintala, C., Kolettis, N., and Fulton, N. D. 1995. Software rejuvenation: Analysis, module and applications. In Proceedings of the International Symposium on Fault Tolerant Computing (FTCS'95). 381--390.
[10]
Kourai, K. and Chiba, S. 2007. A fast rejuvenation technique for server consolidation with virtual machines, In Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07). 245--255.
[11]
Kulkarni, V. G., Nicola, V. F., and Trivedi, K. S. 1987. The completion time of a job on multimode systems. Ad. Appli. Probab. 19, 4, 932--954.
[12]
Machida, F., Kim, D. S., and Trivedi, K. S. 2010. Modeling and analysis of software rejuvenation in a server virtualized system. In Proceedings of the 2nd International Workshop on Software Aging and Rejuvenation (WoSAR'10). 1--6.
[13]
Machida, F., Nicola, V. F., and Trivedi, K. S. 2011a. Job completion time on a virtualized server subject to software aging and rejuvenation. In Proceedings of the 3rd International Workshop on Software Aging and Rejuvenation (WoSAR'11). 44--49.
[14]
Machida, F., Andrade, E., Kim, D. S., and Trivedi, K. S. 2011b. Candy: Component-based availability modeling framework for cloud service management using SysML. In Proceedings of the International Symposium on Reliable Distributed Systems (SRDS'11). 209--218.
[15]
Mallet. A. 2000. Numerical Inversion of Laplace Transform. In Wolfram Library Archive. http://library. wolfram.com/infocenter/MathSource/2691/.
[16]
Matias, Jr., R., Trivedi, K. S., and Martins Maciel, P. R. 2010. Using accelerated life tests to estimate time to software aging failure. In Proceedings of the IEEE 21st International Symposium on Software Reliability Engineering (ISSRE'10). 211--219.
[17]
Nicola, V. F., Kulkarni, V. G., and Trivedi, K. S. 1987. Queueing analysis of fault-tolerant computer systems. IEEE Trans. Softw. Eng. 13, 3, 363--375.
[18]
Okamura, H. and Dohi, T. 2008. Analysis of a software system with rejuvenation, restoration and checkpointing. In Proceedings of the International Service Availability Symposium (ISAS'08). 110--128.
[19]
Trivedi, K. S. 2001. Probability and Statistics with Reliability, Queuing, and Computer Science Applications 2nd Ed. Wiley.
[20]
Vaidyanathan, K. and Trivedi, K. S. 2005. A comprehensive model for software rejuvenation. IEEE Trans. Depend. Secure Comput. 2, 2, 124--137.
[21]
Zhao, J., Jin, Y., Trivedi, K. S., and Matias Jr., R. 2011. Injecting memory leaks to accelerate software failures, In Proceedings of the IEEE 22nd International Symposium on Software Reliability Engineering (ISSRE'11). 260--269.

Cited By

View all
  • (2024)A Compositional Approach to Coordinated Software Rejuvenation of Component-Based Systems2024 IEEE 35th International Symposium on Software Reliability Engineering (ISSRE)10.1109/ISSRE62328.2024.00062(593-604)Online publication date: 28-Oct-2024
  • (2024)PMTT: Parallel multi-scale temporal convolution network and transformer for predicting the time to aging failure of software systemsJournal of Systems and Software10.1016/j.jss.2024.112167217(112167)Online publication date: Nov-2024
  • (2024)TTAFPred: Prediction of time to aging failure for software systems based on a two-stream multi-scale features fusion networkSoftware Quality Journal10.1007/s11219-024-09692-232:4(1481-1513)Online publication date: 22-Jul-2024
  • Show More Cited By

Index Terms

  1. Job completion time on a virtualized server with software rejuvenation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Journal on Emerging Technologies in Computing Systems
    ACM Journal on Emerging Technologies in Computing Systems  Volume 10, Issue 1
    Special Issue on Reliability and Device Degradation in Emerging Technologies and Special Issue on WoSAR 2011
    January 2014
    210 pages
    ISSN:1550-4832
    EISSN:1550-4840
    DOI:10.1145/2543749
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Journal Family

    Publication History

    Published: 13 January 2014
    Accepted: 01 November 2012
    Revised: 01 September 2012
    Received: 01 April 2012
    Published in JETC Volume 10, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Job completion time
    2. rejuvenation
    3. semi-Markov process
    4. server virtualization
    5. software aging

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Compositional Approach to Coordinated Software Rejuvenation of Component-Based Systems2024 IEEE 35th International Symposium on Software Reliability Engineering (ISSRE)10.1109/ISSRE62328.2024.00062(593-604)Online publication date: 28-Oct-2024
    • (2024)PMTT: Parallel multi-scale temporal convolution network and transformer for predicting the time to aging failure of software systemsJournal of Systems and Software10.1016/j.jss.2024.112167217(112167)Online publication date: Nov-2024
    • (2024)TTAFPred: Prediction of time to aging failure for software systems based on a two-stream multi-scale features fusion networkSoftware Quality Journal10.1007/s11219-024-09692-232:4(1481-1513)Online publication date: 22-Jul-2024
    • (2023)A Comparative Analysis of Software Aging in Image Classifiers on Cloud and EdgeIEEE Transactions on Dependable and Secure Computing10.1109/TDSC.2021.313920120:1(563-573)Online publication date: 1-Jan-2023
    • (2022)Cloud Video System Availability Assessment Using Markov and Semi-Markov Models2022 IEEE 16th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)10.1109/TCSET55632.2022.9767064(578-583)Online publication date: 22-Feb-2022
    • (2022)An Empirical Study on Software Aging of Long-Running Object Detection Algorithms2022 IEEE 22nd International Conference on Software Quality, Reliability and Security (QRS)10.1109/QRS57517.2022.00112(1091-1102)Online publication date: Dec-2022
    • (2022)A Markov Regenerative Model of Software Rejuvenation Beyond the Enabling Restriction2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)10.1109/ISSREW55968.2022.00060(138-145)Online publication date: Oct-2022
    • (2022)The Impact of Software Aging and Rejuvenation on the User Experience for Android System2022 IEEE 33rd International Symposium on Software Reliability Engineering (ISSRE)10.1109/ISSRE55969.2022.00049(435-445)Online publication date: Oct-2022
    • (2022)Model-Based Performability and Dependability Evaluation of a System with VM Migration as Rejuvenation in the Presence of Bursty WorkloadsJournal of Network and Systems Management10.1007/s10922-021-09619-330:1Online publication date: 1-Jan-2022
    • (2021)Software Testing Strategies for Detecting Hypercall Handlers' Aging-related Bugs2021 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)10.1109/ISSREW53611.2021.00043(48-55)Online publication date: Oct-2021
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media