Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Shared recovery for energy efficiency and reliability enhancements in real-time applications with precedence constraints

Published: 11 April 2013 Publication History
  • Get Citation Alerts
  • Abstract

    While Dynamic Voltage Scaling (DVS) remains as a popular energy management technique for modern computing systems, recent research has identified significant and negative impacts of voltage scaling on system reliability. To preserve system reliability under DVS settings, a number of reliability-aware power management (RA-PM) schemes have been recently studied. However, the existing RA-PM schemes normally schedule a separate recovery for each task whose execution is scaled down and are rather conservative. To overcome such conservativeness, we study in this article novel RA-PM schemes based on the shared recovery (SHR) technique. Specifically, we consider a set of frame-based real-time tasks with individual deadlines and a common period where the precedence constraints are represented by a directed acyclic graph (DAG). We first show that the earliest deadline first (EDF) algorithm can always yield a schedule where all timing and precedence constraints are met by considering the effective deadlines of tasks derived from as late as possible (ALAP) policy, provided that the task set is feasible. Then, we propose a shared recovery based frequency assignment technique (namely SHR-DAG) and prove its optimality to minimize energy consumption while preserving the system reliability. To exploit additional slack that arises from early completion of tasks, we also study a dynamic extension for SHR-DAG to improve energy efficiency and system reliability at runtime. The results from our extensive simulations show that, compared to the existing RA-PM schemes, SHR-DAG can achieve up to 35% energy savings, which is very close to the maximum achievable energy savings. More interestingly, our extensive evaluation also indicates that the new schemes offer non-trivial improvements on system reliability over the existing RA-PM schemes as well.

    References

    [1]
    Acharya, S. and Mahapatra, R. 2008. A dynamic slack management technique for real-time distributed embedded systems. IEEE Trans. Comput. 57, 2, 215--230.
    [2]
    Aydin, H., Devadas, V., and Zhu, D. 2006. System-level energy management for periodic real-time tasks. In Proceedings of the IEEE Real-Time Systems Symposium.
    [3]
    Aydin, H., Melhem, R., and Mossé, D. 1999. Incorporating error recovery into the imprecise computation model. In Proceedings of the IEEE International Conference on Real-Time Computing Systems and Applications.
    [4]
    Aydin, H., Melhem, R., Mossé, D., and Mejia-Alvarez, P. 2004. Power-aware scheduling for periodic real-time tasks. IEEE Trans. Comput. 53, 5, 584--600.
    [5]
    Baruah, S. K., Howell, R. R., and Rosier, L. 1990. Algorithms and complexity concerning the preemptive scheduling of periodic, real-time tasks on one processor. Real-Time Syst. 2, 301--324.
    [6]
    Benoit, A., Canon, L.-C., Jeannot, E., and Robert, Y. 2011. Reliability of task graph schedules with transient and fail-stop failures: complexity and algorithms. J. Scheduling, 1--13.
    [7]
    Bini, E. and Buttazzo, G. C. 2005. Measuring the performance of schedulability tests. Real-Time Syst. 30, 1--2.
    [8]
    Buttazzo, G. C. 2004. Hard Real-Time Computing Systems: Predictable Scheduling Algorithms and Applications. Real-Time Systems Series. Springer.
    [9]
    Caccamo, M. and Buttazzo, G. 1998. Optimal scheduling for fault-tolerant and firm real-time systems. In Proceedings of the IEEE International Conference on Real-Time Computing Systems and Applications.
    [10]
    Castillo, X., McConnel, S., and Siewiorek, D. 1982. Derivation and calibration of a transient error reliability model. IEEE Trans. Comput. 31, 7, 658--671.
    [11]
    Chandra, V. and Aitken, R. 2008. Impact of technology and voltage scaling on the soft error susceptibility in nanoscale cmos. In Proceedings of the IEEE International Symposium on Defect and Fault Tolerance of VLSI Systems.
    [12]
    Dabiri, F., Amini, N., Rofouei, M., and Sarrafzadeh, M. 2008. Reliability-aware optimization for DVS-enabled real-time embedded systems. In Proceedings of the International Symposium on Quality of Electronic Design.
    [13]
    Dick, R., Rhodes, D. L., and Wolf, W. 1998. Tgff: Task graphs for free. In Proceedings of the 6th International Workshop on Hardware/Software Co-design.
    [14]
    Ejlali, A., Al-Hashimi, B. M., and Eles, P. 2009. A standby-sparing technique with low energy- overhead for fault-tolerant hard real-time systems. In Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis.
    [15]
    Ejlali, A., Schmitz, M. T., Al-Hashimi, B. M., Miremadi, S. G., and Rosinger, P. 2005. Energy efficient SEU-tolerance in DVS-enabled real-time systems through information redundancy. In Proceedings of the International Symposium on Low Power and Electronics and Design.
    [16]
    Elnozahy, E. M., Kistler, M., and Rajamony, R. 2002. Energy-efficient server clusters. In Proceedings of the Workshop on Power Aware Computer Systems.
    [17]
    Ernst, D., Das, S., Lee, S., Blaauw, D., Austin, T., Mudge, T., Kim, N. S., and Flautner, K. 2004. Razor: Circuit-level correction of timing errors for low-power operation. IEEE Micro 24, 6, 10--20.
    [18]
    Girault, A. and Kalla, H. 2009. A novel bicriteria scheduling heuristics providing a guaranteed global system failure rate. IEEE Trans. Dependable Secure Comput. 6, 4, 241--254.
    [19]
    Hazucha, P. and Svensson, C. 2000. Impact of cmos technology scaling on the atmospheric neutron soft error rate. IEEE Trans. Nucl. Sci. 47, 6, 2586--2594.
    [20]
    Iyer, R., Rossetti, D., and Hsueh, M. 1986. Measurement and modeling of computer reliability as affected by system activity. ACM Trans. Comput. Syst. 4, 3, 214--237.
    [21]
    Izosimov, V., Eles, P., and Peng, Z. 2010. Value-based scheduling of distributed fault-tolerant real-time systems with soft and hard timing constraints. In Proceedings of the 8th IEEE Workshop on Embedded Systems for Real-Time Multimedia.
    [22]
    Jejurikar, R. and Gupta, R. 2004. Dynamic voltage scaling for systemwide energy minimization in real-time embedded systems. In Proceedings of the International Symposium on Low Power Electronics and Design.
    [23]
    Kung, S., Whitehouse, H., and Kailath, T. 1985. VLSI and Modern Signal Processing. Prentice-Hall, Inc., Englewood Cliffs, NJ.
    [24]
    Liu, Y., Liang, H., and Wu, K. 2010. Scheduling for energy efficiency and fault tolerance in hard real-time systems. In Proceedings of the Design, Automation and Test in Europe Conference and Exhibition.
    [25]
    Luenberger, D. 1984. Linear and Nonlinear Programming. Addison-Wesley, Reading MA.
    [26]
    Mejia-Alvarez, P., Aydin, H., Mossé, D., and Melhem, R. 2000. Scheduling optional computations in fault-tolerant real-time systems. In Proceedings of the IEEE International Conference on Real-Time Computing Systems and Applications.
    [27]
    Melhem, R., Mossé, D., and Elnozahy, E. 2004. The interplay of power management and fault recovery in real-time systems. IEEE Trans. Comput. 53, 2, 217--231.
    [28]
    Pillai, P. and Shin, K. G. 2001. Real-time dynamic voltage scaling for lowpower embedded operating systems. In Proceedings of the ACM Symposium on Operating Systems Principles.
    [29]
    Pop, P., Poulsen, K., Izosimov, V., and Eles, P. 2007. Scheduling and voltage scaling for energy/reliability trade-offs in fault-tolerant time-triggered embedded systems. In Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis.
    [30]
    Pradhan, D. K. 1996. Fault-Tolerant Computer System Design. Prentice-Hall, Inc., Upper Saddle River, NJ.
    [31]
    Quan, G. and Hu, X. 2007. Energy efficient dvs schedule for fixed-priority real-time systems. ACM Trans. Embed. Comput. Syst. 6, 4, 1--30.
    [32]
    Wang, Y., Liu, D., Wang, M., Qin, Z., and Shao, Z. 2010. Optimal task scheduling by removing inter-core communication overhead for streaming applications on mpsocs. In Proceedings of the IEEE Real-Time and Embedded Technology and Applications Symposium.
    [33]
    Wei, T., Mishra, P., Wu, K., and Liang, H. 2006. Online task-scheduling for fault-tolerant low-energy real-time systems. In Proceedings of IEEE/ACM International Conference on Computer-Aided Design.
    [34]
    Weiser, M., Welch, B., Demers, A., and Shenker, S. 1994. Scheduling for reduced cpu energy. In Proceedings of the USENIX Conference on Operating Systems Design and Implementation.
    [35]
    Yao, F., Demers, A., and Shenker, S. 1995. A scheduling model for reduced cpu energy. In Proceedings of the Annual Symposium on Foundations of Computer Science.
    [36]
    Zhang, Y. and Chakrabarty, K. 2003. Energy-aware adaptive checkpointing in embedded real-time systems. In Proceedings of the Conference on Design, Automation and Test in Europe.
    [37]
    Zhao, B., Aydin, H., and Zhu, D. 2008. Reliability-aware dynamic voltage scaling for energy-constrained real-time embedded systems. In Proceedings of the International Conference on Computer Design.
    [38]
    Zhao, B., Aydin, H., and Zhu, D. 2011. Generalized reliability-oriented energy management for real-time embedded applications. In Proceedings of the Design Automation Conference.
    [39]
    Zhao, B., Zhu, D., and Aydin, H. 2009. Enhanced reliability-aware power management through shared recovery technique. In Proceedings of the IEEE/ACM International Conference on Computer Aided Design.
    [40]
    Zhu, D. 2006. Reliability-aware dynamic energy management in dependable embedded real-time systems. In Proceedings of the IEEE Real-Time and Embedded Technology and Applications Symposium.
    [41]
    Zhu, D. and Aydin, H. 2006. Energy management for real-time embedded systems with reliability requirements. In Proceedings of the International Conference on Computer Aided Design.
    [42]
    Zhu, D. and Aydin, H. 2009. Reliability-aware energy management for periodic real-time tasks. IEEE Trans. Comput. 58, 10, 1382--1397.
    [43]
    Zhu, D., Melhem, R., and Mossé, D. 2004. The effects of energy management on reliability in real-time embedded systems. In Proceedings of the International Conference on Computer Aided Design.
    [44]
    Zhuo, J. and Chakrabarti, C. 2005. System-level energy-efficient dynamic task scheduling. In Proceedings of the Annual Conference on Design Automation.
    [45]
    Ziegler, J. F. 2004. Trends in electronic reliability: Effects of terrestrial cosmic rays. http://www.srim.org/SER/SERTrends.htm.

    Cited By

    View all
    • (2024)Online Energy-Aware Scheduling for Deadline-Constrained Applications in Distributed Heterogeneous SystemsInternational Journal of Aerospace Engineering10.1155/2024/21228952024(1-14)Online publication date: 2-May-2024
    • (2024)Reliability Enhancement Strategies for Workflow Scheduling Under Energy Consumption Constraints in CloudsIEEE Transactions on Sustainable Computing10.1109/TSUSC.2023.33147599:2(155-169)Online publication date: Mar-2024
    • (2024)Binary Search-Based Fast Scheduling Algorithms for Reliability-Aware Energy-Efficient Task Graph Scheduling With Fault ToleranceIEEE Transactions on Sustainable Computing10.1109/TSUSC.2023.32959399:3(433-451)Online publication date: May-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Design Automation of Electronic Systems
    ACM Transactions on Design Automation of Electronic Systems  Volume 18, Issue 2
    March 2013
    429 pages
    ISSN:1084-4309
    EISSN:1557-7309
    DOI:10.1145/2442087
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Journal Family

    Publication History

    Published: 11 April 2013
    Accepted: 01 November 2012
    Revised: 01 August 2012
    Received: 01 December 2011
    Published in TODAES Volume 18, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. DVS
    2. real-time systems
    3. reliability-aware power management

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Online Energy-Aware Scheduling for Deadline-Constrained Applications in Distributed Heterogeneous SystemsInternational Journal of Aerospace Engineering10.1155/2024/21228952024(1-14)Online publication date: 2-May-2024
    • (2024)Reliability Enhancement Strategies for Workflow Scheduling Under Energy Consumption Constraints in CloudsIEEE Transactions on Sustainable Computing10.1109/TSUSC.2023.33147599:2(155-169)Online publication date: Mar-2024
    • (2024)Binary Search-Based Fast Scheduling Algorithms for Reliability-Aware Energy-Efficient Task Graph Scheduling With Fault ToleranceIEEE Transactions on Sustainable Computing10.1109/TSUSC.2023.32959399:3(433-451)Online publication date: May-2024
    • (2024) Reliability-aware scheduling for -firm real-time embedded systems under hard energy budget constraint Journal of Systems Architecture10.1016/j.sysarc.2024.103185154(103185)Online publication date: Sep-2024
    • (2023)Software Fault Tolerance in Real-Time Systems: Identifying the Future Research QuestionsACM Computing Surveys10.1145/358995055:14s(1-30)Online publication date: 17-Jul-2023
    • (2023)Power-Efficient and Aging-Aware Primary/Backup Technique for Heterogeneous Embedded SystemsIEEE Transactions on Sustainable Computing10.1109/TSUSC.2023.32821648:4(715-726)Online publication date: Oct-2023
    • (2023)Energy-Aware Real-Time Scheduling of Multiple Periodic DAGs on Heterogeneous SystemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2022.322850442:8(2447-2460)Online publication date: Aug-2023
    • (2023)Temperature-Constrained Reliability Optimization of Industrial Cyber-Physical Systems Using Machine Learning and Feedback ControlIEEE Transactions on Automation Science and Engineering10.1109/TASE.2021.306240820:1(20-31)Online publication date: Jan-2023
    • (2023)FTSC: Fault-tolerant scheduling and control co-design for distributed real-time systemJournal of Systems Architecture10.1016/j.sysarc.2023.102934142(102934)Online publication date: Sep-2023
    • (2023)Online energy-efficient scheduling of DAG tasks on heterogeneous embedded platformsJournal of Systems Architecture10.1016/j.sysarc.2023.102894140(102894)Online publication date: Jul-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media