Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Optimization of Fault-Tolerant Mixed-Criticality Multi-Core Systems with Enhanced WCRT Analysis

Published: 21 December 2018 Publication History

Abstract

This article proposes a novel optimization technique of fault-tolerant mixed-criticality multi-core systems with worst-case response time (WCRT) guarantees. Typically, in fault-tolerant multi-core systems, tasks can be replicated or re-executed in order to enhance the reliability. In addition, based on the policy of mixed-criticality scheduling, low-criticality tasks can be dropped at runtime. Such uncertainties caused by hardening and mixed-criticality scheduling make WCRT analysis very difficult. We show that previous analysis techniques are pessimistic as they consider avoidably extreme cases that can be safely ignored within the given reliability constraint. We improve the analysis in order to tighten the pessimism of WCRT estimates by considering the maximum number of faults to be tolerated. Further, we improve the mixed-criticality scheduling by allowing partial dropping of low-criticality tasks. On top of those, we explore the design space of hardening, task-to-core mapping, and quality-of-service of the multi-core mixed-criticality systems. The effectiveness of the proposed technique is verified by extensive experiments with synthetic and real-life benchmarks.

References

[1]
Edward H. Adelson, Charles H. Anderson, James R. Bergen, Peter J. Burt, and Joan M. Ogden. 1984. Pyramid methods in image processing. RCA Engineer 29, 6 (1984), 33--41.
[2]
Z. Al-bayati, J. Caplan, B. H. Meyer, and H. Zeng. 2016. A four-mode model for efficient fault-tolerant mixed-criticality systems. In Proceedings of the 2016 Design, Automation Test in Europe Conference Exhibition (DATE). 97--102.
[3]
Hyung-Chan An, Hoeseok Yang, and Soonhoi Ha. 2016. A formal approach to power optimization in CPSs with delay-workload dependence awareness. Trans. Comp.-Aided Des. Integ. Cir. Sys. 35, 5 (May 2016), 750--763.
[4]
Benjamin Arai, Gautam Das, Dimitrios Gunopulos, and Nick Koudas. 2007. Anytime measures for top-k algorithms. In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB’07). VLDB Endowment, 914--925.
[5]
Philip Axer, Maurice Sebastian, and Rolf Ernst. 2011. Reliability analysis for MPSoCs with mixed-critical, hard real-time constraints. In Proceedings of the 7th IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis. ACM, 149--158.
[6]
Rabeh Ayari, Imane Hafnaoui, Giovanni Beltrame, and Gabriela Nicolescu. 2016. Schedulability-guided exploration of multi-core systems. In Proceedings of the 27th International Symposium on Rapid System Prototyping: Shortening the Path from Specification to Prototype. ACM, 121--127.
[7]
S. Baruah and B. Chattopadhyay. 2013. Response-time analysis of mixed criticality systems with pessimistic frequency specification. In Proceedings of the 2013 IEEE 19th International Conference on Embedded and Real-Time Computing Systems and Applications. 237--246.
[8]
S. Baruah and Z. Guo. 2014. Scheduling mixed-criticality implicit-deadline sporadic task systems upon a varying-speed processor. In Proceedings of the 2014 IEEE Real-Time Systems Symposium. 31--40.
[9]
Sanjoy K. Baruah, Vincenzo Bonifaci, Gianlorenzo D’Angelo, Alberto Marchetti-Spaccamela, Suzanne Van Der Ster, and Leen Stougie. 2011. Mixed-criticality scheduling of sporadic task systems. In Proceedings of the European Symposium on Algorithms. Springer, 555--566.
[10]
S. K. Baruah, A. Burns, and R. I. Davis. 2011. Response-time analysis for mixed criticality systems. In Proceedings of the 2011 IEEE 32nd Real-Time Systems Symposium (RTSS’11). IEEE Computer Society, Washington, D.C., 34--43.
[11]
Cristiana Bolchini and Antonio Miele. 2013. Reliability-driven system-level synthesis for mixed-critical embedded systems. IEEE Trans. Comput. 62, 12 (2013), 2489--2502.
[12]
Alan Burns and Robert Davis. 2013. Mixed criticality systems—A review. Department of Computer Science, University of York, Tech. Report.
[13]
A. Burns, R. Davis, and S. Punnekkat. 1996. Feasibility analysis of fault-tolerant real-time task sets. In Proceedings of the 8th Euromicro Workshop on Real-Time Systems. 29--33.
[14]
Thidapat Chantem, X. Sharon Hu, and Robert P. Dick. 2011. Temperature-aware scheduling and assignment for hard real-time applications on MPSoCs. IEEE Trans. Very Large Scale Integration (VLSI) Syst. 19, 10 (2011), 1884--1897.
[15]
Junchul Choi and Soonhoi Ha. 2017. Worst-case response time analysis of a synchronous dataflow graph in a multiprocessor system with real-time tasks. ACM Trans. Des. Autom. Electron. Syst. 22, 2, Article 36 (Jan. 2017), 26 pages.
[16]
Cristian Constantinescu. 2003. Trends and challenges in VLSI circuit reliability. IEEE Micro 23, 4 (2003), 14--19.
[17]
M. Damschen, L. Bauer, and J. Henkel. 2017. Timing analysis of tasks on runtime reconfigurable processors. IEEE Trans. Very Large Scale Integration (VLSI) Syst. 25, 1 (Jan. 2017), 294--307.
[18]
K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evolutionary Comput. 6, 2 (Apr. 2002), 182--197.
[19]
G. Madl et al. 2006. Tutorial for the open-source dream tool. In Univ. California, Irvine, CA, CECS Tech. Rep.
[20]
International Organization for Standardization. 2011. International Standard 26262: Road Vehicles Functional Safety.
[21]
H. Gall. 2008. Functional safety IEC 61508 / IEC 61511 The impact to certification and the user. In Proceedings of the 2008 IEEE/ACS International Conference on Computer Systems and Applications. 1027--1031.
[22]
Arnaud Giacometti and Arnaud Soulet. 2016. Anytime algorithm for frequent pattern outlier detection. International J. Data Sci. Anal. 2, 3 (Dec 2016), 119--130.
[23]
Rafik Henia, Arne Hamann, Marek Jersak, Razvan Racu, Kai Richter, and Rolf Ernst. 2005. System level performance analysis—The SymTA/S approach. IEE Proceedings—Computers and Digital Techniques 152, 2 (2005), 148--166.
[24]
Biao Hu, Kai Huang, Pengcheng Huang, Lothar Thiele, and Alois Knoll. 2016. On-the-fly fast overrun budgeting for mixed-criticality systems. In Proceedings of the 2016 International Conference on Embedded Software (EMSOFT). IEEE, 1--10.
[25]
Pengcheng Huang, Hoeseok Yang, and Lothar Thiele. 2014. On the scheduling of fault-tolerant mixed-criticality systems. In Proceedings of the 51st Annual Design Automation Conference (DAC’14). ACM, New York, NY, Article 131, 6 pages.
[26]
V. Izosimov, P. Pop, P. Eles, and Z. Peng. 2005. Design optimization of time- and cost-constrained fault-tolerant distributed embedded systems. In Design, Automation and Test in Europe, Vol. 2. 864--869.
[27]
M. Jan, L. Zaourar, and M. Pitel. 2013. Maximizing the execution rate of low criticality tasks in mixed criticality system. In Proc. WMC RTSS. 43--48.
[28]
A. Jhumka, S. Klaus, and S. A. Huss. 2005. A dependability-driven system-level design approach for embedded systems. In Design, Automation and Test in Europe, Vol. 1. 372--377.
[29]
Nagarajan Kandasamy, John P. Hayes, and Brian T. Murray. 2005. Dependable communication synthesis for distributed embedded systems. Reliability Engineering 8 System Safety 89, 1 (2005), 81--92. Safety, Reliability and Security of Industrial Computer Systems.
[30]
Shin-haeng Kang, Hoeseok Yang, Sungchan Kim, Iuliana Bacivarov, Soonhoi Ha, and Lothar Thiele. 2014. Static mapping of mixed-critical applications for fault-tolerant MPSoCs. In Proceedings of the 51st Annual Design Automation Conference (DAC’14). ACM, New York, NY, Article 31, 6 pages.
[31]
Jinwoo Kim, Hyunok Oh, Junchul Choi, Hyojin Ha, and Soonhoi Ha. 2013. A novel analytical method for worst case response time estimation of distributed embedded systems. In 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC). 1--10.
[32]
H. Kopetz, H. Kantz, G. Grunsteidl, P. Puschner, and J. Reisinger. 1990. Tolerating transient faults in MARS. In 1990 Digest of Papers. 20th International Symposium on Fault-Tolerant Computing: 20th International Symposium. 466--473.
[33]
Kai Lampka, Simon Perathoner, and Lothar Thiele. 2010. Analytic real-time analysis and timed automata: A hybrid methodology for the performance analysis of embedded real-time systems. Design Autom. Embedded Syst. 14, 3 (2010), 193--227.
[34]
J. Lin, A. M. K. Cheng, D. Steel, and M. Y.-C. Wu. 2014. Scheduling mixed-criticality real-time tasks with fault tolerance. In Proceedings 2nd Workshop Mixed Criticality Conjunction IEEE RTSS.
[35]
Martin Lukasiewycz, Michael Glaß, Felix Reimann, and Jürgen Teich. 2011. Opt4J: A modular framework for meta-heuristic optimization. In Proceedings of the 13th Annual Conference on Genetic and Evolutionary Computation (GECCO’11). ACM, New York, NY, 1723--1730.
[36]
Nahmsuk Oh, Philip P. Shirvani, and Edward J. McCluskey. 2002. Control-flow checking by software signatures. IEEE Transactions on Reliability 51, 1 (2002), 111--122.
[37]
Yash Vardhan Pant, Houssam Abbas, Kartik Mohta, Truong X. Nghiem, Joseph Devietti, and Rahul Mangharam. 2015. Co-design of anytime computation and robust control. In Real-Time Systems Symposium, 2015 IEEE. IEEE, 43--52.
[38]
P. Shivakumar, M. Kistler, S. W. Keckler, D. Burger, and L. Alvisi. 2002. Modeling the effect of technology trends on the soft error rate of combinational logic. In Proceedings International Conference on Dependable Systems and Networks. 389--398.
[39]
A. Thekkilakattil, R. Dobrin, and S. Punnekkat. 2014. Mixed criticality scheduling in fault-tolerant distributed real-time systems. In 2014 International Conference on Embedded Systems (ICES). 92--97.
[40]
Abhilash Thekkilakattil, Radu Dobrin, and Sasikumar Punnekkat. 2015. Fault tolerant scheduling of mixed criticality real-time tasks under error bursts. Procedia Computer Science 46 (2015), 1148--1155.
[41]
Steve Vestal. 2007. Preemptive scheduling of multi-criticality systems with varying degrees of execution time assurance. In Proceedings of the 28th IEEE International Real-Time Systems Symposium (RTSS’07). IEEE Computer Society, Washington, D.C., 239--243.
[42]
Yi Xiang and Sudeep Pasricha. 2014. A hybrid framework for application allocation and scheduling in multicore systems with energy harvesting. In Proceedings of the 24th Edition of the Great Lakes Symposium on VLSI. ACM, 163--168.
[43]
Hoeseok Yang, Sungchan Kim, and Soonhoi Ha. 2010. An MILP-based performance analysis technique for non-preemptive multitasking MPSoC. IEEE Transactions on Computer-aided Design of Integrated Circuits and Systems 29, 10 (2010), 1600--1613.
[44]
Luyuan Zeng, Pengcheng Huang, and Lothar Thiele. 2016. Towards the design of fault-tolerant mixed-criticality systems on multicores. In Proceedings of the International Conference on Compilers, Architectures and Synthesis for Embedded Systems. ACM, 6.
[45]
Junlong Zhou, Min Yin, Zhifang Li, Kun Cao, Jianming Yan, Tongquan Wei, Mingsong Chen, and Xin Fu. 2017. Fault-tolerant task scheduling for mixed-criticality real-time systems. Journal of Circuits, Systems and Computers 26, 01 (2017), 1750016.
[46]
Dakai Zhu, Rami Melhem, and Daniel Mossé. 2004. The effects of energy management on reliability in real-time embedded systems. In 2004 IEEE/ACM International Conference on Computer Aided Design (ICCAD’04). IEEE, 35--40.
[47]
Shlomo Zilberstein. 1996. Using anytime algorithms in intelligent systems. 17 (Sept. 1996), 73--83.
[48]
Eckart Zitzler, Marco Laumanns, and Lothar Thiele. 2001. SPEA2: Improving the strength pareto evolutionary algorithm. In EUROGEN 2001. Evolutionary Methods for Design, Optimization and Control With Applications to Industrial Problems.

Cited By

View all
  • (2023)SACREDInternational Journal of e-Collaboration10.4018/IJeC.31578219:1(1-20)Online publication date: 6-Jan-2023
  • (2023)Utilization and criticality based fault-tolerant scheduling in multicore mixed critical systemsInternational Journal of Pervasive Computing and Communications10.1108/IJPCC-06-2022-024820:1(126-146)Online publication date: 7-Mar-2023
  • (2023)Preliminaries and Related WorkQuality-of-Service Aware Design and Management of Embedded Mixed-Criticality Systems10.1007/978-3-031-38960-3_2(17-36)Online publication date: 24-Jul-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Design Automation of Electronic Systems
ACM Transactions on Design Automation of Electronic Systems  Volume 24, Issue 1
January 2019
309 pages
ISSN:1084-4309
EISSN:1557-7309
DOI:10.1145/3293467
  • Editor:
  • Naehyuck Chang
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 21 December 2018
Accepted: 01 August 2018
Revised: 01 August 2018
Received: 01 April 2018
Published in TODAES Volume 24, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Fault-tolerance
  2. mixed-criticality
  3. real-time systems
  4. scheduling
  5. worst-case analysis

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)1
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)SACREDInternational Journal of e-Collaboration10.4018/IJeC.31578219:1(1-20)Online publication date: 6-Jan-2023
  • (2023)Utilization and criticality based fault-tolerant scheduling in multicore mixed critical systemsInternational Journal of Pervasive Computing and Communications10.1108/IJPCC-06-2022-024820:1(126-146)Online publication date: 7-Mar-2023
  • (2023)Preliminaries and Related WorkQuality-of-Service Aware Design and Management of Embedded Mixed-Criticality Systems10.1007/978-3-031-38960-3_2(17-36)Online publication date: 24-Jul-2023
  • (2022)Toward the Design of Fault-Tolerance-Aware and Peak-Power-Aware Multicore Mixed-Criticality SystemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2021.308249541:5(1509-1522)Online publication date: May-2022
  • (2022)A Mixed-Criticality Approach to Fault Tolerance: Integrating Schedulability and Failure Requirements2022 IEEE 28th Real-Time and Embedded Technology and Applications Symposium (RTAS)10.1109/RTAS54340.2022.00011(27-39)Online publication date: May-2022
  • (2022)Peak-Power Aware Life-Time Reliability Improvement in Fault-Tolerant Mixed-Criticality SystemsIEEE Open Journal of Circuits and Systems10.1109/OJCAS.2022.32075983(199-215)Online publication date: 2022
  • (2022)ARMS: An Analysis Framework for Mixed Criticality Systems2022 IEEE 1st International Conference on Data, Decision and Systems (ICDDS)10.1109/ICDDS56399.2022.10037556(1-6)Online publication date: 2-Dec-2022
  • (2021)Reliability-Aware Resource Management in Multi-/Many-Core Systems: A Perspective PaperJournal of Low Power Electronics and Applications10.3390/jlpea1101000711:1(7)Online publication date: 25-Jan-2021

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media