Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2039370.2039396acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
research-article

Reliability analysis for MPSoCs with mixed-critical, hard real-time constraints

Published: 09 October 2011 Publication History

Abstract

Methods such as rollback and modular redundancy are efficient to correct transient errors. In hard real-time systems, however, correction has a strong impact on response times, also on tasks that were not directly affected by errors. Due to deadline misses, these tasks eventually fail to provide correct service. In this paper we present a reliability analysis for periodic task sets and static priorities that includes realistic detection and roll-back scenarios and covers a hyperperiod instead of just a critical instant and therefore leads to much higher accuracy than previous approaches. The approach is compared with Monte-Carlo simulation to demonstrate the accuracy and with previous approaches covering critical instants to evaluate the improvements.

References

[1]
T. Austin, D. Blaauw, T. Mudge, and K. Flautner. Making typical silicon matter with razor. IEEE Computer, 37(3):57--65, 2004.
[2]
S. Baruah, H. Li, and L. Stougie. Towards the design of certifiable mixed-criticality systems. In Proc. of Real-Time and Embedded Technology and Applications Symp., pages 13--22. IEEE, 2010.
[3]
S. Borkar. Designing reliable systems from unreliable components: the challenges of transistor variability and degradation. IEEE Micro, 25(6):10--16, 2005.
[4]
I. Broster, A. Burns, and G. Rodríguez-Navas. Probabilistic analysis of CAN with faults. In Proc. of Real-Time Systems Symposium, pages 269--278. IEEE, 2002.
[5]
A. Burns, R. Davis, and S. Punnekkat. Feasibility analysis of fault-tolerant real-time task sets. In Proc. of Euromicro Workshop Real-Time Systems, pages 29--33, 1996.
[6]
A. Burns, S. Punnekkat, L. Strigini, and D. R. Wright. Probabilistic scheduling guarantees for fault-tolerant real-time systems. In Proc. of Dependable Computing for Critical Applications, pages 361--378, 1999.
[7]
D. Chabrol, C. Aussagues, and V. David. A spatial and temporal partitioning approach for dependable automotive systems. In Proc. of Emerging Technologies & Factory Automation, pages 1--8, 2009.
[8]
M. Glass, M. Lukasiewycz, F. Reimann, C. Haubelt, and J. Teich. Symbolic reliability analysis and optimization of ECU networks. In Proc. of Design, Automation and Test in Europe, pages 158--163, 2008.
[9]
International Electrotechnical Commission (IEC). Functional safety of electrical / electronic / programmable electronic safety-related systems, 1998.
[10]
V. Izosimov, P. Pop, P. Eles, and Z. Peng. Synthesis of fault-tolerant embedded systems with checkpointing and replication. In Proc. of Int. Workshop Electronic Design, Test and Applications, 2006.
[11]
H. Kopetz. Real-Time Systems: Design Principles for Distributed Embedded Applications. Kluwer Academic Publishers, Norwell, MA, USA, 1997.
[12]
C. LaFrieda, E. Ipek, J. F. Martinez, and R. Manohar. Utilizing dynamically coupled cores to form a resilient chip multiprocessor. In Proc. of Int. Conf. Dependable Systems and Networks, pages 317--326, 2007.
[13]
P. Pop, V. Izosimov, P. Eles, and Z. Peng. Design optimization of time- and cost-constrained fault-tolerant embedded systems with checkpointing and replication. IEEE Trans. on VLSI, 17(3):389--402, 2009.
[14]
S. Punnekkat and A. Burns. Analysis of checkpointing for schedulability of real-time systems. In Proc. of Int. Workshop Real-Time Computing Systems and Applications, pages 198--205, 1997.
[15]
M. Sebastian and R. Ernst. Reliability Analysis of Single Bus Communication with Real-Time Requirements. In Proc. of Pacific Rim Int. Symp. Dependable Computing, pages 3--10, 2009.
[16]
J. C. Smolens, B. T. Gold, J. Kim, B. Falsafi, J. C. Hoe, and A. G. Nowatryk. Fingerprinting: bounding soft-error-detection latency and bandwidth. IEEE Micro, 24(6):22--29, 2004.
[17]
D. J. Sorin, M. M. K. Martin, M. D. Hill, and D. A. Wood. Safetynet: improving the availability of shared memory multiprocessors with global checkpoint/recovery. In Proc. of Int. Computer Architecture Symp., pages 123--134, 2002.
[18]
R. Teodorescu, J. Nakano, and J. Torrellas. Swich: A prototype for efficient cache-level checkpointing and rollback. IEEE Micro, 26(5):28--40, 2006.
[19]
K. W. Tindell, A. Burns, and A. J. Wellings. An extendible approach for analyzing fixed priority hard real-time tasks. Real-Time Systems, 6(2):133--151, 1994.

Cited By

View all
  • (2023)Software Fault Tolerance in Real-Time Systems: Identifying the Future Research QuestionsACM Computing Surveys10.1145/358995055:14s(1-30)Online publication date: 17-Jul-2023
  • (2022)A Mixed-Criticality Approach to Fault Tolerance: Integrating Schedulability and Failure Requirements2022 IEEE 28th Real-Time and Embedded Technology and Applications Symposium (RTAS)10.1109/RTAS54340.2022.00011(27-39)Online publication date: May-2022
  • (2021)Reliability-Aware Resource Management in Multi-/Many-Core Systems: A Perspective PaperJournal of Low Power Electronics and Applications10.3390/jlpea1101000711:1(7)Online publication date: 25-Jan-2021
  • Show More Cited By

Index Terms

  1. Reliability analysis for MPSoCs with mixed-critical, hard real-time constraints

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CODES+ISSS '11: Proceedings of the seventh IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
        October 2011
        402 pages
        ISBN:9781450307154
        DOI:10.1145/2039370
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 09 October 2011

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. embedded systems
        2. real-time

        Qualifiers

        • Research-article

        Conference

        ESWeek '11
        ESWeek '11: Seventh Embedded Systems Week
        October 9 - 14, 2011
        Taipei, Taiwan

        Acceptance Rates

        Overall Acceptance Rate 280 of 864 submissions, 32%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)6
        • Downloads (Last 6 weeks)2
        Reflects downloads up to 10 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)Software Fault Tolerance in Real-Time Systems: Identifying the Future Research QuestionsACM Computing Surveys10.1145/358995055:14s(1-30)Online publication date: 17-Jul-2023
        • (2022)A Mixed-Criticality Approach to Fault Tolerance: Integrating Schedulability and Failure Requirements2022 IEEE 28th Real-Time and Embedded Technology and Applications Symposium (RTAS)10.1109/RTAS54340.2022.00011(27-39)Online publication date: May-2022
        • (2021)Reliability-Aware Resource Management in Multi-/Many-Core Systems: A Perspective PaperJournal of Low Power Electronics and Applications10.3390/jlpea1101000711:1(7)Online publication date: 25-Jan-2021
        • (2020)A Taxonomy of Supervised Learning for IDSs in SCADA EnvironmentsACM Computing Surveys10.1145/337949953:2(1-37)Online publication date: 17-Apr-2020
        • (2020)A Survey of Network Virtualization Techniques for Internet of Things Using SDN and NFVACM Computing Surveys10.1145/337944453:2(1-40)Online publication date: 17-Apr-2020
        • (2020)A Survey of Blockchain-Based Strategies for HealthcareACM Computing Surveys10.1145/337691553:2(1-27)Online publication date: 20-Mar-2020
        • (2020)The Landscape of Exascale ResearchACM Computing Surveys10.1145/337239053:2(1-43)Online publication date: 20-Mar-2020
        • (2020)ASTEROID and the Replica-Aware Co-scheduling for Mixed-CriticalityDependable Embedded Systems10.1007/978-3-030-52017-5_3(57-84)Online publication date: 10-Dec-2020
        • (2019)TÿchoACM Transactions on Embedded Computing Systems10.1145/336269218:6(1-25)Online publication date: 14-Dec-2019
        • (2019)REALACM Transactions on Embedded Computing Systems10.1145/336210018:6(1-24)Online publication date: 15-Nov-2019
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media