Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3215525.3215538acmconferencesArticle/Chapter ViewAbstractPublication PagesmobisysConference Proceedingsconference-collections
research-article

Transparent Fault Tolerance for Real-Time Automation Systems

Published: 10 June 2018 Publication History

Abstract

Developing software is hard. Developing software that is resilient and does not crash at the occurrence of unexpected inputs or events is even harder, especially with IoT devices and real-time requirements, e.g., due to interactions with human beings. Therefore, there is a need for a software architecture that helps software developers to build fault-tolerant software with as little pain and effort as possible. To this end, we have designed a fault tolerance framework for automation systems that lets developers be mostly oblivious to fault tolerance issues. Thus they can focus on the application logic encapsulated in (micro)services. That is, the developer only needs to specify the required fault tolerance level by description, not implementation. The fault tolerance aspects are transparent to the developer, as the framework takes care of them. This approach is particularly suited for the development for mixed-criticality systems, where different parts have very different and demanding functional and non-functional requirements. For such systems highly specialized developers are needed and removing the burden of fault tolerance results in faster time to market and safer and more dependable systems.

References

[1]
Navid Aghdaie. 2005. Transparent Fault-Tolerant Network Services Using Off-the-Shelf Components. Ph.D. Dissertation. University of California, Los Angeles.
[2]
Joe Armstrong. 2010. erlang. Commun. ACM 53, 9 (2010), 68--75.
[3]
Thomas Becker. 1994. Application-transparent fault tolerance in distributed systems. In Proceedings of 2nd International Workshop on Configurable Distributed Systems. IEEE, 36--45.
[4]
Vijay Dialani, Simon Miles, Luc Moreau, David De Roure, and Michael Luck. 2002. Transparent fault tolerance for web services based architectures. In European Conference on Parallel Processing. Springer, 889--898.
[5]
Jeff Fryman and Bjoern Matthias. 2012. Safety of industrial robots: From conventional to collaborative applications. In Robotics; Proceedings of ROBOTIK 2012; 7th German Conference on. VDE, 1--5.
[6]
Rachid Guerraoui, David Kozhaya, Manuel Oriol, and Yvonne-Anne Pignolet. 2016. Who's On Board?: Probabilistic Membership for Real-Time Distributed Control Systems. In 35th Symposium on Reliable Distributed Systems (SRDS). IEEE, 167--176.
[7]
Nitya Narasimhan. 2001. Transparent fault tolerance for Java remote method invocation. University of California, Santa Barbara.
[8]
Titos Saridakis. 2002. A System of Patterns for Fault Tolerance. In EuroPLoP.
[9]
Robert Stewart, Patrick Maier, and Phil Trinder. 2016. Transparent fault tolerance for scalable functional computation. Journal of Functional Programming 26 (2016).

Cited By

View all
  • (2022)Enhanced software development process for CubeSats to cope with space radiation faults2022 IEEE 27th Pacific Rim International Symposium on Dependable Computing (PRDC)10.1109/PRDC55274.2022.00022(78-88)Online publication date: Nov-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IoPARTS'18: Proceedings of the 1st International Workshop on Internet of People, Assistive Robots and Things
June 2018
72 pages
ISBN:9781450358439
DOI:10.1145/3215525
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automation systems
  2. dependability
  3. fault-tolerance
  4. real-time
  5. transparency

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MobiSys '18
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Enhanced software development process for CubeSats to cope with space radiation faults2022 IEEE 27th Pacific Rim International Symposium on Dependable Computing (PRDC)10.1109/PRDC55274.2022.00022(78-88)Online publication date: Nov-2022

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media