Software and hardware systems have become increasingly used in many industrial sectors, such as manufacturing, energy supply, aerospace, transportation, communication and healthcare. Failures due to software or hardware malfunctions, users' mistakes and malicious intentions can have serious economics consequences, and can also endanger human life. Fault tolerance prevents system failures and is intended to ensure that it delivers the required service in spite of faults and errors which it might encounter and as such it is crucial for meeting high reliability and availability requirements.
Fault tolerance engineering during the entire life cycle has been advocated by some researchers as one of the main approaches to ensuring the overall system dependability. In particular, different classes of faults, errors and failures must be identified and dealt with at each phase of software development, depending on the abstraction level used in modelling the software system under development. A number of studies have been conducted so far in these areas, but understanding where and how fault-tolerance should be integrated in the software life-cycle still requires major research efforts.
This international workshop builds on this trend and aims at investigating how fault tolerance mechanisms can be taken into account when engineering complex software systems. To address the new problems the system developers are facing nowadays (such as identifying the places where fault tolerance means must be applied and the degree of fault tolerance that must be achieved) we need novel models to be applied at different abstraction levels (requirement, architecture and design models for fault tolerance, together with new implementation schemes), innovative technologies (tools and frameworks for implementing distributed fault tolerant systems) and advanced verification environments (to assess the achieved level of fault tolerance and to evaluate the dependability properties of the systems). Recently there has been growing interest in the areas directly related and overlapping with fault tolerance, such as system self-healing, resilience, self-adaptation and self-management. The topics related to engineering of systems with such properties are in the scope of the workshop as the intention is to improve the current understanding of how fault tolerance engineering can benefit from research on these areas.
EFTS 2007 is the appropriate venue to reflect on the achievements of the researchers and practitioners in the fields of software engineering and fault tolerance, bringing together people from these two communities.
Proceeding Downloads
Formal modeling of BPEL workflows including fault and compensation handling
Electronically executed business processes are frequently implemented using the Business Process Execution Language (BPEL). These workflows may be in control of crucial business processes of an organization, in the same time existing model checking ...
Towards fault tolerance in web services compositions
Many businesses are now moving towards the use of composite web services that are based on a collection of web services working together to achieve an objective. Although they are becoming business-critical elements, current development support tools do ...
Diagnosis service for embedded software component based systems
This paper studies the fault diagnosis of component-based applications, especially embedded ones. The principle of the proposed diagnosis technique is to implement inter-component tests in order to detect and locate faulty components without component ...
On engineering standards based carrier grade platforms
The remarkable pace of advancement in communications technologies and the exponential growth of the market have pressured network equipment providers into producing more features in products in a much faster rate at lower costs. The strategy of buying ...
Experimenting with diversity in the model driven development of a railway signaling system
In this paper we discuss how we have introduced elements of diversity in the experimental model driven development process of a railway signalling system. The experience has been done inside a larger industrial project undertaken to evaluate the ...
Fault-tolerant digital systems implemented with partially definite and partially correct automata
The multi-version approach to the system design is becoming a standard for the critical applications as it allows decreasing the probability of common mode failure. The main idea of this approach is to obtain the least correlated versions of the same ...
Fault and adversary tolerance as an emergent property of distributed systems' software architectures
Fault and adversary tolerance have become not only desirable but required properties of software systems because mission-critical systems are commonly distributed on large networks of insecure nodes. In this paper, we describe how the tile style, an ...
Scheduling of embedded time-triggered systems
Distributed system composition is the main trend in creating safety-critical (SC) real-time systems like automotive, aerospace, and industrial control systems. Their growing complexity (e.g. tens of control units in a modern car) led to an integrated ...
Evaluating fault-tolerant system designs using FAUmachine
This paper presents an elaborate framework for early evaluation of fault-tolerant behaviour of complex hardware/software systems in the presence of hardware faults. The evaluation is based on sophisticated fault injection experiments. An illustrative ...
Refinement patterns for rapid development of dependable systems
Although, it is widely recognised that formal methods is the single most important technique for building complex dependable systems, formal methods struggle to gain wide acceptance outside their few traditional application areas. In this paper we ...
- Proceedings of the 2007 workshop on Engineering fault tolerant systems
Recommendations
Fault Injection and Dependability Evaluation of Fault-Tolerant Systems
The authors describe a dependability evaluation method based on fault injection that establishes the link between the experimental evaluation of the fault tolerance process and the fault occurrence process. The main characteristics of a fault injection ...
Graceful Degradation in Algorithm-Based Fault Tolerant Multiprocessor Systems
Algorithm-based fault tolerance (ABFT) is a technique which improves the reliability of a multiprocessor system by providing concurrent error detection and fault location capability to it. It encodes data at the system level and modifies the algorithm ...