Name: | Description: | Size: | Format: | |
---|---|---|---|---|
390.2 KB | Adobe PDF |
Authors
Advisor(s)
Abstract(s)
It is imperative to accept that failures can and will occur, even in meticulously
designed distributed systems, and design proper measures to counter those failures.
Passive replication minimises resource consumption by only activating redundant
replicas in case of failures, as typically providing and applying state updates is less
resource demanding than requesting execution. However, most existing solutions
for passive fault tolerance are usually designed and configured at design time, explicitly
and statically identifying the most critical components and their number of
replicas, lacking the needed flexibility to handle the runtime dynamics of distributed
component-based embedded systems.
This paper proposes a cost-effective adaptive fault tolerance solution with a significant
lower overhead compared to a strict active redundancy-based approach,
achieving a high error coverage with the minimum amount of redundancy. The
activation of passive replicas is coordinated through a feedback-based coordination
model that reduces the complexity of the needed interactions among components
until a new collective global service solution is determined, improving the overall
maintainability and robustness of the system.
Description
Keywords
Component-based systems Embedded real-time systems Coordination model Fault-tolerance Passive replication