@signof

The Heard-Of model: computing in distributed systems with benign faults

, and . Distributed Computing, 22 (1): 49--71 (April 2009)

Abstract

Abstract  Problems in fault-tolerant distributed computing have been studied in a variety of models. These models are structured around two central ideas: (1) degree of synchrony and failure model are two independent parameters that determine a particular type of system, (2) the notion of faulty component is helpful and even necessary for the analysis of distributed computations when faults occur. In this work, we question thesetwo basic principles of fault-tolerant distributed computing, and show that it is both possible and worthy to renounce themin the context of benign faults: we present a computational model based only on the notion of transmission faults. In this model, computations evolve in rounds, and messages missed in a round are lost. Only information transmission isrepresented: for each round r and each process p, our model provides the set of processes that p “hears of” at round r (heard-of set), namely the processes from which p receives some message at round r. The features of a specific system are thus captured as a whole, just by a predicate over the collection of heard-of sets.We show that our model handles benign failures, be they static or dynamic, permanent or transient, in a unified framework.We demonstrate how this approach leads to shorter and simpler proofs of important results (non-solvability, lower bounds).In particular, we prove that the Consensus problem cannot be generally solved without an implicit and permanent consensuson heard-of sets. We also examine Consensus algorithms in our model. In light of this specific agreement problem, we showhow our approach allows us to devise new interesting solutions.

Description

The Heard-Of model: computing in distributed systems with benign faults

Links and resources

Tags

community

  • @signof
  • @giuliano.losa
  • @dblp
@signof's tags highlighted