Fact-checked by Grok 2 weeks ago
References
-
[1]
[PDF] Replication Management using the State Machine ApproachThe state machine approach is a general method for implementing a fault-tolerant service by replicating servers and coordinating client interactions with ...
-
[2]
[PDF] A Guided Tour on the Theory and Practice of State Machine ...In this chapter we present a guided tour to the most important results regarding the theory and practice of State Machine Replication (SMR) for fault tolerance.
-
[3]
[PDF] Leaderless State-Machine Replication: Specification, Properties ...Aug 6, 2020 · In SMR, a service is defined by a deterministic state machine, and each process maintains its own local copy of the machine. Classical SMR ...
-
[4]
Paxos vs Raft: have we reached c - ACM Digital LibraryApr 27, 2020 · State machine replication requires that an application's deterministic state machine is replicated across n servers with each applying the same ...
-
[5]
Byzantine Fault-tolerant State-machine Replication from a Systems ...Feb 11, 2021 · This survey aims at facilitating the task of building BFT systems by presenting an overview of state-of-the-art techniques and analyzing their practical ...
-
[6]
[PDF] Spanner: Google's Globally-Distributed DatabaseTo support replication, each spanserver implements a single Paxos state machine on top of each tablet. (An early Spanner incarnation supported multiple Paxos ...
-
[7]
Low-latency geo-replicated state machines with guaranteed writesReplicated state machines are an important and widely-studied methodology for tolerating a wide range of faults. Unfortunately, while replicas should be ...
-
[8]
[PDF] Exploring Latency Boundaries of Blockchains in Edge Computing ...Edge computing and its applications tend to be distributed and ... consensus nodes adapting traditional protocols known from state machine replication.<|control11|><|separator|>
-
[9]
[PDF] Implementing Fault-Tolerant Services Using the State Machine ...This paper reviews the approach and describes protocols for two different failure models-Byzantine and fail stop. System reconfiguration techniques for removing ...Missing: seminal | Show results with:seminal
-
[10]
[PDF] Contributions to the practice and theory of state-machine replicationMay 26, 2024 · The first historical definition of SMR appears in the seminal paper of Lamport [10] on causality in distributed systems. In this paper ...
-
[11]
Implementing fault-tolerant services using the state machine approachThe state machine approach is a general method for implementing fault-tolerant services in distributed systems. This paper reviews the approach.
-
[12]
Eventual Consistency Today: Limitations, Extensions, and BeyondMay 1, 2013 · In a dynamic, partitionable Internet, services requiring guaranteed low latency must often relax their expectations of data consistency.
-
[13]
Eventually Consistent - ACM QueueDec 4, 2008 · A system that is not tolerant to network partitions can achieve data consistency and availability, and often does so by using transaction ...
-
[14]
[PDF] An Analysis of Network-Partitioning Failures in Cloud SystemsOct 8, 2018 · distributed systems: network-partitioning faults. ... Nagappan,. "Characterizing cloud computing hardware reliability," in ACM symposium on Cloud.
-
[15]
What are Race Conditions? - Some Issues and FormalizationsAug 6, 2025 · A race condition occurs in a parallel program execution when two or more threads access a common resource, e.g., a variable in shared memory, ...
-
[16]
[PDF] Evaluating the Scalability of Distributed SystemsAbstract. Many distributed systems must be scalable, meaning that they must be economically deployable in a wide range of sizes and con gurations.
-
[17]
[PDF] Perspectives on the CAP Theorem - ResearchConsistency among the servers is ensured by using a replicated state machine protocol (specifically, Paxos [20]) to maintain synchronized logs. Chubby continues ...
-
[18]
[PDF] Impossibility of Distributed Consensus with One Faulty ProcessThe consensus problem involves an asynchronous system of processes, some of which may be unreliable. The problem is for the reliable processes to agree on a ...
-
[19]
Avoiding Single Points of Failures in Distributed Systems - BaeldungMar 18, 2024 · In distributed systems, a Single Point of Failure (SPOF) is such a ... It is important to make a system resilient to failures of external services ...
-
[20]
[PDF] Practical Byzantine Fault ToleranceThis paper describes a new replication algorithm that is able to tolerate Byzantine faults. We believe that Byzantine- fault-tolerant algorithms will be ...
-
[21]
[PDF] Total Order Broadcast and Multicast Algorithms: Taxonomy and SurveyWe define five classes of order- ing mechanisms: communication history, privilege-based, moving sequencer, fixed sequencer, and destinations agreement. In this ...
-
[22]
[PDF] In Search of an Understandable Consensus AlgorithmMay 20, 2014 · We used our Raft implementation to measure the per- formance of Raft's leader election algorithm and answer two questions. First, does the ...
-
[23]
[PDF] Paxos Made Simple - Leslie LamportNov 1, 2001 · To guarantee that all servers execute the same sequence of state machine commands, we implement a sequence of separate instances of the Paxos.
-
[24]
[PDF] Viewstamped Replication Revisited - MITThis paper presents an updated version of Viewstamped. Replication, a replication technique that handles failures in which nodes crash.
-
[25]
[PDF] State Machine Replication with Byzantine Faults - cachin.comMar 5, 2009 · The following definition [4] is adapted from the corre- sponding one in the crash-failure model [13]. ... Byzantine fault model. The ...
-
[26]
[PDF] On the Efficiency of Durable State Machine Replication - USENIXThis paper addresses the problem of adding durability to SMR systems. Durability is defined as the capability of a SMR system to survive the crash or shutdown ...Missing: seminal | Show results with:seminal<|control11|><|separator|>
-
[27]
[PDF] Verifiable state machines: Proofs that untrusted services operate ...This article describes recent progress in realizing verifiable state machines, a primitive that enables untrusted services to provide cryptographic proofs that ...Missing: randomness | Show results with:randomness
-
[28]
[PDF] Practical Byzantine Fault Tolerance - MicrosoftCastro, A. Adya, B. Liskov, and A. Myers. HAC: Hybrid Adaptive Caching for Distributed Storage Systems. In Proc. 16th ACM Symp. on Operating System.Missing: paper | Show results with:paper
-
[29]
[PDF] Tolerating latency in replicated state machines through client ...In this section, we apply our general strategy for support- ing client speculative execution in replicated services to the Practical Byzantine Fault Tolerance ( ...
-
[30]
[PDF] Unreliable Failure Detectors for Reliable Distributed Systemsand accuracy. We show that Consensus can be solved even with unreliable failure detectors that make an infinite number of mistakes, and determine which ones.
-
[31]
[PDF] All about Eve: Execute-Verify Replication for Multi-Core ServersFigure 1 shows an overview of Eve, whose “execute- then-verify” design departs from the “agree-then- execute” approach of traditional SMR [7, 27, 50].
-
[32]
[PDF] Towards Fast and Adaptive Byzantine State Machine Replication for ...The periodic auditing of replicas can be used to construct a recovery mechanism: When inconsistent ... BFT-SMaRt Byzantine Fault-Tolerant State Machine ...
-
[33]
[PDF] Failure Detectors | CSE 486/586 Distributed SystemsWhat do we want from a failure detector? – Failures are always detected (completeness). – No false positives (accuracy) ... • False Detection Rate: Average number ...
-
[34]
[PDF] Byzantine Fault-Tolerant State-Machine Replication from a Systems ...Byzantine fault-tolerant (BFT) state-machine replication makes it possible to design systems that are resilient against arbitrary faults, a requirement ...
-
[35]
[PDF] Liveness and latency of Byzantine state-machine replicationWe present a simple formal specification of an SMR synchronizer and its bounded-space implementation under partial synchrony. We also apply our ...
-
[36]
[PDF] Dynamic Reconfiguration of Primary/Backup Clusters - USENIXPrimary/backup repli- cation is a special instance of a more general problem, state-machine replication (SMR). ... as Paxos [15]. Similarly to our ...
-
[37]
[PDF] Vertical Paxos and Primary-Backup Replication - Leslie LamportWe introduce a class of Paxos algorithms called Vertical Paxos, in which reconfiguration can occur in the middle of reaching agreement.
-
[38]
[PDF] High performance recovery for parallel state machine replicationBy introducing the consensus instance in the delivery event, a server can easily determine the messages it needs to retrieve upon recovering from a failure.
-
[39]
[PDF] Time, Clocks, and the Ordering of Events in a Distributed SystemA distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events. The use of the total ordering is ...
-
[40]
[PDF] Tandem TR 88.6 A COMPARISON OF THE BYZANTINE ... - Jim GrayThey give single-fault tolerance by duplexing fail-fast modules [Gray2]. Byzantine Algorithms require at least four-plexed modules to tolerate a single fault.
-
[41]
[PDF] In Search of an Understandable Consensus AlgorithmRaft is a consensus algorithm for managing a replicated log, designed to be more understandable than Paxos, which is difficult to understand.
-
[42]
HotStuff: BFT Consensus in the Lens of Blockchain - arXivMar 13, 2018 · We present HotStuff, a leader-based Byzantine fault-tolerant replication protocol for the partially synchronous model.Missing: scalability | Show results with:scalability
-
[43]
[PDF] Consensus without Mining - TendermintAbstract. Cryptocurrencies such as Bitcoin enable users to submit payment transactions without going through a centralized trusted or- ganization.Missing: Cosmos replication
-
[44]
[PDF] The design, architecture and performance of the Tendermint ... - SUSITendermint is a state machine replication (SMR) [1] en- gine that tolerates Byzantine faults. It was among the first systems to adapt classical Byzantine ...
-
[45]
Extending State Machine Replication through CompositionDec 10, 2024 · This paper takes the first steps in this direction, presenting a formal definition of state machine replication and compositionality.
-
[46]
SWARM: Replicating Shared Disaggregated-Memory Data in No TimeSep 24, 2024 · We propose SWARM (Swift WAit-free Replication in disaggregated Memory), the first replication scheme for in-disaggregated-memory shared objects.<|separator|>
-
[47]
Building State Machine Replication Using Practical Network ... - arXivWe prove this hypothesis by engineering a practical design that uses a combination of kernel-bypass network, multithreaded architecture, and ...Missing: low- latency