Review on Practical Byzantine Fault Tolerance

The paper of Castro-Liskov talks about a practical algorithm that is able to tolerate Byzantine faults. The practical algorithm works in asynchronous environments, such as the Internet, and is able to speed up the response time. Byzantine fault happens in distributed computing systems where there is imperfect information about which nodes is treacherous. For a distributed systems to work, the healthy nodes must work out a consensus despite the presence of treacherous nodes.

The practical algorithm (PBFT) offers safety and liveness, provided at most maximum [(n-1)/3] nodes are simultaneously faulty. Safety means that the systems functions as a centralised systems that executes operations atomically. Liveness means that nodes eventually receive replies to their requests.

The algorithm can be used to implement any deterministic replicated service with state and some operations. The operations can be reads or writes of the service state. Deterministic 是指在执行程序，同样的参数和状态会产生同样的结果。

The algorithm is a form of state machine replication, where the service is modeled as a state machine that is replicated across different nodes in a distributed system. Each state machine replica maintains the state and implements the operations. The view is snapshot of the moving state of replicas. In a view, one replica is the primary, the others are backups.

The algorithm works roughly as follows:

A client sends a request to invoke a service operation to the primary

node
The primary multicasts the request to the backup nodes
Nodes execute the request and send a reply to the client
The client waits for 1 replies from different nodes with the same result; this is the result of the operation.

If the nodes are deterministic and starting from same state, all healthy nodes agree on the outcome of the execution of request despite faulty nodes.

In normal operation, where primary node receives client request, it starts a three phase protocol to send the request to the replica nodes. The three phases are pre-prepare, prepare and commit. The pre-prepare and prepare phases are used to totally order requests sent in the same view even when the primary, which proposes the ordering of requests, is faulty. The prepare and commit phases are used to ensure that requests that commit are totally ordered across views. (Total order means any pairs in the set is comparable.)

PBFT 的三个阶段

Review on Practical Byzantine Fault Tolerance

In the figure above, it shows the operation of the algorithm in the normal case of no primary faults. Replica 0 is the primary, replica 3 is faulty, and C is the client.

The view-change protocol provides liveness by allowing the system to make progress when the primary fails. View changes are triggered by backup request timeouts. A backup is waiting for a request if it received a valid request and has not executed it. If timeout, the backup starts a view change to move the system to the next view.

To improve the response time of the algorithm, three optimisations are applied. The first avoids sending large replies. Only one nodes sends the result. The other nodes send the digest of the result.

The second reduces the number of message delays. After request is executed, the nodes send tentative replies to the client. If 2f+1 relies are matching, the request is guaranteed to commit eventually, and there is no retransmission of the request.

The third improves the performance of read only operations that do not modify the state. Read only request are executed immediately and reply is sent only after all request are reflected in the tentative state are committed. This is to prevent the client from observing uncommitted state.

The Istanbul BFT consensus is inspired by PBFT.

See http://docs.goquorum.com/en/latest/Consensus/ibft/ibft/

and

https://github.com/ethereum/EIPs/issues/650

Review on Practical Byzantine Fault Tolerance

继续阅读

openstack安装指南_5个使用OpenStack的新指南

openstack安装指南_3个新的OpenStack指南

Hyperledger Fabric（术语表）

Why and How zk-SNARK Works 1: Introduction & the Medium of a Proof

Cisco官方网站悄然换标

火币链转账

火币链节点监听记录火币链节点监听记录

世链投研| 仅一个月追赶DeFi？火币生态链Heco有何过人之处？

HyperGraph(HGT)—Heco生态链3月最重磅项目发布

币安链、火币链提前预知配对合约地址方法Solidity代码

区块链可能会如何改变音乐行业？

fabric go语言链码打包并在其他Peer节点部署

比特币下跌与加密货币的联动效应（附代码）

在拥有比特币之前了解区块链钱包开发

2021-09-30一码在手安全无忧从农田到餐桌，全流程追溯四大模块，助力客户实现品牌化

写作一段时间后，我的反思