Search papers, labs, and topics across Lattice.
The paper introduces CD-Raft, an optimized Raft consensus protocol designed to minimize latency in cross-domain data synchronization scenarios. CD-Raft achieves this by optimizing cross-domain round-trip times for read/write operations and strategically positioning the leader node within the distributed system. Evaluation using the YCSB benchmark demonstrates a 32.90% reduction in average latency and a 49.24% reduction in tail latency compared to the classic Raft protocol.
CD-Raft slashes distributed consensus latency by nearly 50% in cross-domain settings, offering a significant speedup for data-intensive AI workloads.
Today's massive AI computation loads push heavy data synchronization across sites, i.e., nodes in data centers. Any reduction in such consensus latency can significantly improve the overall performance of desired systems. This consensus challenge explosively peaks at cross-domain sites. In this paper, we proposed CD-Raft to address the cross-domain latency challenge, an optimized Raft protocol for strong consistency in cross-domain sites. CD-Raft can significantly reduce consensus latency by optimizing cross-domain round-trip time (RTT) for reads and writes, as well as carefully positioning the leader node. We verified the correctness of CD-Raft in a formal specification using the TLA+ specification, guaranteeing the strong consistency across sites. We have prototyped CD-Raft and evaluated it using the YCSB benchmark. Empirical results show that compared to the classic Raft, CD-Raft reduces the average latency by 32.90% and (99th percentile) tail latency by 49.24% for renown traces across multiple sites.