by hugofirth on 7/12/21, 6:20 AM with 24 comments
by rdw on 7/13/21, 10:16 PM
by luhn on 7/13/21, 10:43 PM
The whole section on "bin-packing Paxos/Raft is more efficient" is strange, because people don't generally bin-pack Paxos/Raft—The bin-packing orchestrators are built off of Paxos/Raft!
by hugofirth on 7/13/21, 9:35 PM
If your raft state machines are doing IO via some write through cache (which they often are) then having specific machines do specific jobs can increase the cache quality. I.e. your leader node can have a better cache for your write workload, whilst your follower nodes can have better caches for your read workload.
This may lead to higher throughput (yay) but then also leave you vulnerable to significant slow-downs after leader elections (boo).
What makes sense will depend on your use case, but I personally agree with the author that multiple simple raft/paxos groups scheduled across nodes by some workload aware component might be the best of both worlds.
by toolslive on 7/13/21, 10:39 PM
by LAC-Tech on 7/13/21, 10:06 PM
Does leader = master here? My first reaction is that this is a multi-master system but I can't quite unpack "opportunistic coordinator".
by mistralefob on 7/13/21, 8:56 PM
by orangepanda on 7/14/21, 9:31 AM
Actual title - why fast replication protocols are slow