The unexpected and “Byzantine faults”

I was discussing with a colleague of mine the other day about handling of node failures in a distributed system, esp the case where a node A thinks that node C is down but node B thinks otherwise. The ramifications of this could be fatal to the system unless handled properly.

Later I came across the term “Byzantine faults” in another unrelated search and started following lot of other literature around this. Here are couple of good reads.

Leave a comment