Fault Tolerance in Distributed Systems: Distributed Consensus

Back to writing after a while. Exam time! I will try my best to stick around. Anyway, back to the topic: welcome to one of the most sought-after and esoteric topics in distributed systems, Distributed Consensus! A plethora of algorithms and their variants have been proposed to solve this problem. This is a very rigorous subject and we…

Fault Tolerance in Distributed Systems: Timing Models

In the last post, I wrote about fault tolerance in distributed systems and how failure models are classified. This post describes the timing models that need to be considered when studying fault tolerance. Roughly, a timing model is the way a distributed system behaves with respect to time. 1. Synchronous timing model –…

Fault Tolerance in Distributed Systems: Introduction

“Fault tolerance”, the ability to handle any type of fault, is in itself a motivation for distributed systems. It is one of the most widely studied topics in the area of distributed systems, and it has remained a hot area for some obvious reasons: if you are talking of a distributed environment of thousands…