If you have any trouble accessing any of these materials, please post a note on Piazza and we'll help.
- Slides: all animations, limited animations
- Outline
- Recording
- No supplemental reading for this lecture
- Explain the problem with simply extending 2PC to multiple servers and multiple coordinators
- Explain the problem that a network partition causes when we use a primary/backup mechanism
- Explain the role of every component in a replicated state machine (RSM): coordinators, the view server, and primary and backup servers
  - This includes any data structures each component keeps, how it responds to requests, whether it communicates with other components, etc.
- Explain the role of every component in an RSM when there are failures (server failures, network partitions)
  - This includes how a component would discover a failure, any actions it needs to take in response to the failure, what would happen while the failure is being handled, etc.
- Explain why we allow the view server to be a centralized component, but chose not to centralize the role of the coordinator
- After Recitation 20: Put the problem that Raft solves in the context of RSMs
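
As a study aid for the objectives above, here is a minimal sketch of how a view server might track which server is the primary and which is the backup, and promote the backup when the primary stops responding. It is not the lecture's implementation. The names `View`, `ViewServer`, `ping`, and `tick`, the ping-timeout failure detector, and the timeout value are all assumptions made for illustration. The sketch also leaves out details a real design needs, such as waiting for the primary to acknowledge a view before moving to the next one.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class View:
    """One numbered assignment of roles (hypothetical structure)."""
    number: int = 0
    primary: Optional[str] = None
    backup: Optional[str] = None


class ViewServer:
    """Hypothetical view server: decides who is primary and who is backup."""

    def __init__(self, timeout: float = 3.0) -> None:
        self.timeout = timeout                  # assumed failure-detection window
        self.view = View()
        self.last_ping: Dict[str, float] = {}   # server name -> time of last ping

    def ping(self, server: str, now: float) -> View:
        """Record a heartbeat and fill any empty role with the pinging server."""
        self.last_ping[server] = now
        if self.view.primary is None:
            self._new_view(primary=server, backup=None)
        elif self.view.backup is None and server != self.view.primary:
            self._new_view(primary=self.view.primary, backup=server)
        return self.view

    def tick(self, now: float) -> View:
        """Check for servers that have gone silent and update the view."""
        primary, backup = self.view.primary, self.view.backup
        if primary is not None and not self._alive(primary, now):
            if self._alive(backup, now):
                # Primary looks dead (or partitioned away): promote the backup.
                self._new_view(primary=backup, backup=None)
            # With no live backup, this sketch simply keeps the old view.
        elif backup is not None and not self._alive(backup, now):
            self._new_view(primary=primary, backup=None)
        return self.view

    def _alive(self, server: Optional[str], now: float) -> bool:
        if server is None or server not in self.last_ping:
            return False
        return now - self.last_ping[server] <= self.timeout

    def _new_view(self, primary: Optional[str], backup: Optional[str]) -> None:
        self.view = View(self.view.number + 1, primary, backup)


if __name__ == "__main__":
    vs = ViewServer(timeout=3.0)
    vs.ping("S1", now=0.0)      # S1 becomes primary
    vs.ping("S2", now=1.0)      # S2 becomes backup
    vs.ping("S2", now=4.0)      # S1 has stopped pinging
    print(vs.tick(now=5.0))     # S2 is promoted to primary
```

Note that the view server cannot tell a crashed primary from one that is merely partitioned from it, so coordinators and servers in this kind of design rely on the view number to reject requests that belong to an outdated view.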