Accessibility

6.033--Computer System Engineering

Suggestions for classroom discussion of:

Replication.

preliminary notes, by J. H. Saltzer, version of April 22, 2002, 1:45 p.m. (Some of these ideas came from the Echo paper.) Minor corrections for consistency with notes, January 2005.


Random observations, in random order. This is not a coherent set of suggestions!

  • Some algorithms are said to work in the face of "Non-transitive network service interruptions". That is an interesting additional constraint that we didn't consider in the Networking chapter. How could such interruptions arise? (On a poorly-terminated Ethernet with lots of echos, A can hear B and B can hear C, but A can't hear C.)

  • Cold start. What makes cold starts (e.g., general power failure) so hard to accomplish? (Unnoticed design circularity. For example, the file system requires name service, but name servers depend on the file system to hold their data bases.)

  • Doesn't placing a name server on a dedicated computer fix this? (Only if the dedicated computer has a stand-alone file system. That server can boot, then the name server can run and provide help to the rest of the community.)

  • Yes, but it doesn't have any backup. (So build three of them. The tricky part is that you also have to explicitly copy data from one to another to maintain the replicas, rather than expecting some replicated file system to do it automatically. They key is to identify the base cases and special-case them. The generators used by large power companies have bearings that must be lubricated by high-pressure pumps. The pumps, of course, are driven by electric motors. They have the same problem performing a cold start.)
    Comments and suggestions: Saltzer@mit.edu