Reading 20, Part 1: Synchronization

## Synchronization **The correctness of a concurrent program should not depend on accidents of timing.** Since race conditions caused by concurrent manipulation of shared mutable data are disastrous bugs --- hard to discover, hard to reproduce, hard to debug --- we need a way for concurrent modules that share memory to **synchronize** with each other. **Locks** are one synchronization technique. A lock is an abstraction that allows at most one thread to *own* it at a time. *Holding a lock* is how one thread tells other threads: "I'm changing this thing, don't touch it right now." Locks have two operations: + **`acquire`** allows a thread to take ownership of a lock. If a thread tries to acquire a lock currently owned by another thread, it *blocks* until the other thread releases the lock. At that point, it will contend with any other threads that are trying to acquire the lock. At most one thread can own the lock at a time. + **`release`** relinquishes ownership of the lock, allowing another thread to take ownership of it. Using a lock also tells the compiler and processor that you're using shared memory concurrently, so that registers and caches will be flushed out to shared storage. This avoids the problem of [reordering](../../17-concurrency/#reordering), ensuring that the owner of a lock is always looking at up-to-date data. ### Bank account example

Our first example of shared memory concurrency was a [bank with cash machines](../17-concurrency/#shared_memory_example). The diagram from that example is on the right. The bank has several cash machines, all of which can read and write the same account objects in memory. Of course, without any coordination between concurrent reads and writes to the account balances, [things went horribly wrong](../../17-concurrency/#interleaving). To solve this problem with locks, we can add a lock that protects each bank account. Now, before they can access or update an account balance, cash machines must first acquire the lock on that account.

In the diagram to the right, both A and B are trying to access account 1. Suppose B acquires the lock first. Then A must wait to read and write the balance until B finishes and releases the lock. This ensures that A and B are synchronized, but another cash machine C is able to run independently on a different account (because that account is protected by a different lock). ## Deadlock When used properly and carefully, locks can prevent race conditions. But then another problem rears its ugly head. Because the use of locks requires threads to wait (`acquire` blocks when another thread is holding the lock), it's possible to get into a a situation where two threads are waiting *for each other* --- and hence neither can make progress.

In the figure to the right, suppose A and B are making simultaneous transfers between two accounts in our bank. A transfer between accounts needs to lock both accounts, so that money can't disappear from the system. A and B each acquire the lock on their respective "from" account: A acquires the lock on account 1, and B acquires the lock on account 2. Now, each must acquire the lock on their "to" account: so A is waiting for B to release the account 2 lock, and B is waiting for A to release the account 1 lock. Stalemate! A and B are frozen in a "deadly embrace," and accounts are locked up. **Deadlock** occurs when concurrent modules are stuck waiting for each other to do something. A deadlock may involve more than two modules: the signal feature of deadlock is a **cycle of dependencies**, e.g. A is waiting for B which is waiting for C which is waiting for A. None of them can make progress. You can also have deadlock without using any locks --- we'll see an example when we discuss message-passing in the next section of the reading.

In the Java Tutorials, read: + [Deadlock] (1 page) [Deadlock]: http://docs.oracle.com/javase/tutorial/essential/concurrency/deadlock.html

## Developing a threadsafe abstract data type Let's see how to use synchronization to implement a threadsafe ADT. You can see all the code for this example on GitHub: [**edit buffer example**](https://github.com/mit6005/F14-EX20-editor). You are *not* expected to read and understand all the code. All the relevant parts are excerpted below. Suppose we're building a multi-user editor, like Google Docs, that allows multiple people to connect to it and edit it at the same time. We'll need a mutable datatype to represent the text in the document. Here's the interface; basically it represents a string with insert and delete operations: ```java /** An EditBuffer represents a threadsafe mutable string of characters * in a text editor. */ public interface EditBuffer { /** * Modifies this by inserting a string. * @param pos position to insert at (requires 0 <= pos <= current buffer length) * @param ins string to insert */ public void insert(int pos, String ins); /** * Modifies this by deleting a substring * @param pos starting position of substring to delete * (requires 0 <= pos <= current buffer length) * @param len length of substring to delete * (requires 0 <= len <= current buffer length - pos) */ public void delete(int pos, int len); /** * @return length of text sequence in this edit buffer */ public int length(); /** * @return content of this edit buffer */ public String toString(); } ``` A very simple rep for this datatype would just be a string: ```java public class SimpleBuffer implements EditBuffer { private String text; // Rep invariant: // text != null // Abstraction function: // represents the sequence text[0],...,text[text.length()-1] ``` The downside of this rep is that every time we do an insert or delete, we have to copy the entire string into a new string. That gets expensive. Another rep we could use would be a character array, with space at the end. That's fine if the user is just typing new text at the end of the document (we don't have to copy anything), but if the user is typing at the beginning of the document, then we're copying the entire document with every keystroke. A more interesting rep, which is used by many text editors in practice, is called a *gap buffer*. It's basically a character array with extra space in it, but instead of having all the extra space at the end, the extra space is a *gap* that can appear anywhere in the buffer. Whenever an insert or delete operation needs to be done, the datatype first moves the gap to the location of the operation, and then does the insert or delete. If the gap is already there, then nothing needs to be copied --- an insert just consumes part of the gap, and a delete just enlarges the gap! Gap buffers are particularly well-suited to representing a string that is being edited by a user with a cursor, since inserts and deletes tend to be focused around the cursor, so the gap rarely moves. ```java /** GapBuffer is a non-threadsafe EditBuffer that is optimized for * editing with a cursor, which tends to make a sequence of inserts * and deletes at the same place in the buffer. */ public class GapBuffer implements EditBuffer { private char[] a; private int gapStart; private int gapLength; // Rep invariant: // a != null // 0 <= gapStart <= a.length // 0 <= gapLength <= a.length - gapStart // Abstraction function: // represents the sequence a[0],...,a[gapStart-1], // a[gapStart+gapLength],...,a[length-1] ``` In a multiuser scenario, we'd want multiple gaps, one for each user's cursor, but we'll use a single gap for now. ## Steps to develop the datatype Recall our recipe for designing and implementing an ADT:

1. **Specify.** Define the operations (method signatures and specs). We did that in the `EditBuffer` interface. 2. **Test.** Develop test cases for the operations. See `EditBufferTest` in the provided code. The test suite includes a testing strategy based on partitioning the parameter space of the operations. 3. **Rep.** Choose a rep. We chose two of them for `EditBuffer`, and this is often a good idea: 1. **Implement a simple, brute-force rep first.** It's easier to write, you're more likely to get it right, and it will validate your test cases and your specification so you can fix problems in them before you move on to the harder implementation. This is why we implemented `SimpleBuffer` before moving on to `GapBuffer`. Don't throw away your simple version, either --- keep it around so that you have something to test and compare against in case things go wrong with the more complex one. 2. **Write down the rep invariant and abstraction function, and implement `checkRep()`.** `checkRep()` asserts the rep invariant at the end of every constructor, producer, and mutator method. (It's typically not necessary to call it at the end of an observer, since the rep hasn't changed.) In fact, assertions can be very useful for testing complex implementations, so it's not a bad idea to also assert the postcondition at the end of a complex method. You'll see an example of this in `GapBuffer.moveGap()` in the code with this reading.

In all these steps, we're working entirely single-threaded at first. Multithreaded clients should be in the back of our minds at all times while we're writing specs and choosing reps (we'll see later that careful choice of operations may be necessary to avoid race conditions in the clients of your datatype). But get it working, and thoroughly tested, in a sequential, single-threaded environment first. Now we're ready for the next step:

**Synchronize.** Make an argument that your rep is threadsafe. Write it down explicitly as a comment in your class, right by the rep invariant, so that a maintainer knows how you designed thread safety into the class.

This part of the reading is about how to do step 4. We already saw [how to make a thread safety argument](../../18-thread-safety/#how_to_make_a_safety_argument), but this time, we'll rely on synchronization in that argument. And then the extra step we hinted at above:

**Iterate**. You may find that your choice of operations makes it hard to write a threadsafe type with the guarantees clients require. You might discover this in step 1, or in step 2 when you write tests, or in steps 3 or 4 when you implement. If that's the case, go back and refine the set of operations your ADT provides.

## Locking Locks are so commonly-used that Java provides them as a built-in language feature. In Java, every object has a lock implicitly associated with it --- a `String`, an array, an `ArrayList`, and every class you create, all of their object instances have a lock. Even a humble `Object` has a lock, so bare `Object`s are often used for explicit locking: ```java Object lock = new Object(); ``` You can't call `acquire` and `release` on Java's intrinsic locks, however. Instead you use the **`synchronized`** statement to acquire the lock for the duration of a statement block: ```java synchronized (lock) { // thread blocks here until lock is free // now this thread has the lock balance = balance + 1; // exiting the block releases the lock } ``` Synchronized regions like this provide **mutual exclusion**: only one thread at a time can be in a synchronized region guarded by a given object's lock. In other words, you are back in sequential programming world, with only one thread running at a time, at least with respect to other synchronized regions that refer to the same object. ### Locks guard access to data Locks are used to **guard** a shared data variable, like the account balance shown here. If all accesses to a data variable are guarded (surrounded by a synchronized block) by the same lock object, then those accesses will be guaranteed to be atomic --- uninterrupted by other threads. Because every object in Java has a lock implicitly associated with it, you might think that simply owning an object's lock would prevent other threads from accessing that object. **That is not the case.** Acquiring the lock associated with object `obj` using ```java synchronized (obj) { ... } ``` in thread *t* does one thing and one thing only: prevents other threads from entering a `synchronized(obj)` block, until thread *t* finishes its synchronized block. That's it. Locks only provide mutual exclusion with other threads that acquire the same lock. All accesses to a data variable must be guarded by the same lock. You might guard an entire collection of variables behind a single lock, but all modules must agree on which lock they will all acquire and release. ## Monitor pattern When you are writing methods of a class, the most convenient lock is the object instance itself, i.e. `this`. As a simple approach, we can guard the entire rep of a class by wrapping all accesses to the rep inside `synchronized (this)`.


/** SimpleBuffer is a threadsafe EditBuffer with a simple rep. */
public class SimpleBuffer implements EditBuffer {
    private String text;
    ...
    public SimpleBuffer() {
        synchronized (this) {
            text = "";
            checkRep();
        }
    }
    public void insert(int pos, String ins) {
        synchronized (this) {
            text = text.substring(0, pos) + ins + text.substring(pos);
            checkRep();
        }
    }
    public void delete(int pos, int len) {
        synchronized (this) {
            text = text.substring(0, pos) + text.substring(pos+len);
            checkRep();
        }
    }
    public int length() {
        synchronized (this) {
            return text.length();
        }
    }
    public String toString() {
        synchronized (this) {
            return text;
        }
    }
}

Note the very careful discipline here. *Every* method that touches the rep must be guarded with the lock --- even apparently small and trivial ones like `length()` and `toString()`. This is because reads must be guarded as well as writes --- if reads are left unguarded, then they may be able to see the rep in a partially-modified state. This approach is called the **monitor pattern**. A monitor is a class whose methods are mutually exclusive, so that only one thread can be inside an instance of the class at a time. Java provides some syntactic sugar for the monitor pattern. If you add the keyword `synchronized` to a method signature, then Java will act as if you wrote `synchronized (this)` around the method body. So the code below is an equivalent way to implement the synchronized `SimpleBuffer`:


/** SimpleBuffer is a threadsafe EditBuffer with a simple rep. */
public class SimpleBuffer implements EditBuffer {
    private String text;
    ...
    public SimpleBuffer() {
        text = "";
        checkRep();
    }
    public synchronized void insert(int pos, String ins) {
        text = text.substring(0, pos) + ins + text.substring(pos);
        checkRep();
    }
    public synchronized void delete(int pos, int len) {
        text = text.substring(0, pos) + text.substring(pos+len);
        checkRep();
    }
    public synchronized int length() {
        return text.length();
    }
    public synchronized String toString() {
        return text;
    }
}

Notice that the `SimpleBuffer` constructor doesn't have a `synchronized` keyword. Java actually forbids it, syntactically, because an object under construction should be confined to a single thread until it has returned from its constructor. So synchronizing constructors should be unnecessary.

In the Java Tutorials, read: + [Synchronized Methods] (1 page) + [Intrinsic Locks and Synchronization] (1 page) [Synchronized Methods]: http://docs.oracle.com/javase/tutorial/essential/concurrency/syncmeth.html [Intrinsic Locks and Synchronization]: http://docs.oracle.com/javase/tutorial/essential/concurrency/locksync.html

mitx:95e324249b2a43488112784e7e9859f6 Synchronizing with locks ## Thread safety argument with synchronization Now that we're protecting `SimpleBuffer`'s rep with a lock, we can write a better thread safety argument: ```java /** SimpleBuffer is a threadsafe EditBuffer with a simple rep. */ public class SimpleBuffer implements EditBuffer { private String text; // Rep invariant: // text != null // Abstraction function: // represents the sequence text[0],...,text[text.length()-1] // Thread safety argument: // all accesses to text happen within SimpleBuffer methods, // which are all guarded by SimpleBuffer's lock ``` The same argument works for `GapBuffer`, if we use the monitor pattern to synchronize all its methods. Note that the encapsulation of the class, the absence of rep exposure, is very important for making this argument. If text were public: ```java public String text; ``` then clients outside `SimpleBuffer` would be able to read and write it without knowing that they should first acquire the lock, and `SimpleBuffer` would no longer be threadsafe. ### Locking discipline A locking discipline is a strategy for ensuring that synchronized code is threadsafe. We must satisfy two conditions: 1. Every shared mutable variable must be guarded by some lock. The data may not be read or written except inside a synchronized block that acquires that lock. 2. If an invariant involves multiple shared mutable variables (which might even be in different objects), then all the variables involved must be guarded by the *same* lock. Once a thread acquires the lock, the invariant must be reestablished before releasing the lock. The monitor pattern as used here satisfies both rules. All the shared mutable data in the rep --- which the rep invariant depends on --- are guarded by the same lock. ## Atomic operations Consider a find-and-replace operation on the `EditBuffer` datatype: ```java /** Modifies buf by replacing the first occurrence of s with t. * If s not found in buf, then has no effect. * @returns true if and only if a replacement was made */ public static boolean findReplace(EditBuffer buf, String s, String t) { int i = buf.toString().indexOf(s); if (i == -1) { return false; } buf.delete(i, s.length()); buf.insert(i, t); return true; } ``` This method makes three different calls to `buf` --- to convert it to a string in order to search for `s`, to delete the old text, and then to insert `t` in its place. Even though each of these calls individually is atomic, the `findReplace` method as a whole is not threadsafe, because other threads might mutate the buffer while `findReplace` is working, causing it to delete the wrong region or put the replacement back in the wrong place. To prevent this, `findReplace` needs to synchronize with all other clients of `buf`. ### Giving clients access to a lock It's sometimes useful to make your datatype's lock available to clients, so that they can use it to implement higher-level atomic operations using your datatype. So one approach to the problem with `findReplace` is to document that clients can use the `EditBuffer`'s lock to synchronize with each other:


/** An EditBuffer represents a threadsafe mutable string of characters
 *  in a text editor. Clients may synchronize with each other using the
 *  EditBuffer object itself. */
public interface EditBuffer {
   ...
}

And then `findReplace` can synchronize on `buf`:


public static boolean findReplace(EditBuffer buf, String s, String t) {
    synchronized (buf) {
        int i = buf.toString().indexOf(s);
        if (i == -1) {
            return false;
        }
        buf.delete(i, s.length());
        buf.insert(i, t);
        return true;
    }
}

The effect of this is to enlarge the synchronization region that the monitor pattern already put around the individual `toString`, `delete`, and `insert` methods, into a single atomic region that ensures that all three methods are executed without interference from other threads. ### Sprinkling `synchronized` everywhere? So is thread safety simply a matter of putting the `synchronized` keyword on every method in your program? Unfortunately not. First, you actually don't want to synchronize methods willy-nilly. Synchronization imposes a large cost on your program. Making a synchronized method call may take significantly longer, because of the need to acquire a lock (and flush caches and communicate with other processors). Java leaves many of its mutable datatypes unsynchronized by default exactly for these performance reasons. When you don't need synchronization, don't use it. Second, it's not sufficient. Dropping `synchronized` onto a method without thinking means that you're acquiring a lock without thinking about which lock it is, or about whether it's the right lock for guarding the shared data access you're about to do. Suppose we had tried to solve `findReplace`'s synchronization problem simply by dropping `synchronized` onto its declaration: ```java public static synchronized boolean findReplace(EditBuffer buf, ...) { ``` This wouldn't do what we want. It would indeed acquire a lock --- because `findReplace` is a static method, it would acquire a static lock for the whole class that `findReplace` happens to be in, rather than an instance object lock. As a result, only one thread could call `findReplace` at a time --- even if other threads want to operate on *different* buffers, which should be safe, they'd still be blocked until the single lock was free. So we'd suffer a significant loss in performance, because only one user of our massive multiuser editor would be allowed to do a find-and-replace at a time, even if they're all editing different documents. Worse, however, it wouldn't provide useful protection, because other code that touches the document probably wouldn't be acquiring the same lock. It wouldn't actually eliminate our race conditions. The `synchronized` keyword is not a panacea. Thread safety requires a discipline --- using confinement, immutability, or locks to protect shared data. And that discipline needs to be written down, or maintainers won't know what it is. ## Designing a datatype for concurrency `findReplace`'s problem can be interpreted another way: that the `EditBuffer` interface really isn't that friendly to multiple simultaneous clients. It relies on integer indexes to specify insert and delete locations, which are extremely brittle to other mutations. If somebody else inserts or deletes before the index position, then the index becomes invalid. So if we're designing a datatype specifically for use in a concurrent system, we need to think about providing operations that have better-defined semantics when they are interleaved. For example, it might be better to pair `EditBuffer` with a `Position` datatype representing a cursor position in the buffer, or even a `Selection` datatype representing a selected range. Once obtained, a `Position` could hold its location in the text against the wash of insertions and deletions around it, until the client was ready to use that `Position`. If some other thread deleted all the text around the `Position`, then the `Position` would be able to inform a subsequent client about what had happened (perhaps with an exception), and allow the client to decide what to do. These kinds of considerations come into play when designing a datatype for concurrency. As another example, consider the [`ConcurrentMap`](java:java/util/concurrent/ConcurrentMap) interface in Java. This interface extends the existing `Map` interface, adding a few key methods that are commonly needed as atomic operations on a shared mutable map, e.g.: + `map.putIfAbsent(key,value)` is an atomic version of `if ( ! map.containsKey(key)) map.put(key, value);` + `map.replace(key, value)` is an atomic version of `if (map.containsKey(key)) map.put(key, value);` ## Deadlock rears its ugly head The locking approach to thread safety is powerful, but (unlike confinement and immutability) it introduces blocking into the program. Threads must sometimes wait for other threads to get out of synchronized regions before they can proceed. And blocking raises the possibility of deadlock --- a very real risk, and frankly *far* more common in this setting than in message passing with blocking I/O (where we first mentioned it). With locking, deadlock happens when threads acquire multiple locks at the same time, and two threads end up blocked while holding locks that they are each waiting for the other to release. The monitor pattern unfortunately makes this fairly easy to do. Here's an example. Suppose we're modeling the social network of a series of books: ```java public class Wizard { private final String name; private final Set<Wizard> friends; // Rep invariant: // name, friends != null // friend links are bidirectional: // for all f in friends, f.friends contains this // Concurrency argument: // threadsafe by monitor pattern: all accesses to rep // are guarded by this object's lock public Wizard(String name) { this.name = name; this.friends = new HashSet<Wizard>(); } public synchronized boolean isFriendsWith(Wizard that) { return this.friends.contains(that); } public synchronized void friend(Wizard that) { if (friends.add(that)) { that.friend(this); } } public synchronized void defriend(Wizard that) { if (friends.remove(that)) { that.defriend(this); } } } ``` Like Facebook, this social network is bidirectional: if *x* is friends with *y*, then *y* is friends with *x*. The `friend()` and `defriend()` methods enforce that invariant by modifying the reps of both objects, which because they use the monitor pattern means acquiring the locks to both objects as well. Let's create a couple of wizards: ```java Wizard harry = new Wizard("Harry Potter"); Wizard snape = new Wizard("Severus Snape"); ``` And then think about what happens when two independent threads are repeatedly running: ```java // thread A // thread B harry.friend(snape); snape.friend(harry); harry.defriend(snape); snape.defriend(harry); ``` We will deadlock very rapidly. Here's why. Suppose thread A is about to execute `harry.friend(snape)`, and thread B is about to execute `snape.friend(harry)`. + Thread A acquires the lock on `harry` (because the friend method is synchronized). + Then thread B acquires the lock on `snape` (for the same reason). + They both update their individual reps independently, and then try to call `friend()` on the other object --- which requires them to acquire the lock on the other object. So A is holding Harry and waiting for Snape, and B is holding Snape and waiting for Harry. Both threads are stuck in `friend()`, so neither one will ever manage to exit the synchronized region and release the lock to the other. This is a classic deadly embrace. The program simply stops. The essence of the problem is acquiring multiple locks, and holding some of the locks while waiting for another lock to become free. Notice that it is possible for thread A and thread B to interleave such that deadlock does not occur: perhaps thread A acquires and releases both locks before thread B has enough time to acquire the first one. If the locks involved in a deadlock are also involved in a race condition --- and very often they are --- then the deadlock will be just as difficult to reproduce or debug. ### Deadlock solution 1: lock ordering One way to prevent deadlock is to put an ordering on the locks that need to be acquired simultaneously, and ensuring that all code acquires the locks in that order. In our social network example, we might always acquire the locks on the `Wizard` objects in alphabetical order by the wizard's name. Since thread A and thread B are both going to need the locks for Harry and Snape, they would both acquire them in that order: Harry's lock first, then Snape's. If thread A gets Harry's lock before B does, it will also get Snape's lock before B does, because B can't proceed until A releases Harry's lock again. The ordering on the locks forces an ordering on the threads acquiring them, so there's no way to produce a cycle in the waiting-for graph. Here's what the code might look like: ```java public void friend(Wizard that) { Wizard first, second; if (this.name.compareTo(that.name) < 0) { first = this; second = that; } else { first = that; second = this; } synchronized (first) { synchronized (second) { if (friends.add(that)) { that.friend(this); } } } } ``` (Note that the decision to order the locks alphabetically by the person's name would work fine for this book, but it wouldn't work in a real life social network. Why not? What would be better to use for lock ordering than the name?) Although lock ordering is useful (particularly in code like operating system kernels), it has a number of drawbacks in practice. + First, it's not modular --- the code has to know about all the locks in the system, or at least in its subsystem. + Second, it may be difficult or impossible for the code to know exactly which of those locks it will need before it even acquires the first one. It may need to do some computation to figure it out. Think about doing a depth-first search on the social network graph, for example --- how would you know which nodes need to be locked, before you've even started looking for them? ### Deadlock solution 2: coarse-grained locking A more common approach than lock ordering, particularly for application programming (as opposed to operating system or device driver programming), is to use coarser locking --- use a single lock to guard many object instances, or even a whole subsystem of a program. For example, we might have a single lock for an entire social network, and have all the operations on any of its constituent parts synchronize on that lock. In the code below, all `Wizard`s belong to a `Castle`, and we just use that `Castle` object's lock to synchronize: ```java public class Wizard { private final Castle castle; private final String name; private final Set<Wizard> friends; ... public void friend(Wizard that) { synchronized (castle) { if (this.friends.add(that)) { that.friend(this); } } } } ``` Coarse-grained locks can have a significant performance penalty. If you guard a large pile of mutable data with a single lock, then you're giving up the ability to access any of that data concurrently. In the worst case, having a single lock protecting everything, your program might be essentially sequential --- only one thread is allowed to make progress at a time. mitx:c18777a5a5ed46418af983e354b773e1 Deadlock ## Next: [Message Passing with Threads & Queues](../message-passing/)