Reading 19: Writing a Program with Abstract Data Types
Software in 6.031
Objectives
After today’s class, you should:
- Understand how to apply specification, testing, and iteration at different levels of software development: from single functions, to abstract data types, to entire programs
Introduction
Bringing together what we’ve learned about abstract data types and recursive data types, this reading demonstrates the development of a recursive ADT to solve a simple problem.
Recipes for programming
Recall the test-first programming approach for
Writing a function:
- Spec. Write the spec, including the function signature (name, argument types, return types, exceptions), and the precondition and the postcondition as a TypeDoc comment.
- Test.
Create systematic test cases that you can run automatically.
- You may have to go back and change your spec when you start to write test cases. Just the process of writing test cases puts pressure on your spec, because you’re thinking about how a client would call the function. So steps 1 and 2 iterate until you’ve got a better spec and some good test cases.
- Make sure at least some of your tests are failing at first. A test suite that passes all tests even when you haven’t implemented the function is not a good test suite for finding bugs.
- Implement.
Write the body of the function.
You’re done when the tests all pass.
- Implementing the function puts pressure on both the tests and the specs, and you may find bugs that you have to go back and fix. So finishing the function may require changing the implementation, the tests, and the specs, and bouncing back and forth among them.
Let’s broaden this to a recipe for
Writing an abstract data type:
- Spec. Write specs for the operations of the datatype, including method signatures, preconditions, and postconditions.
- Test.
Write test cases for the ADT’s operations.
- Again, this puts pressure on the spec. You may discover that you need operations you hadn’t anticipated, so you’ll have to add them to the spec.
- Implement.
For an ADT, this part expands to:
- Choose rep. Write down the private fields of a class, or the variants of a recursive datatype. Write down the rep invariant and abstraction function as a comment.
- Assert rep invariant.
Implement a
checkRep()
method that enforces the rep invariant. This is critically important if the rep invariant is nontrivial, because it will catch bugs much earlier. - Implement operations.
Write the method bodies of the operations, making sure to call
checkRep()
in them. You’re done when the tests all pass.
And let’s broaden it further to a recipe for
Writing a program (consisting of ADTs and functions):
- Choose datatypes. Decide which ones will be mutable and which immutable.
- Choose function. Write your top-level function and break it down into smaller steps.
- Spec. Spec out the ADTs and functions. Keep the ADT operations simple and few at first. Only add complex operations as you need them.
- Test. Write test cases for each unit (ADT or function).
- Implement simply first. Choose simple, brute-force representations. The point here is to put pressure on the specs and the tests, and try to pull your whole program together as soon as possible. Make the whole program work correctly first. Skip the advanced features for now. Skip performance optimization. Skip corner cases. Keep a to-do list of what you have to revisit.
- Iterate. Now that it’s all working, make it work better. Reimplement, optimize, redesign if necessary.
Problem: matrix multiplication
Suppose we want to represent matrix multiplications in a way that allows us to optimize the computation.
For example, if a, b are scalar constants, and X is a big matrix, then
is slow to compute because it loops over the matrix X twice: once to multiply it by a, and then again to multiply it by b. It’d be equivalent and cheaper to do:
That’s just one example of an optimization we could make by rearranging the products in a matrix multiplication. (Remember however that matrix multiplication is associative but not commutative; only scalars commute.)
Spec
Choose datatypes
Let’s call this a MatrixExpression
.
To make the definitions easier to read, we’ll abbreviate that to MatExpr
.
Test
Let’s test optimize()
.
Partitions:
- partition on number of scalars in expression: 0, 1, 2, >2
- partition on position of scalar in expression tree: left child, right child, left-child-of-left-child, left-child-of-right-child, right-child-of-left-child, right-child-of-right-child, more than 2 levels deep
Implement
Choose a rep
This problem is a natural one for a recursive datatype.
MatExpr = Identity + Scalar(value:number) + Matrix(array:number[][]) + Product(m1:MatExpr, m2:MatExpr)
/** Represents an immutable expression of matrix and scalar products. */
interface MatrixExpression {
// Datatype definition:
// MatExpr = Identity + Scalar(value:number)
// + Matrix(array:number[][]) + Product(m1:MatExpr, m2:MatExpr)
// ...
}
class Identity implements MatrixExpression {
public constructor() {
}
}
class Scalar implements MatrixExpression {
private readonly value: number;
// RI: true
// AF(value) = the real scalar represented by value
public constructor(value: number) {
this.value = value;
}
}
class Matrix implements MatrixExpression {
private readonly array: number[][];
// RI: array.length > 0, and all array[i] are equal nonzero length
// AF(array) = the matrix with array.length rows and array[0].length columns
// whose (row,column) entry is array[row][column]
public constructor(array: number[][]) {
this.array = array; // note: danger!
}
}
class Product implements MatrixExpression {
private readonly m1: MatrixExpression;
private readonly m2: MatrixExpression;
// RI: m1's column count == m2's row count, or m1 or m2 is scalar
// AF(m1, m2) = the matrix product m1*m2
public constructor(m1: MatrixExpression, m2: MatrixExpression) {
this.m1 = m1;
this.m2 = m2;
}
}
Choose an identity
It’s always good to have a value in the datatype that represents nothing, so that we can avoid using null
.
For a matrix product, the natural choice is the identity matrix — an empty product expression is just the identity anyway.
So let’s define that:
/** Identity for all matrix computations. */
const I: MatrixExpression = new Identity();
Unfortunately, we’ll see that this is not a perfect situation: other MatExprs might also be the identity.
Implementing make
: use factory functions
Let’s start implementing, starting with the make()
creators.
We don’t want to expose our concrete rep classes Scalar
, Matrix
, and Product
, so that clients won’t depend on them and we’ll be able to change them later (being ready for change).
So we’ll create factory functions to implement make()
:
/** @returns a matrix expression consisting of just the scalar value */
function makeScalar(value: number): MatrixExpression {
return new Scalar(value);
}
/** @returns a matrix expression consisting of just the matrix given */
function makeMatrix(array: number[][]): MatrixExpression {
return new Matrix(array);
}
Implementing isIdentity
: don’t use instanceof
Now let’s implement the isIdentity
observer.
Here’s a bad way to do it:
// don't do this
function isIdentity(m: MatrixExpression): boolean {
if (m instanceof Scalar) {
return m.value === 1;
} else if (m instanceof Matrix) {
// ... check for 1s on the diagonal and 0s everywhere else
} else ... // do the right thing for other variant classes
}
In general, using instanceof
in object-oriented programming is a bad smell.
Break the operation down into pieces that are appropriate for each class, and write instance methods instead:
reading exercises
class Matrix implements MatrixExpression {
private readonly array: number[][];
// RI: array.length > 0, and all array[i] are equal nonzero length
// AF(array) = the matrix with array.length rows and array[0].length columns
// whose (row,column) entry is array[row][column]
...
public isIdentity(): boolean {
if (this.array.length !== this.array[0].length) {
}
for (let row = 0; row < this.array.length; row++) {
for (let col = 0; col < this.array[row].length; col++) {
let expected = (row === col) ? 1 : 0;
if (this.array[row][col] !== expected) {
}
}
}
}
}
(missing explanation)
class Product implements MatrixExpression {
private readonly m1: MatrixExpression;
private readonly m2: MatrixExpression;
// RI: m1's column count === m2's row count, or m1 or m2 is scalar
// AF(m1, m2) = the matrix product m1*m2
...
public isIdentity(): boolean {
return this.m1.isIdentity() ❓ this.m2.isIdentity();
}
}
Pick the best operator to replace ❓:
(missing explanation)
Implementing isIdentity
exposes an issue that we should have first discovered by writing test cases: we will not always return true
for a Product
whose value is the identity matrix (e.g. A × A-1).
We probably want to resolve this by weakening the spec of isIdentity
, so that it doesn’t guarantee to identify every multiplicative identity:
Implementing optimize
without instanceof
Let’s implement optimize()
.
Again, here’s a bad way to do it, which will quickly get us mired in weeds:
// don't do this
class Product implements MatrixExpression {
public optimize(): MatrixExpression {
if (this.m1 instanceof Scalar) {
...
} else if (this.m2 instanceof Scalar) {
...
}
...
}
}
If you find yourself writing code with instanceof
checks all over the place, you need to take a step back and rethink the problem.
In particular, to optimize the scalars, we’re going to need two recursive helper operations, so we’ll add them to our abstract datatype:
These expressions will allow us to pull the scalars out of an expression and move them together in a single multiplication expression.
/** Represents an immutable expression of matrix and scalar products. */
interface MatrixExpression {
// ...
/** @returns the product of all the scalars in this expression */
scalars(): MatrixExpression;
/** @returns the product of all the matrices in this expression in order.
* times(scalars(), matrices()) is equivalent to this expression. */
matrices(): MatrixExpression;
}
Now we’ll implement them as expected:
class Identity implements MatrixExpression {
// no fields
...
public scalars(): MatrixExpression { return this; }
public matrices(): MatrixExpression { return this; }
}
class Scalar implements MatrixExpression {
private readonly value: number;
...
public scalars(): MatrixExpression { return this; }
public matrices(): MatrixExpression { return I; }
}
class Matrix implements MatrixExpression {
private readonly array: number[][];
...
public scalars(): MatrixExpression { return I; }
public matrices(): MatrixExpression { return this; }
}
class Product implements MatrixExpression {
private readonly m1: MatrixExpression;
private readonly m2: MatrixExpression;
...
public scalars(): MatrixExpression {
return times(this.m1.▶▶A◀◀, this.m2.▶▶B◀◀);
}
public matrices(): MatrixExpression {
return times(this.m1.▶▶C◀◀, this.m2.▶▶D◀◀);
}
}
Recall that I
is the constant we defined to represent the identity matrix.
The implementations of Scalar.matrices()
and Matrix.scalars()
return I
to represent an empty product.
We could also have returned new Identity()
here, but there is no need to create a new object when we already have an immutable constant I
.
reading exercises
With these helper functions, optimize()
can just separate the scalars and the matrices:
class Identity implements MatrixExpression {
...
public optimize(): MatrixExpression { return this; }
}
class Scalar implements MatrixExpression {
...
public optimize(): MatrixExpression { return this; }
}
class Matrix implements MatrixExpression {
...
public optimize(): MatrixExpression { return ▶▶A◀◀; }
}
class Product implements MatrixExpression {
...
public optimize(): MatrixExpression {
return times(▶▶B◀◀);
}
}