The Hessian

Prequisites: Matrices, Critical Points

Supposing we have a multi-variable function and that we have figured out its critical points; it would be nice to have a simple test to tell whether the critical points are minima, maxima, or saddle points. We find that, at least for two-variable function, that is functions of the form z=f(x,y), a fairly simple test does exist. To introduce this test, we first must define a structure called the Hessian Matrix.

Developed by Ludwig Hesse,a German mathematician, the Hessian Matrix defined for a n-variable function y= f(x₁, x,₂, ... x_n), is the n by n matrix H whose (i,j)-th entry is the function of the second-order partial derivative

,

a function which can be written in a more compact notation as f_{x_ix_j}. A technical point to notice is that the Hessian matrix is not symmetrical unless the partial drivatives f_{x_ix_j} are continuous. For two-variable functions, our Hessian matrix will be a 2 by 2 matrix.

Now, with all our tools in hand, let's state the test of a critical point of two variable function y= f(x₁,x,₂).

The Second Derivative Test:

If f(x₁,x,₂) has continuous (Why is this important?) second partial derivatives in a neighborhood of a critical point (a₁,a₂) definef a number D by

.

(Note that this is the determinant of f's Hessian Matrix.)

₁

₂

a maximum point if D>0 and f_x₁x₁<0
a minimum point if D>0 and f_x₁x₁>0
a saddle point if D<0.

Further, if D=0, then no conclusion can be drawn, and any of the behaviors described above can occur.

From the above expression for D, note that if D>0, f_x₁x₁ and f_x₂x₂ must have the same sign.
For an explanation and justification of the above criteria and expression for D, see Explanation of the Hessian (a work in progress).

Example:

Let's try this test on a function we've seen before, f(x,y)=x⁵+y⁴-5x-32y, which has critical points (1,2) and (-1,2). We compute

f_xx(x,y)=20x³
f_yy(x,y)=12y²
f_xy(x,y)=0

So D(x,y)=240x³y². (Of course we quickly note to ourselves that f has continuous second partial derivatives.) Clearly for the critical point (1,2) D>0 and f_xx>0 indicating (1,2) is a minimum point. On the other hand, for the critical point (-1,2) D<0 indicating (-1,2) is a saddle point. This matches our previous conclusion.

N-Variable Functions with N>2

A natural question to ask is whether this second derivative test for two variable functions is easily generalized to higher variable functions. And the answer is "No...well, not easily anyway." Actually the two variable case is a specific case of a more general theorem, but that theorem requires a knowledge of linear algebra to understand. To see it written out, check out pp. 311 of T. M. Apostol, Calculus (John Wiley & Sons, Inc., 1969).

Exercises:

Vector Calculus Index | World Web Math Main Page

thing@athena.mit.edu

Last modified 18 July 1997