Last time | Next time |
Consider the following problem: find all the local maxima and minima (and saddle points) of the function $f(x,y)=x^2+y^2+x^2y+4$.
As a polynomial, this has degree 3 (since we have an $x^2y$ term). This means that it is not restricted to the bowl or hyperbolic paraboloid of quadratic functions, but may have more interesting features. On the other hand, we should realize that, provided the function is twice differentiable at any point, it can be approximated by a "tangent bowl" there.
I've used the term "tangent bowl", but it could just as well be a "tangent saddle" (because that's another shape that a quadratic function of two variables can have). Let's take a look at these two:
So we expect the local behavior to have one of these two shapes (if we go beyond the tangent plane, to the tangent bowl/saddle/quadratic shape).
So how are we to determine the shape of $f(x,y)$, and discover the location of its extrema? The extrema at differentiable points will look like bowls or saddles, locally and generically.
There are other possibilities, however: for example
These were summarized well in a short note that my dad published in the American Mathematical Monthly, long, long ago. I'd almost forgotten it, but thought to recreate it using Mathematica.
The univariate case is handled by considering the places where the derivative is equal to 0, and a very similar situation arises in the multivariate case: the analogy is that the gradient will equal the 0 vector.
The argument is simple, given on p. 970: let's go over it. Once again we rely on our understanding of univariate functions to arrive at this conclusion.
For the function $f(x,y)$ given above, we will find that there were two saddles, and a minimum. The graph of $f$ suggests it (notice the contour plots):
Now, how do we know that these are the only critical points, their locations, etc.? We compute the gradient vector, and set it equal to zero. This gives us a pair of equations to solve simultaneously for the critical points:
Notice that this is a natural extension of the univariate second derivative test. If we know that we have a bowl or an umbrella, then all we have to do is take a cross-section through the bottom of the bowl or top of the umbrella, and apply the second derivative test to the univariate cross section. If the second derivative is positive, it's a bowl; if the second derivative is negative, it's an umbrella.