Last time | Next time |
We shouldn't be a stick in the mud about the orientation of our axes: why $x$ and $y$, and not some other pair of directions which are mutually perpendicular? Perhaps we are interested in the slope of the surface along some direction other than $x$ or $y$: hence the idea behind directional derivatives. At a given point at which a function is differentiable, one natural choice for two directions might be the direction in which the function is increasing fastest, and the direction perpendicular to this.
But any direction will do -- take a look at p. 957, Figure 3.
Our author's pretty good TEC animation of that figure.
This figure is partially what motivated me to create that "slicing" demo in Mathematica.
One of the biggest pieces of news is that we're going to be working with vectors -- e.g. $u$ -- and with vector-valued functions; and that will place another demand upon your visualization skills.
But gradients are just another kind of multivariate function: one where the domain is points in the plane, like many of our other functions, but the range is the set of vectors.
At each point in the plane there is a gradient vector pointing -- indicating what?
Are you comfortable with
Consider the following problem: find all the local maxima and minima (and saddle points) of the function $f(x,y)=x^2+y^2+x^2y+4$.
As a polynomial, this has degree 3 (since we have an $x^2y$ term). This means that it is not restricted to the bowl or hyperbolic paraboloid of quadratic functions, but may have more interesting features. On the other hand, we should realize that, provided the function is twice differentiable at any point, it can be approximated by a "tangent bowl" there.
I've used the term "tangent bowl", but it could just as well be a "tangent saddle" (because that's another shape that a quadratic function of two variables can have).
So we expect the local behavior to have one of these two shapes (if we go beyond the tangent plane, to the tangent bowl/saddle/quadratic shape).
So how are we to determine the shape of $f(x,y)$, and discover the location of its extrema? The extrema at differentiable points will look like bowls or saddles, locally and generically.
There are other possibilities, however: for example, one can get troughs.
These were summarized well in that short note that my dad published in the American Mathematical Monthly, long, long ago, which I tried to recreate using Mathematica.
The main point is that "generically" (that is, almost all of the time) we have either a bowl or a saddle.
The univariate case is handled by considering the places where the derivative is equal to 0, and a very similar situation arises in the multivariate case: the analogy is that the gradient will equal the 0 vector.
The argument is simple, given on p. 970 (Theorem 2): let's go over it. Once again we rely on our understanding of univariate functions to arrive at this conclusion.
For the function $f(x,y)$ given above, we will find that there were two saddles, and a minimum. The graph of $f$ suggests it.
Now, how do we know that these are the only critical points, their locations, etc.? We compute the gradient vector, and set it equal to zero. This gives us a pair of equations to solve simultaneously for the critical points: