Proving gradient points in the direction of maximum increase

Click For Summary
The discussion centers on proving that the gradient of a function points in the direction of maximum increase. The gradient is defined as the vector of partial derivatives, which indicates the rate of change of a function with respect to its variables. The relationship between the gradient and directional derivatives is explored, emphasizing that the maximum rate of increase occurs when the angle between the gradient vector and the direction vector is zero. This leads to the conclusion that the gradient indeed points in the direction of maximum increase. Clarification on the definition of the gradient is noted as crucial for a complete understanding of the proof.
hivesaeed4
Messages
217
Reaction score
0
How do we prove that the gradient points in the direction of the maximum increase? Would it be enough to simply state that the gradient is just the derivates of a function w.r.t all the variables a function depends upon. Since the derivative of a term w.r.t a certain variable gives the maximum increase of that term and since in gradient not only do we account for all possible terms and all the variables upon which a term depends upon but also includes the direction, so it is logical to conclude that the gradient points in the direction of the maximum increase.

Note: I do hope that the above proof is adequate but somehow feel that the proof has to be mathematical. Could someone tell me whether or not my intuition is correct?
 
Physics news on Phys.org
"Since the derivative of a term w.r.t a certain variable gives the maximum increase of that term ".
Does it?
 
Sorry. It gives the rate of change of a term.

Then how do I prove that the gradient points in the direction of the maximum increase?
 
Have you learned about directional derivatives yet?
 
Ok:
I just saw that you DO know about directional derivatives, on basis of another thread posted by you!

Now, you now know that the directional derivative is the dot/scalar product between the gradient vector and the direction vector.

Can you rewrite a dot/scalar product between two vectors in another way, a way involving the magintudesof the two vectors?
 
Yes. But that would involve the cos of the angle between the 2 vectors. But how does that prove that the gradient gives the us maximum rate of increase?
 
I haven't done this but as a suggestion.

Make the directional derivative a function of theta and maximize it.
 
hivesaeed4 said:
Yes. But that would involve the cos of the angle between the 2 vectors. But how does that prove that the gradient gives the us maximum rate of increase?
For what angle between the vectors does the cosine function receive its maximum value?
 
zero.
 
  • #10
hivesaeed4 said:
zero.

Indeed!

And, therefore:
What angle between the direction vector and the gradient vector maximizes the dot product between them?
 
  • #11
zero?
 
  • #12
Right!

And, if two vectors have zero angle between them, then they are parallell vectors, meaning that the direction of maximal increase is in the direction of the...?
 
  • #13
In the direction of both vectors.
 
  • #14
So is that the required proof?
 
  • #15
hivesaeed4 said:
In the direction of both vectors.
Yes.
In particular, in the direction of the gradient.
 
  • #16
hivesaeed4 said:
So is that the required proof?

What do you think is lacking/unsatisfactory?
 
  • #17
the answer depends on your definition of the gradient. if you think the gradient is the vector that dots with a direction vector to give the directional derivative, then the previous explanation is all you need.

I you think the gradient is a vector whose entries are the partial derivatives, then you need to make sense out of that a bit more. i.e. you need the chain rule to get the previous statement.
 
  • #18
mathwonk said:
the answer depends on your definition of the gradient. if you think the gradient is the vector that dots with a direction vector to give the directional derivative, then the previous explanation is all you need.

I you think the gradient is a vector whose entries are the partial derivatives, then you need to make sense out of that a bit more. i.e. you need the chain rule to get the previous statement.

That is why I asked OP what, if anything, he found lacking.

However, since he had already posted other threads in which he showed familiarity with the concept of the directional derivative, I assumed he was also familiar with how that is derived.
Therefore, I did not address that point explicitly.
 

Similar threads

  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 3 ·
Replies
3
Views
12K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 22 ·
Replies
22
Views
5K
Replies
14
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 21 ·
Replies
21
Views
4K