Do we need Mean Value Theorem to prove f ' (x) = 0 on (a, b) implies that f = constant on (a, b)?

Ng Tze Beng

If we use the Mean Value Theorem here, then it is an immediate consequence of it. What does that mean? Basically that means the Mean Value Theorem does all the work for us. So how is the Mean Value Theorem proved? One proof involves the use of the Extreme Value Theorem. How is that proved? It involves the use of the completeness property of the real numbers. So we can ask the question: If we can define the notion of differentiability for a function from a non complete ordered field such as the set of rational numbers into itself, then does the Mean Value Theorem hold? We can obviously find examples of function from the rational numbers to the rational numbers where the Mean Value Theorem or Rolle's Theorem does not hold. An easy example would be a cubic polynomial function whose derived function is a quadratic with real non-rational roots, for instance f (x) = x 3 - 6x +1. Is there a function from the rational numbers or an appropriate subset of it to the rational numbers whose derived function is zero but  f   is non-constant? An appropriate subset would be an intersection of an non-empty open interval with the rational numbers. Think of the holes that the rational numbers have. An easy example would be a function f defined by  f (x ) = 1  for any rational number x > Ö2 and   f (x) = 2 for any rational number x < Ö2 .   f is not a constant function. Then the function f :Q ® Q is differentiable and f ' (x) = 0 for any rational number x . A more sophisticated example will be provided by g: (2, Ö2 )ÇQ ® Q where g(x) = 1/22 n +2 for x Î(Ö2/2 n + 1, Ö2/2 n)ÇQ or x Î(2/2 n, -Ö2/2 n + 1)ÇQ, n an integer ³ 0 and g(0) = 0. Then g is differentiable and g'(x) = 0 for all x in (2, Ö2 )ÇQ and g is not a constant function.

Theorem 1. f ' (x) = 0 on ( a, b) implies that f = constant on (a, b).

Now we prove the above using only the completeness property of the real numbers.

We assume b > a. The proof is by contradiction. Suppoes that f is not constant. Then there exist u, v in (a, b), u < v such that f (u) ¹ f (v). This means f (v) - f (u) ¹ 0. Then we shall make use of the difference quotient to deduce a contradiction. Suppose now that C > 0.

For now let us suppose that (not assuming anything on C)

f (v)- f (u) = C( v - u). -------------------------- (*)

We are going to bisect the inteval [u, v], pick the next interval from this bisection and continue bisecting in like manner.

Take the mid point of [u, v]. Then either

f (v) - f (w) ³ C( v - w) --------------------- -------- (1)

or f (w) - f (u) ³ C( w- u). ------------------------------(2)

This is because if both (1) and (2) do not hold, then we would have

f (v) - f (w) < C( v - w) and  f (w) - f (u) < C( w- u)

which would imply that f (v) - f (u) < C( v - u) contradicting (*).

If (1) holds, then we name u1 = w and v1 = v. If (1) does not hold we name u1 = u and v1 = w.

Let k = (v - u). Then |v1 - u1| = k/2 and

f (v1) - f (u1) ³ C( v1 - u1 ). -------------------------- (*1)

Obviously, [u1, v1] Ì [u, v], u £ u1 < v1 £ v, |u1 - u| £ |v - u|/2= k/2 and |v - v1| £ |v - u|/2= k/2 . We next take the mid point of [u1, v1]. Then we shall have either

f (v1) - f (w1) ³ C( v1 - w1) --------------------- -------- (3)

or f (w1) - f (u1) ³ C( w1 - u1 ). ------------------------------(4)

Again this is because if both (3) and (4) do not hold then we would have f (v1) - f (w1) < C( v1 - w1) and f (w1) - f (u1) < C( w1 - u1 ) implying f (v1) - f (u1) < C( v1 - u1 ) thus contradicting (*1).

If (3) holds, then we name u2 = w1 and v2 = v1. If (3) does not hold we name u2 = u1 and v2 = w1 . Then |v2 - u2 | = k/22,

f (v2) - f (u2) ³ C( v2 - u2 ). -------------------------- (*2)

Obviously, [u2, v2] Ì[u1, v1], u1 £ u2 < v2 £ v1, |u2 - u1 | £ |v1 - u1|/2= k/22 and |v1 - v2| £ |v1 - u1 |/2= k/22.

In this way we obtained a nested sequence

Ì [u n, v n]Ì ¼ ¼ Ì [u2, v2] Ì [u1, v1]Ì [u, v]

with the length of the interval [u n, v n] = approaches 0 as n tends to infinity;

an increasing sequence (not necessarily strictly increasing)

u1 £ u2 £ u3 £ ¼ ¼ £ u n £ ¼

satisfying, for all n, u n < v n £ v, |u n - u n-1 | £ k/2 n ------------------ (5)

and a decreasing sequence (not necessarily strictly decreasing)

v1 ³ v2 ³ v3 ³ ¼ ¼ ³ v n ³ ¼

satisfying, for all n, u £ u n < v n, |v n - v n-1 | £ k/2 n ------------------ (6)

and

   f (v n) - f (u n) ³ C( v n - u n ). -------------------------- (*n)

Now we have a choice to proceed. We can use the Weierstrass characterization of completeness to conclude that the nested sequence {[u n, v n]}n must have a unique intersection. i.e., there is exactly one point x that belongs to [u n, v n] for all n.

We can also note that the sequence or set {u n} is bounded above by v by (5). Therefore, by the completeness property of the real numbers, {u n} has a least upper bouned or supremum in R also denoted by x, i.e. x = sup {u n}. Also by the completeness property of the real numbers since the sequence {v n} is bounded below by u by (6) it has a greatest lower bound or infimum in R denoted by y, that is, y = inf{v n}.

We claim that x = y. From (5) any v n is an upper bound for {u n}. Hence x = sup {u n}£ v n for each n. Therefore, x is a lower bound for {v n} and so x £ y = inf{v n}. Can x be bigger than y? Suppose x > y. Then since x = sup {u n}, there exists a u j such that y < u j . But since y = inf{v n} and u j < v n for all n, u j £ y = inf{v n}. This contradicts y < u j . Hence x = y. (It is obvious that x cannot be strictly less than  y.  Observe this as follows.  For all n, un £ x £  y  £ vn. If x < y, then there exists an integer n such that (v n - u n ) = (v - u) / 2 n < (y - x).  This is not possible since (y - x) £ (v n - u n ).) 

In particular, we have u n £ x £ v n for all n. That is the same as saying x Î [u n, v n ] for all n.

Next we shall show that f ' (x) ³ C. That is .

If on the contrary , then there exists a d > 0 such that for all y with 0 < | y - x| < d we have

--------------------------- (A).

If we can show that for any d > 0, we can find a x d such that 0 < |x d - x| < d but . Then no d > 0 can exist so that (A) holds and so we can conclude that f ' (x) ³ C. We shall now proceed to do just that.

For any d > 0, x - d < x = sup {u n} and so there exists integer N such that x - d < uN £ x. Likewise using the fact that x = inf{v n}, there exists an integer M such that x £ v M < x + d. Let K = max(N, M). Then we have x - d < uN £ u K £ x £ v K £ v M < x +d and u K < v K . This means that both u K and v K lie in the interval (x - d, x +d). If x = u K , then let x d = v K .

If x = v K , then let x d = u K. In either case by (*K). If u K < x < v K , then as in the beginning of the proof one of the following must be true:

f (v K )- f (x) ³ C( v K - x ) -------------- ( 7)

f (x)- f (u K) ³ C(x - u K ) ---------------- (8).

This is because if both (7) and (8) do not hold, we would then get f (v K )- f (x) < C( v K - x ) and f (x)- f (u K) < C(x - u K ) implying that f (v K) - f (u K) < C( v K - u K ) contradicting (*K). If (7) holds, then we let x d = v K and if (8) holds we let x d = u K. We then have

----------------- (9).

Hence we conclude that if C > 0 this would give us f ' (x) ³ C > 0 thus contradicting f ' (x) = 0. Thus C £ 0.

Suppose C < 0. We can either apply the above argument with the inequality "³" replaced by "£" throughout or we can consider using the function g = - f . We can rewrite (*) as

- f (v) - (- f (u)) = - C ( v - u).

That is

g(v) - g (u) = (-C)( v - u). -------------------------- (**)

Now -C > 0 and so (**) is similar to (*) and so we can concude that we can find an x in [u, v] Í (a, b) such that g' (x) = - f ' (x) ³ -C , that is f ' (x) £ C < 0 thus contradicting f ' (x) = 0. Therefore, C = 0. and so f must be a constant function.

Note that we have actually proved the following result:

Theorem 2: If f :[a, b] ® R is differentiable, then for any u, v in [a, b] with u < v,

there exists a point x in [u, v] such that .

Reversing the inequality "³" by "£" throughout, starting with (1) and (2) we would obtain the following:

Theorem 2': If f :[a, b] ® R is differentiable,  then for any u, v in [a, b] with u < v

there exists a point x in [u, v] such that .

Theorem 3: If f ' (x) < 0 on (a, b), then f is decreasing on (a, b).

Proof.  Take any u, v in (a, b) with u < v, then by Theorem 2, there exists a point x in [u, v] such that Hence f (v) - f (u) <0 and so f (v) < f (u). That means f is decreasing on (a, b).

Theorm 4 (Weak Mean Value Theorem).  If m £ f ' (x) £ M on [a, b], then for any u, v in [a, b] with u < v,

m( v - u)£ f (v) - f (u) £ M( v - u).

Proof. By Theorem 2, f (v)- f (u) £ f ' (y)( v - u) for some y in [u, v] and so f (v) - f (u) £ M( v - u). By Theorem 2', there is a point y in [u, v] such that f (v)- f (u) ³ f ' (y)( v - u) ³ m( v - u). Therefore, m( v - u)£ f (v) - f (u) £ M( v - u).

ã Ng Tze Beng