Chapter 7
Static optimization

1 INTRODUCTION

Static optimization theory is concerned with finding those points (if any) at which a real‐valued function ϕ, defined on a subset S of ℝⁿ, has a minimum or a maximum. Two types of problems will be investigated in this chapter:

  1. Unconstrained optimization (Sections 7.2–7.10) is concerned with the problem
    minimize ϕ(x), x ∈ S,
    where the point at which the extremum occurs is an interior point of S.
  2. Optimization subject to constraints (Sections 7.11–7.16) is concerned with the problem of optimizing ϕ subject to m nonlinear equality constraints, say g1(x) = 0, …, gm(x) = 0. Letting g = (g1, g2, …, gm)′ and
    Γ = {x : x ∈ S, g(x) = 0},
    the problem can be written as
    minimize ϕ(x) subject to g(x) = 0,
    or, equivalently, as
    minimize ϕ(x), x ∈ Γ.

We shall not deal with inequality constraints.

2 UNCONSTRAINED OPTIMIZATION

In Sections 7.2–7.10, we wish to show how the one‐dimensional theory of maxima and minima of differentiable functions generalizes to functions of more than one variable. We start with some definitions.

Let ϕ : S → ℝ be a real‐valued function defined on a set S in ℝⁿ, and let c be a point of S. We say that ϕ has a local minimum at c if there exists an n‐ball B(c) such that

ϕ(x) ≥ ϕ(c) for all x ∈ S ∩ B(c);

ϕ has a strict local minimum at c if we can choose B(c) such that

ϕ(x) > ϕ(c) for all x ∈ S ∩ B(c), x ≠ c;

ϕ has an absolute minimum at c if

ϕ(x) ≥ ϕ(c) for all x ∈ S;

ϕ has a strict absolute minimum at c if

ϕ(x) > ϕ(c) for all x ∈ S, x ≠ c.

The point c at which the minimum is attained is called a (strict) local minimum point for ϕ, or a (strict) absolute minimum point for ϕ on S, depending on the nature of the minimum.

If ϕ has a minimum at c, then the function −ϕ has a maximum at c. Each maximization problem can thus be converted to a minimization problem (and vice versa). For this reason, we lose no generality by treating minimization problems only.

If c is an interior point of S, and ϕ is differentiable at c, then we say that c is a critical point (stationary point) of ϕ if

dϕ(c; u) = 0 for every u in ℝⁿ.

The function value ϕ(c) is then called the critical value of ϕ at c.

A critical point is called a saddle point if every n‐ball B(c) contains points x such that ϕ(x) > ϕ(c) and other points such that ϕ(x) < ϕ(c). In other words, a saddle point is a critical point which is neither a local minimum point nor a local maximum point. Figure 7.1 illustrates some of these concepts. The function ϕ is defined and continuous on [0, 5]. It has a strict absolute minimum at x = 0 and an absolute maximum (not strict if you look carefully) at x = 1. There are strict local minima at x = 2 and x = 5, and a strict local maximum at x = 3. At x = 4, the derivative ϕ′ is zero, but this is not an extremum point of ϕ; it is a saddle point.


Figure 7.1 Unconstrained optimization in one variable

3 THE EXISTENCE OF ABSOLUTE EXTREMA

In the example of Figure 7.1, the function ϕ is continuous on the compact interval [0, 5] and has an absolute minimum (at x = 0) and an absolute maximum (at x = 1). That this is typical for continuous functions on compact sets is shown by the following fundamental result, the Weierstrass theorem: every real‐valued function that is continuous on a compact set in ℝⁿ attains an absolute minimum and an absolute maximum on that set.
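A small numerical illustration of this (a hypothetical continuous function; assuming Python with NumPy is available): evaluating the function on a fine grid over the compact interval, the grid minimizer and maximizer approximate the absolute extrema whose existence the Weierstrass theorem guarantees.

```python
import numpy as np

# A hypothetical continuous function on the compact interval [0, 5];
# by the Weierstrass theorem it must attain an absolute minimum and maximum.
def phi(x):
    return np.sin(2 * x) + 0.3 * x

x = np.linspace(0.0, 5.0, 100_001)       # fine grid over the compact set
vals = phi(x)

print("approximate absolute minimum:", vals.min(), "near x =", x[vals.argmin()])
print("approximate absolute maximum:", vals.max(), "near x =", x[vals.argmax()])
```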

Exercises

  1. The Weierstrass theorem is not, in general, correct if we drop any of the conditions, as the following three counterexamples demonstrate.
    1. ϕ(x) = x, x ∈ (−1, 1), ϕ(−1) = ϕ(1) = 0,
    2. ϕ(x) = x, x ∈ (−∞, ∞),
    3. ϕ(x) = x/(1 − |x|), x ∈ (−1, 1).
  2. Consider the real‐valued function ϕ : (0, ∞) → ℝ defined by
    equation

    The set (0, ∞) is neither bounded nor closed, and the function ϕ is not continuous on (0, ∞). Nevertheless, ϕ attains its maximum on (0, ∞). This shows that none of the conditions of the Weierstrass theorem are necessary.

4 NECESSARY CONDITIONS FOR A LOCAL MINIMUM

In the one‐dimensional case, if a real‐valued function ϕ, defined on an interval (a, b), has a local minimum at an interior point c of (a, b), and if ϕ has a derivative at c, then ϕ′(c) must be zero. This result, which relates zero derivatives and local extrema at interior points, can be generalized to the multivariable case as follows: if ϕ : S → ℝ is differentiable at an interior point c of S and has a local minimum (or maximum) at c, then c is a critical point of ϕ, that is, dϕ(c; u) = 0 for every u in ℝⁿ (Theorem 7.2).
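In coordinates, the condition says that every first‐order partial derivative of ϕ must vanish at an interior local extremum. A minimal computational sketch (a hypothetical function, not one of the exercises below; assuming SymPy is available):

```python
import sympy as sp

x, y = sp.symbols('x y', real=True)
# Hypothetical smooth function; interior extrema can only occur where
# both partial derivatives vanish (the necessary condition above).
phi = (x - 1)**2 + x*y + 2*(y + 2)**2

grad = [sp.diff(phi, v) for v in (x, y)]
critical_points = sp.solve(grad, [x, y], dict=True)
print(critical_points)   # the only candidates for an interior extremum
```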

Exercises

  1. Find the extreme value(s) of the following real‐valued functions defined on ℝ² and determine whether they are minima or maxima:
    1. ϕ(x, y) = x² + xy + 2y² + 3,
    2. ϕ(x, y) = −x² + xy − y² + 2x + y,
    3. ϕ(x, y) = (xy + 1)².
  2. Answer the same questions as above for the following real‐valued functions defined for 0 ≤ x ≤ 2, 0 ≤ y ≤ 1:
    1. ϕ(x, y) = x³ + 8y³ − 9xy + 1,
    2. images

5 SUFFICIENT CONDITIONS FOR A LOCAL MINIMUM: FIRST‐DERIVATIVE TEST

In the one‐dimensional case, a sufficient condition for a differentiable function ϕ to have a minimum at an interior point c is that ϕ′(c) = 0 and that there exists an interval (a, b) containing c such that ϕ′(x) < 0 in (a, c) and ϕ′(x) > 0 in (c, b). (These conditions are not necessary, see Exercise 1 below.)
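A standard one‐variable illustration of the test (not taken from this chapter's examples): for ϕ(x) = x² we have

```latex
\phi'(0) = 0, \qquad \phi'(x) = 2x < 0 \ \text{on } (-1, 0), \qquad \phi'(x) = 2x > 0 \ \text{on } (0, 1),
```

so the first‐derivative test confirms the (obvious) strict local minimum at x = 0.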

The multivariable generalization is as follows.

In the example accompanying Theorem 7.3, the function ϕ is strictly convex on ℝⁿ, so that the condition of Theorem 7.3 is automatically fulfilled. We shall explore this in more detail in Section 7.7.

Exercises

  1. Consider the function ϕ(x) = x²[2 + sin(1/x)] when x ≠ 0 and ϕ(0) = 0. The function ϕ clearly has an absolute minimum at x = 0. Show that the derivative is ϕ′(x) = 4x + 2x sin(1/x) − cos(1/x) when x ≠ 0 and ϕ′(0) = 0. Show further that we can find values of x arbitrarily close to the origin such that ϕ′(x) < 0. Conclude that the converse of Theorem 7.3 is, in general, not true. (A numerical check is sketched after this list.)
  3. Consider the function ϕ : ℝ² → ℝ given by ϕ(x, y) = x² + (1 + x)³y². Prove that it has one local minimum (at the origin), no other critical points, and no absolute minimum.
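A minimal numerical check of Exercise 1, as referenced there (assuming NumPy): at the points x = 1/(2πk) we have sin(1/x) = 0 and cos(1/x) = 1, so ϕ′(x) = 4x − 1 < 0 once x < 1/4.

```python
import numpy as np

def dphi(x):
    # derivative of phi(x) = x**2 * (2 + sin(1/x)) for x != 0, as in Exercise 1
    return 4*x + 2*x*np.sin(1/x) - np.cos(1/x)

# At x = 1/(2*pi*k), sin(1/x) = 0 and cos(1/x) = 1, so dphi(x) = 4x - 1 < 0:
# the derivative is negative at points arbitrarily close to the minimum at 0.
for k in (1, 10, 100, 1000):
    x = 1.0 / (2 * np.pi * k)
    print(x, dphi(x))
```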

6 SUFFICIENT CONDITIONS FOR A LOCAL MINIMUM: SECOND‐DERIVATIVE TEST

Another test for local extrema is based on the Hessian matrix.

In other words, Theorem 7.4 tells us that the conditions

dϕ(c; u) = 0 for every u in ℝⁿ

and

(4)  Hϕ(c) is positive definite

together are sufficient for ϕ to have a strict local minimum at c. If we replace (4) by the condition that Hϕ(c) is negative definite, then we obtain sufficient conditions for a strict local maximum.

If the Hessian matrix Hϕ(c) is neither positive definite nor negative definite, but is nonsingular, then c cannot be a local extremum point (see Theorem 7.2); thus c is a saddle point.

In the case where Hϕ(c) is singular, we cannot tell whether c is a maximum point, a minimum point, or a saddle point (see Exercise 3 below). This shows that the converse of Theorem 7.4 is not true.
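The following SymPy sketch applies this classification to a hypothetical function (not one of the exercise functions below): it locates the critical points and inspects the Hessian at each.

```python
import sympy as sp

x, y = sp.symbols('x y', real=True)
phi = x**3 - 3*x + y**2                      # hypothetical function

grad = [sp.diff(phi, v) for v in (x, y)]
H = sp.hessian(phi, (x, y))

for pt in sp.solve(grad, [x, y], dict=True):
    Hc = H.subs(pt)
    if Hc.is_positive_definite:
        label = "strict local minimum"
    elif (-Hc).is_positive_definite:
        label = "strict local maximum"
    elif Hc.det() != 0:
        label = "saddle point (nonsingular, indefinite Hessian)"
    else:
        label = "second-derivative test inconclusive (singular Hessian)"
    print(pt, label)
# (1, 0) is a strict local minimum; (-1, 0) is a saddle point.
```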

Exercises

  1. Show that the function ϕ : ℝ² → ℝ defined by ϕ(x, y) = x⁴ + y⁴ − 2(x − y)² has strict local minima at (√2, −√2) and (−√2, √2), and a saddle point at (0, 0).
  2. Show that the function ϕ : ℝ² → ℝ defined by ϕ(x, y) = (y − x²)(y − 2x²) has a local minimum along each straight line through the origin, but that ϕ has no local minimum at the origin. In fact, the origin is a saddle point.
  3. Consider the functions: (i) ϕ(x, y) = x⁴ + y⁴, (ii) ϕ(x, y) = −x⁴ − y⁴, and (iii) ϕ(x, y) = x³ + y³. For each of these functions, show that the origin is a critical point and that the Hessian matrix is singular at the origin. Then prove that the origin is a minimum point, a maximum point, and a saddle point, respectively.
  4. Show that the function ϕ : ℝ³ → ℝ defined by ϕ(x, y, z) = xy + yz + zx has a saddle point at the origin, and no other critical points.
  5. Consider the function ϕ : ℝ² → ℝ defined by ϕ(x, y) = x³ − 3xy² + y⁴. Find the critical points of ϕ and show that ϕ has two strict local minima and one saddle point.

7 CHARACTERIZATION OF DIFFERENTIABLE CONVEX FUNCTIONS

So far, we have dealt only with local extrema. However, in the optimization problems that arise in economics (among other disciplines) we are usually interested in finding absolute extrema. The importance of convex (and concave) functions in optimization comes from the fact that every local minimum (maximum) of such a function is an absolute minimum (maximum). Before we prove this statement (Theorem 7.8), let us study convex (concave) functions in some more detail.

Recall that a set S in ℝⁿ is convex if for all x, y in S and all λ ∈ (0, 1),

λx + (1 − λ)y ∈ S,

and a real‐valued function ϕ, defined on a convex set S in ℝⁿ, is convex if for all x, y ∈ S and all λ ∈ (0, 1),

(5)  ϕ(λx + (1 − λ)y) ≤ λϕ(x) + (1 − λ)ϕ(y).

If (5) is satisfied with strict inequality for x ≠ y, then we call ϕ strictly convex. If ϕ is (strictly) convex, then −ϕ is (strictly) concave.

In this section, we consider (strictly) convex functions that are differentiable, but not necessarily twice differentiable. In the next section, we consider twice differentiable convex functions.

We first show that ϕ is convex if and only if at any point the tangent hyperplane is below the graph of ϕ (or coincides with it).

Another characterization of differentiable functions exploits the fact that, in the one‐dimensional case, the first derivative of a convex function is monotonically nondecreasing. The generalization of this property to the multivariable case is contained in Theorem 7.6.
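In symbols, the two characterizations just described amount, in essence, to the following pair of conditions (a paraphrase; the precise statements are those of Theorems 7.5 and 7.6):

```latex
\phi(y) \ge \phi(x) + \mathrm{d}\phi(x;\, y - x) \quad \text{for all } x, y \in S
\qquad \text{(tangent hyperplane below the graph),}
\\[4pt]
\bigl(\mathrm{D}\phi(y) - \mathrm{D}\phi(x)\bigr)(y - x) \ge 0 \quad \text{for all } x, y \in S
\qquad \text{(monotone gradient).}
```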

Exercises

  1. Show that the function ϕ(x, y) = x + y(y − 1) is convex. Is ϕ strictly convex?
  2. Prove that ϕ(x) = x⁴ is strictly convex.

8 CHARACTERIZATION OF TWICE DIFFERENTIABLE CONVEX FUNCTIONS

Both characterizations of differentiable convex functions (Theorems 7.5 and 7.6) involve conditions on two points. For twice differentiable functions, there is a characterization that involves only one point.
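The one‐point condition in question involves the Hessian matrix: for a twice differentiable function on an open convex set, convexity is equivalent to the Hessian being positive semidefinite at every point of the set. A minimal check for a hypothetical quadratic (assuming SymPy):

```python
import sympy as sp

x, y = sp.symbols('x y', real=True)
phi = x**2 + x*y + 2*y**2                 # hypothetical quadratic function

H = sp.hessian(phi, (x, y))               # here the Hessian is constant
print(H)                                  # Matrix([[2, 1], [1, 4]])
print(H.is_positive_definite)             # True, so phi is (strictly) convex on R^2
```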

Exercises

  1. Repeat Exercise 1 in Section 4.9 using Theorem 7.7.
  2. Show that the function ϕ(x) = xᵖ, p > 1, is strictly convex on [0, ∞).
  3. Show that the function ϕ(x) = x′x, defined on ℝⁿ, is strictly convex.
  4. Consider the CES (constant elasticity of substitution) production function
    equation

    defined for x > 0 and y > 0. Show that ϕ is convex if ρ ≤ −1, and concave if ρ ≥ −1 (and ρ ≠ 0). What happens if ρ = −1?

9 SUFFICIENT CONDITIONS FOR AN ABSOLUTE MINIMUM

The convexity (concavity) of a function enables us to find the absolute minimum (maximum) of the function, since every local minimum (maximum) of such a function is an absolute minimum (maximum).

To check whether a given differentiable function is (strictly) convex, we thus have four criteria at our disposal: the definition in Section 4.9, Theorems 7.5 and 7.6, and, if the function is twice differentiable, Theorem 7.7.

Exercises

  1. Let a be an n × 1 vector and A a positive definite n × n matrix. Show that
    a′x + x′Ax ≥ −(1/4)a′A⁻¹a

    for every x in ℝⁿ. For which value of x does the function ϕ(x) = a′x + x′Ax attain its minimum value?

  2. (More difficult.) If A is positive semidefinite and AA⁺a = a, show that
    a′x + x′Ax ≥ −(1/4)a′A⁺a

    for every x in ℝⁿ. What happens when AA⁺a ≠ a?

10 MONOTONIC TRANSFORMATIONS

To complete our discussion of unconstrained optimization, we shall prove the useful, if simple, fact that minimizing a function is equivalent to minimizing a monotonically increasing transformation of that function.
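For instance, since the logarithm is strictly increasing, maximizing a positive function L over any set is the same as maximizing log L over that set:

```latex
\operatorname*{arg\,max}_{\theta}\, L(\theta) \;=\; \operatorname*{arg\,max}_{\theta}\, \log L(\theta)
\qquad (L(\theta) > 0),
```

which is the familiar passage from a likelihood to a log‐likelihood used in the exercise below.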

Exercise

  1. Consider the likelihood function
    L(μ, σ²) = (2πσ²)^(−n/2) exp{−(1/(2σ²)) Σi (xi − μ)²}

    of a random sample x1, …, xn. Use Theorem 7.9 to maximize L with respect to μ and σ².

11 OPTIMIZATION SUBJECT TO CONSTRAINTS

Let ϕ : S → ℝ be a real‐valued function defined on a set S in ℝⁿ. Hitherto we have considered optimization problems of the type

minimize ϕ(x), x ∈ S.

It may happen, however, that the variables x1, …, xn are subject to certain constraints, say g1(x) = 0, …, gm(x) = 0. Our problem is now

minimize ϕ(x) subject to g(x) = 0,

where g : S → ℝᵐ is the vector function g = (g1, g2, …, gm)′. This is known as a constrained minimization problem (or a minimization problem subject to equality constraints), and the most convenient way of solving it is, in general, to use the Lagrange multiplier theory. In the remainder of this chapter, we shall study this important theory in some detail.

We start our discussion with some definitions. The subset of S on which g vanishes, that is,

Γ = {x : x ∈ S, g(x) = 0},

is known as the opportunity set (constraint set). Let c be a point of Γ. We say that ϕ has a local minimum at c under the constraint g(x) = 0 if there exists an n‐ball B(c) such that

ϕ(x) ≥ ϕ(c) for all x ∈ Γ ∩ B(c);

ϕ has a strict local minimum at c under the constraint g(x) = 0 if we can choose B(c) such that

ϕ(x) > ϕ(c) for all x ∈ Γ ∩ B(c), x ≠ c;

ϕ has an absolute minimum at c under the constraint g(x) = 0 if

ϕ(x) ≥ ϕ(c) for all x ∈ Γ,

and ϕ has a strict absolute minimum at c under the constraint g(x) = 0 if

ϕ(x) > ϕ(c) for all x ∈ Γ, x ≠ c.

12 NECESSARY CONDITIONS FOR A LOCAL MINIMUM UNDER CONSTRAINTS

The next theorem gives a necessary condition for a constrained minimum to occur at a given point.
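As a hedged illustration (a hypothetical problem, not one from the text; assuming SymPy), the first‐order Lagrange conditions can be generated and solved mechanically:

```python
import sympy as sp

x, y, lam = sp.symbols('x y lambda', real=True)

# Hypothetical problem: minimize x**2 + y**2 subject to x + y - 1 = 0.
phi = x**2 + y**2
g = x + y - 1
psi = phi - lam * g                     # Lagrangian function

# First-order conditions: d(psi)/dx = d(psi)/dy = 0 together with the constraint.
eqs = [sp.diff(psi, x), sp.diff(psi, y), g]
print(sp.solve(eqs, [x, y, lam], dict=True))
# [{x: 1/2, y: 1/2, lambda: 1}] -- the only candidate for a constrained extremum
```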

Exercises

  1. Consider the problem
    equation

    By using Lagrange's method, show that the minimum point is (0,0) with λ = 1. Next, consider the Lagrangian function

    equation

    and show that ψ has a saddle point at (0,0). That is, the point (0,0) does not minimize ψ. (This shows that it is not correct to say that minimizing a function subject to constraints is equivalent to minimizing the Lagrangian function.)

  2. Solve the following problems by using the Lagrange multiplier method:
    1. min(max) xy subject to x² + xy + y² = 1,
    2. min(max) (y − z)(z − x)(x − y) subject to x² + y² + z² = 2,
    3. min(max) x² + y² + z² − yz − zx − xy

      subject to x² + y² + z² − 2x + 2y + 6z + 9 = 0.

  3. Prove the inequality
    (x1x2 ⋯ xn)^(1/n) ≤ (x1 + x2 + ⋯ + xn)/n

    for all positive real numbers x1, …, xn. (Compare Section 11.4.)

  4. Solve the problem
    equation
  5. Solve the following utility maximization problem:
    equation

    with respect to x1 and x2 (x1 > 0, x2 > 0).

13 SUFFICIENT CONDITIONS FOR A LOCAL MINIMUM UNDER CONSTRAINTS

In the previous section, we obtained conditions that are necessary for a function to achieve a local minimum or maximum subject to equality constraints. To investigate whether a given critical point actually yields a minimum, maximum, or neither, it is often practical to proceed on an ad hoc basis. If this fails, the following theorem provides sufficient conditions to ensure the existence of a constrained minimum or maximum at a critical point.

The difficulty in applying Theorem 7.11 lies, of course, in the verification of the second‐order condition. This condition requires that

x′Ax > 0 for every x ≠ 0 satisfying Bx = 0,

where

A = Hϕ(c) − λ1Hg1(c) − ⋯ − λmHgm(c) and B = Dg(c).

Several sets of necessary and sufficient conditions exist for a quadratic form to be positive definite under linear constraints, and one of these is discussed in Section 3.11. The following theorem is therefore easily proved.
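For concreteness, here is a numerical check of one common bordered‐determinant form of such a condition (the precise statement and sign conventions are those of Section 3.11) on a hypothetical problem, assuming NumPy:

```python
import numpy as np

# Hypothetical problem: minimize x**2 + y**2 subject to x + y = 1.
# Critical point (1/2, 1/2) with multiplier lambda = 1.
A = np.array([[2.0, 0.0],
              [0.0, 2.0]])       # Hessian of the Lagrangian at the critical point
B = np.array([[1.0, 1.0]])       # Jacobian of the constraint, Dg = (1, 1)

m, n = B.shape
bordered = np.block([[np.zeros((m, m)), B],
                     [B.T,              A]])

# With m = 1 and n = 2 there is a single determinant to check:
# (-1)**m * det(bordered) > 0 indicates a constrained (strict local) minimum.
print(((-1) ** m) * np.linalg.det(bordered))    # 4.0 > 0
```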

Exercises

  1. Discuss the second‐order conditions for the constrained optimization problems in Exercise 2 in Section 7.12.
  2. Answer the same question as above for Exercises 4 and 5 in Section 7.12.
  3. Compare Example 7.4 and the solution method of Section 7.13 with Example 7.3 and the solution method of Section 7.12.

14 SUFFICIENT CONDITIONS FOR AN ABSOLUTE MINIMUM UNDER CONSTRAINTS

The Lagrange theorem (Theorem 7.10) gives necessary conditions for a local (and hence also for an absolute) constrained extremum to occur at a given point. In Theorem 7.11, we obtained sufficient conditions for a local constrained extremum. To find sufficient conditions for an absolute constrained extremum, we proceed as in the unconstrained case (Section 7.9) and impose appropriate convexity (concavity) conditions.

To prove that the Lagrangian function ψ is (strictly) convex or (strictly) concave, we can use the definition in Section 4.9, Theorem 7.5 or Theorem 7.6, or (if ψ is twice differentiable) Theorem 7.7. In addition, we observe that

  (a) if the constraints g1(x), …, gm(x) are all linear and ϕ(x) is (strictly) convex, then ψ(x) is (strictly) convex.

In fact, (a) is a special case of

  (b) if the functions λ1g1(x), …, λmgm(x) are all concave (that is, for i = 1, 2, …, m, either gi(x) is concave and λi ≥ 0, or gi(x) is convex and λi ≤ 0) and if ϕ(x) is convex, then ψ(x) is convex; furthermore, if at least one of these m + 1 conditions is strict, then ψ(x) is strictly convex.

15 A NOTE ON CONSTRAINTS IN MATRIX FORM

Let ϕ : S → ℝ be a real‐valued function defined on a set S of n × q matrices, and let G be an m × p matrix function defined on S. We shall frequently encounter the problem

minimize ϕ(X) subject to G(X) = 0.

This problem is, of course, mathematically equivalent to the case where X and G are vectors rather than matrices, so all theorems remain valid. We now introduce mp multipliers λij (one for each constraint gij(X) = 0, i = 1, …, m; j = 1, …, p), and define the m × p matrix of Lagrange multipliers L = (λij). The Lagrangian function then takes the convenient form

ψ(X) = ϕ(X) − tr L′G(X) = ϕ(X) − Σi,j λijgij(X).

16 ECONOMIC INTERPRETATION OF LAGRANGE MULTIPLIERS

Consider the constrained minimization problem

minimize ϕ(x) subject to g(x) = b,

where ϕ is a real‐valued function defined on an open set S in ℝⁿ, g is a vector function defined on S with values in ℝᵐ (m < n), and b = (b1, …, bm)′ is a given m × 1 vector of constants (parameters). In this section, we shall examine how the optimal solution of this constrained minimization problem changes when the parameters change.

We shall assume that

  (i) ϕ and g are twice continuously differentiable on S,
  (ii) (first‐order conditions) there exist points x0 = (x01, …, x0n)′ in S and l0 = (λ01, …, λ0m)′ in ℝᵐ such that
(38)  Dϕ(x0) = l0′Dg(x0),
(39)  g(x0) = b.

Now let

equation

and define, for r = 1, 2, …, n, Br as the m × r matrix whose columns are the first r columns of Bn, and Arr as the r × r matrix in the top left corner of Ann. In addition to (i) and (ii), we assume that

  (iii) |Bm| ≠ 0,
  (iv) (second‐order conditions)
equation

These assumptions are sufficient (in fact, more than sufficient) for the function ϕ to have a strict local minimum at x0 under the constraint g(x) = b (see Theorem 7.12).

The vectors x0 and l0 for which the first‐order conditions (38) and (39) are satisfied will, in general, depend on the parameter vector b. The question is whether x0 and l0 are differentiable functions of b. Given assumptions (i)–(iv), this question can be answered in the affirmative. By using the implicit function theorem (Theorem 7.15), we can show that there exists an m‐ball B(0) with the origin as its center, and unique functions x* and l* defined on B(0) with values in n and m, respectively, such that

  (a) x*(0) = x0, l*(0) = l0,
  (b) Dϕ(x*(y)) = (l*(y))′Dg(x*(y)) for all y in B(0),
  (c) g(x*(y)) = b + y for all y in B(0),
  (d) the functions x* and l* are continuously differentiable on B(0).

Now consider the real‐valued function ϕ* defined on B(0) by the equation

ϕ*(y) = ϕ(x*(y)).

We first differentiate both sides of (c). This gives

(40)  Dg(x*(y)) Dx*(y) = Im,

using the chain rule, where Im denotes the identity matrix of order m. Next we differentiate ϕ*. Using (again) the chain rule, (b), and (40), we obtain

Dϕ*(y) = Dϕ(x*(y)) Dx*(y) = (l*(y))′Dg(x*(y)) Dx*(y) = (l*(y))′.

In particular, at y = 0,

∂ϕ*(0)/∂yj = λ0j  (j = 1, …, m).

Thus, the Lagrange multiplier λ0j measures the rate at which the optimal value of the objective function changes with respect to a small change in the right‐hand side of the jth constraint. For example, if we are maximizing a firm's profit subject to one resource limitation, then the Lagrange multiplier λ0 is the extra profit that could be earned if the firm had one more unit of the resource, and it therefore represents the maximum price the firm is willing to pay for this additional unit. For this reason, λ0 is often referred to as a shadow price.
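A small numerical illustration of this interpretation (a hypothetical problem; plain Python): for the problem of minimizing x² + y² subject to x + y = b, the solution is x = y = b/2, the optimal value is ϕ*(b) = b²/2, and the multiplier is λ = b, which indeed equals the rate of change of ϕ*(b) with respect to b.

```python
# Hypothetical problem: minimize x**2 + y**2 subject to x + y = b.
# Solution: x = y = b/2, optimal value phi*(b) = b**2/2, multiplier lambda = b.

def phi_star(b):
    return b**2 / 2.0                 # optimal value as a function of the constraint level

b, eps = 1.0, 1e-6
lam = b                               # multiplier at the optimum for this problem
finite_difference_rate = (phi_star(b + eps) - phi_star(b)) / eps
print(finite_difference_rate, lam)    # both approximately 1.0: the shadow price
```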

Exercise

  1. In Exercise 2 in Section 7.12, determine whether a small relaxation of the constraint will increase or decrease the optimal function value. At what rate?

APPENDIX: THE IMPLICIT FUNCTION THEOREM

Let f : ℝᵐ⁺ᵏ → ℝᵐ be a linear function defined by

f(x; t) = Ax + Bt,

where, as the notation indicates, points in ℝᵐ⁺ᵏ are denoted by (x; t) with x ∈ ℝᵐ and t ∈ ℝᵏ, and where A is an m × m matrix and B an m × k matrix. If A is nonsingular, then there exists a unique function g : ℝᵏ → ℝᵐ such that

  1. g(0) = 0,
  2. f(g(t); t) = 0 for all t ∈ ℝᵏ,
  3. g is infinitely differentiable on ℝᵏ.

This unique function is, of course,

g(t) = −A⁻¹Bt.

The implicit function theorem asserts that a similar conclusion holds for certain differentiable transformations which are not linear. In this Appendix, we present (without proof) three versions of the implicit function theorem, each one being useful in slightly different circumstances.

BIBLIOGRAPHICAL NOTES

§1. Apostol (1974, Chapter 13) has a good discussion of implicit functions and extremum problems. See also Luenberger (1969) and Sydsæter (1981, Chapter 5).

§9 and §14. For an interesting approach to absolute minima with applications in statistics, see Rolle (1996).

Appendix. There are many versions of the implicit function theorem, but Theorem 7.15 is what most authors would call ‘the’ implicit function theorem. See Dieudonné (1969, Theorem 10.2.1) or Apostol (1974, Theorem 13.7). Theorems 7.14 and 7.16 are less often presented. See, however, Young (1910, Section 38).
