Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

2
Classical Optimization Techniques

2.1 Introduction

The classical methods of optimization are useful in finding the optimum solution of continuous and differentiable functions. These methods are analytical and make use of the techniques of differential calculus in locating the optimum points. Since some of the practical problems involve objective functions that are not continuous and/or differentiable, the classical optimization techniques have limited scope in practical applications. However, a study of the calculus methods of optimization forms a basis for developing most of the numerical techniques of optimization presented in subsequent chapters. In this chapter we present the necessary and sufficient conditions for locating the optimum solution of a single‐variable function, a multivariable function with no constraints, and a multivariable function with equality and inequality constraints.

2.2 Single‐Variable Optimization

A function of one variable f (x) is said to have a relative or local minimum at x = x* if f (x*) ≤ f (x* + h) for all sufficiently small positive and negative values of h. Similarly, a point x* is called a relative or local maximum if f (x*) ≥ f (x* + h) for all values of h sufficiently close to zero. A function f (x) is said to have a global or absolute minimum at x* if f (x*) ≤ f (x) for all x, and not just for all x close to x*, in the domain over which f (x) is defined. Similarly, a point x* will be a global maximum of f (x) if f (x*) ≥ f (x) for all x in the domain. Figure 2.1 shows the difference between the local and global optimum points.

c02f001 — Figure 2.1 Relative and global minima.

A single‐variable optimization problem is one in which the value of x = x* is to be found in the interval [a, b] such that x* minimizes f (x). The following two theorems provide the necessary and sufficient conditions for the relative minimum of a function of a single variable [1,2].

Proof

It is given that

(2.1)

exists as a definite number, which we want to prove to be zero. Since x* is a relative minimum, we have

for all values of h sufficiently close to zero. Hence

Thus Eq. (2.1) gives the limit as h tends to zero through positive values as

(2.2)

while it gives the limit as h tends to zero through negative values as

(2.3)

The only way to satisfy both Eqs. (2.2) and (2.3) is to have

(2.4)

This proves the theorem.

Notes:

This theorem can be proved even if x* is a relative maximum.
The theorem does not say what happens if a minimum or maximum occurs at a point x* where the derivative fails to exist. For example, in Figure 2.2,

depending on whether h approaches zero through positive or negative values, respectively. Unless the numbers m ⁺ and m ⁻ are equal, the derivative f′ (x*) does not exist. If f′(x*) does not exist, the theorem is not applicable.
The theorem does not say what happens if a minimum or maximum occurs at an endpoint of the interval of definition of the function. In this case

exists for positive values of h only or for negative values of h only, and hence the derivative is not defined at the endpoints.
The theorem does not say that the function necessarily will have a minimum or maximum at every point where the derivative is zero. For example, the derivative f′(x) = 0 at x = 0 for the function shown in Figure 2.3. However, this point is neither a minimum nor a maximum. In general, a point x* at which f′(x*) = 0 is called a stationary point.

If the function f (x) possesses continuous derivatives of every order that come into question, in the neighborhood of x = x*, the following theorem provides sufficient condition for the minimum or maximum value of the function [3,4].

c02f002 — Figure 2.2 Derivative undefined at x*.

c02f003 — Figure 2.3 Stationary (inflection) point.

Proof

Applying Taylor's theorem with remainder after n terms, we have

(2.5)

since f′(x*) = f″(x*) = ⋯ = f ^{(n − 1)}(x*) = 0, Eq. (2.5) becomes

As f ⁽ⁿ⁾ (x*) ≠ 0, there exists an interval around x* for every point x of which the nth derivative f ⁽ⁿ⁾ (x) has the same sign, namely, that of f ^{(n )}(x*). Thus, for every point x* + h of this interval, f ⁽ⁿ⁾ (x* + θh) has the sign of f ⁽ⁿ⁾ (x*). When n is even, h ⁿ /n! is positive irrespective of whether h is positive or negative, and hence f (x* + h) − f (x*) will have the same sign as that of f ⁽ⁿ⁾ (x*). Thus x* will be a relative minimum if f ⁽ⁿ⁾ (x*) is positive and a relative maximum if f ⁽ⁿ⁾ (x*) is negative. When n is odd, h ⁿ /n! changes sign with the change in the sign of h and hence the point x* is neither a maximum nor a minimum. In this case the point x* is called a point of inflection.

Example 2.2

In a two‐stage compressor, the working gas leaving the first stage of compression is cooled (by passing it through a heat exchanger) before it enters the second stage of compression to increase the efficiency [5]. The total work input to a compressor (W) for an ideal gas, for isentropic compression, is given by

where c _p is the specific heat of the gas at constant pressure, k is the ratio of specific heat at constant pressure to that at constant volume of the gas, and T ₁ is the temperature at which the gas enters the compressor. Find the pressure, p ₂, at which intercooling should be done to minimize the work input to the compressor. Also determine the minimum work done on the compressor.

SOLUTION

The necessary condition for minimizing the work done on the compressor is

which yields

The second derivative of W with respect to p ₂ gives

Since the ratio of specific heats k is greater than 1, we get

and hence the solution corresponds to a relative minimum. The minimum work done is given by

2.3 Multivariable Optimization with no Constraints

In this section we consider the necessary and sufficient conditions for the minimum or maximum of an unconstrained function of several variables [6,7]. Before seeing these conditions, we consider the Taylor's series expansion of a multivariable function.

2.3.1 Definition: rth Differential of f

If all partial derivatives of the function f through order r ≥ 1 exist and are continuous at a point X ^*, the polynomial

(2.6)

is called the rth differential of f at X*. Notice that there are r summations and one h _i is associated with each summation in Eq. (2.6).

For example, when r = 2 and n = 3, we have

The Taylor's series expansion of a function f (X) about a point X* is given by

(2.7)

where the last term, called the remainder, is given by

(2.8)

where 0 < θ < 1 and h = X − X*.

Proof

The proof given for Theorem 2.1 can easily be extended to prove the present theorem. However, we present a different approach to prove this theorem. Suppose that one of the first partial derivatives, say the kth one, does not vanish at X*. Then, by Taylor's theorem,

that is,

Since d ² f (X* + θ h) is of order , the terms of order h will dominate the higher‐order terms for small h. Thus, the sign of f (X* + h) − f (X*) is decided by the sign of h _k ∂f (X*)/∂x _k. Suppose that ∂f (X*)/∂x _k > 0. Then the sign of f (X* + h) − f (X*) will be positive for h _k > 0 and negative for h _k < 0. This means that X* cannot be an extreme point. The same conclusion can be obtained even if we assume that ∂f (X*)/∂x _k < 0. Since this conclusion is in contradiction with the original statement that X* is an extreme point, we may say that ∂f/∂x _k = 0 at X = X*. Hence the theorem is proved.

Proof

From Taylor's theorem we can write

(2.10)

Since X* is a stationary point, the necessary conditions give (Theorem 2.3)

Thus Eq. (2.10) reduces to

Therefore, the sign of

will be same as that of

Since the second partial derivative of ∂ ² f (X)/∂x _i ∂x _j is continuous in the neighborhood of X*,

will have the same sign as (∂ ² f/∂x _i ∂x _j)|X = X* for all sufficiently small h. Thus f (X* + h) − f (X*) will be positive, and hence X* will be a relative minimum, if

(2.11)

is positive. This quantity Q is a quadratic form and can be written in matrix form as

(2.12)

where

(2.13)

is the matrix of second partial derivatives and is called the Hessian matrix of f (X).

It is known from matrix algebra that the quadratic form of Eqs. (2.11) or (2.12) will be positive for all h if and only if [J] is positive definite at X = X*. This means that a sufficient condition for the stationary point X* to be a relative minimum is that the Hessian matrix evaluated at the same point be positive definite. This completes the proof for the minimization case. By proceeding in a similar manner, it can be proved that the Hessian matrix will be negative definite if X* is a relative maximum point.

Note: A matrix A will be positive definite if all its eigenvalues are positive; that is, all the values of λ that satisfy the determinantal equation

(2.14)

should be positive. Similarly, the matrix A will be negative definite if its eigenvalues are negative. A matrix A will be positive semidefinite (or negative semidefinite) if all of its eigenvalues are nonnegative (or nonpositive).

Another test that can be used to find the positive definiteness of a matrix A of order n involves evaluation of the determinants

(2.15)

The matrix A will be positive definite if and only if all the values A ₁, A ₂, A ₃, …, A _n are positive. The matrix A will be negative definite if and only if the sign of A _j is (−1)^j for j = 1, 2, …, n. If some of the A _j are positive and the remaining A _j are zero, the matrix A will be positive semidefinite.

Example 2.4

Figure 2.4 shows two frictionless rigid bodies (carts) A and B connected by three linear elastic springs having spring constants k ₁, k ₂, and k ₃. The springs are at their natural positions when the applied force P is zero. Find the displacements x ₁ and x ₂ under the force P by using the principle of minimum potential energy.

c02f004 — Figure 2.4 Spring–cart system.

SOLUTION

According to the principle of minimum potential energy, the system will be in equilibrium under the load P if the potential energy is a minimum. The potential energy of the system is given by

The necessary conditions for the minimum of U are

(E1)

(E2)

The values of x ₁ and x ₂ corresponding to the equilibrium state, obtained by solving Eqs. (E1) and (E2), are given by

The sufficiency conditions for the minimum at can also be verified by testing the positive definiteness of the Hessian matrix of U. The Hessian matrix of U evaluated at is

The determinants of the square submatrices of J are

since the spring constants are always positive. Thus, the matrix J is positive definite and hence corresponds to the minimum of potential energy.

2.3.2 Semidefinite Case

We now consider the problem of determining sufficient conditions for the case when the Hessian matrix of the given function is semidefinite. In the case of a function of a single variable, the problem of determining sufficient conditions for the case when the second derivative is zero was resolved quite easily. We simply investigated the higher‐order derivatives in the Taylor's series expansion. A similar procedure can be followed for functions of n variables. However, the algebra becomes quite involved, and hence we rarely investigate the stationary points for sufficiency in actual practice. The following theorem, analogous to Theorem 2.2, gives the sufficiency conditions for the extreme points of a function of several variables.

2.3.3 Saddle Point

In the case of a function of two variables, f (x, y), the Hessian matrix may be neither positive nor negative definite at a point (x*, y*) at which

In such a case, the point (x*, y*) is called a saddle point. The characteristic of a saddle point is that it corresponds to a relative minimum or maximum of f (x, y) with respect to one variable, say, x (the other variable being fixed at y = y*) and a relative maximum or minimum of f (x, y) with respect to the second variable y (the other variable being fixed at x*).

As an example, consider the function f (x, y) = x ² − y ². For this function,

These first derivatives are zero at x* = 0 and y* = 0. The Hessian matrix of f at (x*, y*) is given by

Since this matrix is neither positive definite nor negative definite, the point (x* = 0, y* = 0) is a saddle point. The function is shown graphically in Figure 2.5. It can be seen that f (x, y*) = f (x, 0) has a relative minimum and f (x*, y) = f (0, y) has a relative maximum at the saddle point (x*, y*). Saddle points may exist for functions of more than two variables also. The characteristic of the saddle point stated above still holds provided that x and y are interpreted as vectors in multidimensional cases.

c02f005 — Figure 2.5 Saddle point of the function f (x, y) = x ² − y ².

Example 2.5

Find the extreme points of the function

Solution

The necessary conditions for the existence of an extreme point are

These equations are satisfied at the points

To find the nature of these extreme points, we have to use the sufficiency conditions. The second‐order partial derivatives of f are given by

The Hessian matrix of f is given by

If J ₁ = |6x ₁ + 4| and images , the values of J ₁ and J ₂ and the nature of the extreme point are as given below:

Point X	Value of J ₁	Value of J ₂	Nature of J	Nature of X	f (X)
(0, 0)	+4	+32	Positive definite	Relative minimum	6
	+4	−32	Indefinite	Saddle point	418/27
	−4	−32	Indefinite	Saddle point	194/27
	−4	+32	Negative definite	Relative maximum	50/3

2.4 Multivariable Optimization with Equality Constraints

In this section we consider the optimization of continuous functions subjected to equality constraints:

(2.16)

where

Here m is less than or equal to n; otherwise (if m > n), the problem becomes overdefined and, in general, there will be no solution. There are several methods available for the solution of this problem. The methods of direct substitution, constrained variation, and Lagrange multipliers are discussed in the following sections [8–10].

2.4.1 Solution by Direct Substitution

For a problem with n variables and m equality constraints, it is theoretically possible to solve simultaneously the m equality constraints and express any set of m variables in terms of the remaining n − m variables. When these expressions are substituted into the original objective function, there results a new objective function involving only n − m variables. The new objective function is not subjected to any constraint, and hence its optimum can be found by using the unconstrained optimization techniques discussed in Section 2.3.

This method of direct substitution, although it appears to be simple in theory, is not convenient from a practical point of view. The reason for this is that the constraint equations will be nonlinear for most of practical problems, and often it becomes impossible to solve them and express any m variables in terms of the remaining n − m variables. However, the method of direct substitution might prove to be very simple and direct for solving simpler problems, as shown by the following example.

Example 2.6

Find the dimensions of a box of largest volume that can be inscribed in a sphere of unit radius.

SOLUTION

Let the origin of the Cartesian coordinate system x ₁, x ₂, x ₃ be at the center of the sphere and the sides of the box be 2x ₁, 2x ₂, and 2x ₃. The volume of the box is given by

(E1)

Since the corners of the box lie on the surface of the sphere of unit radius, x ₁, x ₂, and x ₃ have to satisfy the constraint

(E2)

This problem has three design variables and one equality constraint. Hence the equality constraint can be used to eliminate any one of the design variables from the objective function. If we choose to eliminate x ₃, Eq. (E2) gives

(E3)

Thus, the objective function becomes

(E4)

which can be maximized as an unconstrained function in two variables.

The necessary conditions for the maximum of f give

(E5)

(E6)

Equations (E5) and (E6) can be simplified to obtain

from which it follows that and hence . This solution gives the maximum volume of the box as

To find whether the solution found corresponds to a maximum or a minimum, we apply the sufficiency conditions to f (x ₁, x ₂) of Eq. (E4). The second‐order partial derivatives of f at are given by

Since

the Hessian matrix of f is negative definite at . Hence the point corresponds to the maximum of f.

2.4.2 Solution by the Method of Constrained Variation

The basic idea used in the method of constrained variation is to find a closed‐form expression for the first‐order differential of f (df) at all points at which the constraints g _j(X) = 0, j = 1, 2, …, m, are satisfied. The desired optimum points are then obtained by setting the differential df equal to zero. Before presenting the general method, we indicate its salient features through the following simple problem with n = 2 and m = 1:

(2.17)

subject to

(2.18)

A necessary condition for f to have a minimum at some point is that the total derivative of f (x ₁, x ₂) with respect to x ₁ must be zero at . By setting the total differential of f (x ₁, x ₂) equal to zero, we obtain

(2.19)

Since at the minimum point, any variations dx ₁ and dx ₂ taken about the point are called admissible variations provided that the new point lies on the constraint:

(2.20)

The Taylor's series expansion of the function in Eq. (2.20) about the point gives

(2.21)

where dx ₁ and dx ₂ are assumed to be small. Since , Eq. (2.21) reduces to

(2.22)

Thus Eq. (2.22) has to be satisfied by all admissible variations. This is illustrated in Figure 2.6, where PQ indicates the curve at each point of which Eq. (2.18) is satisfied. If A is taken as the base point , the variations in x ₁ and x ₂ leading to points B and C are called admissible variations. On the other hand, the variations in x ₁ and x ₂ representing point D are not admissible since point D does not lie on the constraint curve, g (x ₁, x ₂) = 0. Thus any set of variations (dx ₁, dx ₂) that does not satisfy Eq. (2.22) leads to points such as D, which do not satisfy constraint Eq. (2.18).

c02f006 — Figure 2.6 Variations about A.

Assuming that ∂g/∂x ₂ ≠ 0, Eq. (2.22) can be rewritten as

(2.23)

This relation indicates that once the variation in x ₁(dx ₁) is chosen arbitrarily, the variation in x ₂ (dx ₂) is decided automatically in order to have dx ₁ and dx ₂ as a set of admissible variations. By substituting Eq. (2.23) in Eq. (2.19), we obtain

(2.24)

The expression on the left‐hand side is called the constrained variation of f. Note that Eq. (2.24) has to be satisfied for all values of dx ₁. Since dx ₁ can be chosen arbitrarily, Eq. (2.24) leads to

(2.25)

Equation (2.25) represents a necessary condition in order to have as an extreme point (minimum or maximum).

Example 2.7

A beam of uniform rectangular cross section is to be cut from a log having a circular cross section of diameter 2a. The beam has to be used as a cantilever beam (the length is fixed) to carry a concentrated load at the free end. Find the dimensions of the beam that correspond to the maximum tensile (bending) stress carrying capacity.

Solution

From elementary strength of materials, we know that the tensile stress induced in a rectangular beam (σ) at any fiber located a distance y from the neutral axis is given by

where M is the bending moment acting and I is the moment of inertia of the cross section about the x axis. If the width and depth of the rectangular beam shown in Figure 2.7 are 2x and 2y, respectively, the maximum tensile stress induced is given by

c02f007 — Figure 2.7 Cross section of the log.

Thus, for any specified bending moment, the beam is said to have maximum tensile stress carrying capacity if the maximum induced stress (σ _max) is a minimum. Hence, we need to minimize k/xy ² or maximize Kxy ², where k = 3 M/4 and K = 1/k, subject to the constraint

This problem has two variables and one constraint; hence Eq. (2.25) can be applied for finding the optimum solution. Since

(E1)

(E2)

we have

Equation (2.25) gives

that is,

(E3)

Thus, the beam of maximum tensile stress carrying capacity has a depth of √2 times its breadth. The optimum values of x and y can be obtained from Eqs. (E3) and (E2) as

Necessary Conditions for a General Problem

The procedure indicated above can be generalized to the case of a problem in n variables with m constraints. In this case, each constraint equation g _j(X) = 0, j = 1, 2, …, m, gives rise to a linear equation in the variations dx _i, i = 1, 2, …, n. Thus, there will be in all m linear equations in n variations. Hence, any m variations can be expressed in terms of the remaining n − m variations. These expressions can be used to express the differential of the objective function, df, in terms of the n − m independent variations. By letting the coefficients of the independent variations vanish in the equation df = 0, one obtains the necessary conditions for the constrained optimum of the given function. These conditions can be expressed as [7]

(2.26)

where k = m + 1, m + 2, …, n. It is to be noted that the variations of the first m variables (dx ₁, dx ₂, …, dx _m) have been expressed in terms of the variations of the remaining n − m variables (dx _m+1, dx _m+2, …, dx _n) in deriving Eq. (2.26). This implies that the following relation is satisfied:

(2.27)

The n − m equations given by Eq. (2.26) represent the necessary conditions for the extremum of f (X) under the m equality constraints, g _j (X) = 0, j = 1, 2, …, m.

Example 2.8

(E1)

subject to

(E2)

(E3)

Solution

This problem can be solved by applying the necessary conditions given by Eq. (2.26). Since n = 4 and m = 2, we have to select two variables as independent variables. First, we show that any arbitrary set of variables cannot be chosen as independent variables since the remaining (dependent) variables have to satisfy the condition of Eq. (2.27).

In terms of the notation of our equations, let us take the independent variables as

Then the Jacobian of Eq. (2.27) becomes

and hence the necessary conditions of Eq. (2.26) cannot be applied.

Next, let us take the independent variables as x ₃ = y ₂ and x ₄ = y ₄ so that x ₁ = y ₁ and x ₂ = y ₃. Then the Jacobian of Eq. (2.27) becomes

and hence the necessary conditions of Eq. (2.26) can be applied. Equations (2.26) give for k = m + 1 = 3

and for k = m + 2 = n = 4,

Equations (E4) and (E5) give the necessary conditions for the minimum or the maximum of f as

(E6)

When Eqs. (E6) are substituted, Eqs. (E2) and (E3) take the form

from which the desired optimum solution can be obtained as

Sufficiency Conditions for a General Problem

By eliminating the first m variables, using the m equality constraints (this is possible, at least in theory), the objective function f can be made to depend only on the remaining variables, x _m+1, x _m+2, …, x _n. Then the Taylor's series expansion of f, in terms of these variables, about the extreme point X* gives

(2.28)

where (∂f/∂x _i)_g is used to denote the partial derivative of f with respect to x _i (holding all the other variables x _m+1, x _m+2, …, x _i−1, x _i+1, x _i+2, …, x _n constant) when x ₁, x ₂, …, x _m are allowed to change so that the constraints g _j (X* + d X) = 0, j = 1, 2, …, m, are satisfied; the second derivative, (∂ ² f/∂x _i ∂x _j)_g, is used to denote a similar meaning.

As an example, consider the problem of minimizing

subject to the only constraint

Since n = 3 and m = 1 in this problem, one can think of any of the m variables, say x ₁, to be dependent and the remaining n − m variables, namely x ₂ and x ₃, to be independent. Here the constrained partial derivative (∂f/∂x ₂)_g, for example, means the rate of change of f with respect to x ₂ (holding the other independent variable x ₃ constant) and at the same time allowing x ₁ to change about X* so as to satisfy the constraint g ₁ (X) = 0. In the present case, this means that dx ₁ has to be chosen to satisfy the relation

that is,

since g ₁ (X*) = 0 at the optimum point and dx ₃ = 0 (x ₃ is held constant).

Notice that (∂f/∂x _i)_g has to be zero for i = m + 1, m + 2, …, n since the dx _i appearing in Eq. (2.28) are all independent. Thus, the necessary conditions for the existence of constrained optimum at X* can also be expressed as

(2.29)

Of course, with little manipulation, one can show that Eqs. (2.29) are nothing but Eq. (2.26). Further, as in the case of optimization of a multivariable function with no constraints, one can see that a sufficient condition for X* to be a constrained relative minimum (maximum) is that the quadratic form Q defined by

(2.30)

is positive (negative) for all nonvanishing variations dx _i. As in Theorem 2.4, the matrix

has to be positive (negative) definite to have Q positive (negative) for all choices of dx _i. It is evident that computation of the constrained derivatives (∂ ² f/∂x _i ∂x _j)_g is a difficult task and may be prohibitive for problems with more than three constraints. Thus, the method of constrained variation, although it appears to be simple in theory, is very difficult to apply since the necessary conditions themselves involve evaluation of determinants of order m + 1. This is the reason that the method of Lagrange multipliers, discussed in the following section, is more commonly used to solve a multivariable optimization problem with equality constraints.

2.4.3 Solution By the Method of Lagrange Multipliers

The basic features of the Lagrange multiplier method is given initially for a simple problem of two variables with one constraint. The extension of the method to a general problem of n variables with m constraints is given later.

Problem with Two Variables and One Constraint

Consider the problem

(2.31)

subject to

For this problem, the necessary condition for the existence of an extreme point at X = X* was found in Section 2.4.2 to be

(2.32)

By defining a quantity λ, called the Lagrange multiplier, as

(2.33)

Equation (2.32) can be expressed as

(2.34)

and Eq. (2.33) can be written as

(2.35)

In addition, the constraint equation has to be satisfied at the extreme point, that is,

(2.36)

Thus Eqs. (2.34)–(2.36) represent the necessary conditions for the point to be an extreme point.

Notice that the partial derivative has to be nonzero to be able to define λ by Eq. (2.33). This is because the variation dx ₂ was expressed in terms of dx ₁ in the derivation of Eq. (2.32) (see Eq. (2.23)). On the other hand, if we choose to express dx ₁ in terms of dx ₂, we would have obtained the requirement that be nonzero to define λ. Thus, the derivation of the necessary conditions by the method of Lagrange multipliers requires that at least one of the partial derivatives of g (x ₁, x ₂) be nonzero at an extreme point.

The necessary conditions given by Eqs. (2.34)–(2.36) are more commonly generated by constructing a function L, known as the Lagrange function, as

(2.37)

By treating L as a function of the three variables x ₁, x ₂, and λ, the necessary conditions for its extremum are given by

(2.38)

Equations (2.38) can be seen to be same as Eqs. (2.34)–(2.36). The sufficiency conditions are given later.

Necessary Conditions for a General Problem

The equations derived above can be extended to the case of a general problem with n variables and m equality constraints:

(2.39)

subject to

The Lagrange function, L, in this case is defined by introducing one Lagrange multiplier λ_j for each constraint g _j(X) as

(2.40)

By treating L as a function of the n + m unknowns, x ₁, x ₂, …, x _n, λ₁, λ₂, …, λ_m, the necessary conditions for the extremum of L, which also correspond to the solution of the original problem stated in Eq. (2.39), are given by

(2.41)

(2.42)

Equations (2.41) and (2.42) represent n + m equations in terms of the n + m unknowns, x _i and λ_j. The solution of Eqs. (2.41) and (2.42) gives

The vector X* corresponds to the relative constrained minimum of f (X) (sufficient conditions are to be verified) while the vector λ* provides the sensitivity information, as discussed in the next subsection.

Sufficiency Conditions for a General Problem

A sufficient condition for f (X) to have a constrained relative minimum at X* is given by the following theorem.

Proof

The proof is similar to that of Theorem 2.4.

Notes

If

is negative for all choices of the admissible variations dx _i, X* will be a constrained maximum of f (X).
It has been shown by Hancock [1] that a necessary condition for the quadratic form Q, defined by Eq. (2.43), to be positive (negative) definite for all admissible variations d X is that each root of the polynomial z _i, defined by the following determinantal equation, be positive (negative):
(2.44)

where

(2.45)

(2.46)
Equation (2.44), on expansion, leads to an (n − m)th‐order polynomial in z. If some of the roots of this polynomial are positive while the others are negative, the point X* is not an extreme point.

The application of the necessary and sufficient conditions in the Lagrange multiplier method is illustrated with the help of the following example.

Example 2.10

Find the dimensions of a cylindrical tin (with top and bottom) made up of sheet metal to maximize its volume such that the total surface area is equal to A ₀ = 24π.

SOLUTION

If x ₁ and x ₂ denote the radius of the base and length of the tin, respectively, the problem can be stated as

subject to

The Lagrange function is

and the necessary conditions for the maximum of f give

(E1)

(E2)

(E3)

Equations (E1) and (E2) lead to

that is,

(E4)

and Eqs. (E3) and (E4) give the desired solution as

This gives the maximum value of f as

If A ₀ = 24π, the optimum solution becomes

To see that this solution really corresponds to the maximum of f, we apply the sufficiency condition of Eq. (2.44). In this case

Thus Eq. (2.44) becomes

that is,

This gives

Since the value of z is negative, the point corresponds to the maximum of f.

Interpretation of the Lagrange Multipliers

To find the physical meaning of the Lagrange multipliers, consider the following optimization problem involving only a single equality constraint:

(2.47)

subject to

(2.48)

where b is a constant. The necessary conditions to be satisfied for the solution of the problem are

(2.49)

(2.50)

Let the solution of Eqs. (2.49) and (2.50) be given by X*, λ*, and f* = f (X*). Suppose that we want to find the effect of a small relaxation or tightening of the constraint on the optimum value of the objective function (i.e. we want to find the effect of a small change in b on f*). For this we differentiate Eq. (2.48) to obtain

(2.51)

Equation (2.49) can be rewritten as

(2.52)

(2.53)

Substituting Eq. (2.53) into Eq. (2.51), we obtain

(2.54)

since

(2.55)

Equation (2.54) gives

(2.56)

(2.57)

Thus λ* denotes the sensitivity (or rate of change) of f with respect to b or the marginal or incremental change in f* with respect to b at x*. In other words, λ* indicates how tightly the constraint is binding at the optimum point. Depending on the value of λ* (positive, negative, or zero), the following physical meaning can be attributed to λ*:

λ* > 0. In this case, a unit decrease in b is positively valued since one gets a smaller minimum value of the objective function f. In fact, the decrease in f* will be exactly equal to λ* since df = λ* (−1) = −λ* < 0. Hence λ* may be interpreted as the marginal gain (further reduction) in f* due to the tightening of the constraint. On the other hand, if b is increased by 1 unit, f will also increase to a new optimum level, with the amount of increase in f* being determined by the magnitude of λ* since df = λ*(+1) > 0. In this case, λ* may be thought of as the marginal cost (increase) in f* due to the relaxation of the constraint.
λ* < 0. Here a unit increase in b is positively valued. This means that it decreases the optimum value of f. In this case the marginal gain (reduction) in f* due to a relaxation of the constraint by 1 unit is determined by the value of λ* as df* = λ*(+1) < 0. If b is decreased by 1 unit, the marginal cost (increase) in f* by the tightening of the constraint is df* = λ*(−1) > 0 since, in this case, the minimum value of the objective function increases.
λ* = 0. In this case, any incremental change in b has absolutely no effect on the optimum value of f and hence the constraint will not be binding. This means that the optimization of f subject to g = 0 leads to the same optimum point X* as with the unconstrained optimization of f

In economics and operations research, Lagrange multipliers are known as shadow prices of the constraints since they indicate the changes in optimal value of the objective function per unit change in the right‐hand side of the equality constraints.

Example 2.11

Find the maximum of the function f (X) = 2x ₁ + x ₂ + 10 subject to using the Lagrange multiplier method. Also find the effect of changing the right‐hand side of the constraint on the optimum value of f.

SOLUTION

The Lagrange function is given by

(E1)

The necessary conditions for the solution of the problem are

(E2)

The solution of Eq. (E2) is

(E3)

The application of the sufficiency condition of Eq. (2.44) yields

Hence X* will be a maximum of f with f* = f (X*) = 16.07.

One procedure for finding the effect on f* of changes in the value of b (right‐hand side of the constraint) would be to solve the problem all over with the new value of b. Another procedure would involve the use of the value of λ*. When the original constraint is tightened by 1 unit (i.e. db = −1), Eq. (2.57) gives

Thus, the new value of f* is f* + df* = 14.07. On the other hand, if we relax the original constraint by 2 units (i.e. db = 2), we obtain

and hence the new value of f* is f* + df* = 20.07.

2.5 Multivariable Optimization with Inequality Constraints

This section is concerned with the solution of the following problem:

subject to

(2.58)

The inequality constraints in Eq. (2.58) can be transformed to equality constraints by adding nonnegative slack variables, , as

(2.59)

where the values of the slack variables are yet unknown. The problem now becomes

subject to

(2.60)

where Y = {y ₁, y ₂, …, y _m}^T is the vector of slack variables.

This problem can be solved conveniently by the method of Lagrange multipliers. For this, we construct the Lagrange function L as

(2.61)

where λ = {λ₁, λ₂, …, λ_m}^T is the vector of Lagrange multipliers. The stationary points of the Lagrange function can be found by solving the following equations (necessary conditions):

(2.62)

(2.63)

(2.64)

It can be seen that Eqs. (2.62)–(2.64) represent (n + 2m) equations in the (n + 2m) unknowns, X, λ, and Y. The solution of Eqs. (2.62)–(2.64) thus gives the optimum solution vector, X*; the Lagrange multiplier vector, λ*; and the slack variable vector, Y*.

Equations (2.63) ensure that the constraints g _j(X) ≤ 0, j = 1, 2, …, m, are satisfied, while Eq. (2.64) imply that either λ_j = 0 or y _j = 0. If λ_j = 0, it means that the jth constraint is inactive1 and hence can be ignored. On the other hand, if y _j = 0, it means that the constraint is active (g _j = 0) at the optimum point. Consider the division of the constraints into two subsets, J ₁ and J ₂, where J ₁ + J ₂ represent the total set of constraints. Let the set J ₁ indicate the indices of those constraints that are active at the optimum point and J ₂ include the indices of all the inactive constraints.

Thus, for j ∈ J ₁,2 y _j = 0 (constraints are active), for j ∈ J ₂, λ_j = 0 (constraints are inactive), and Eq. (2.62) can be simplified as

(2.65)

Similarly, Eq. (2.63) can be written as

(2.66)

(2.67)

Equations (2.65)–(2.67) represent n + p + (m − p) = n + m equations in the n + m unknowns x _i (i = 1, 2, …, n), λ_j(j ∈ J ₁), and y _j(j ∈ J ₂), where p denotes the number of active constraints.

Assuming that the first p constraints are active, Eq. (2.65) can be expressed as

(2.68)

These equations can be written collectively as

(2.69)

where ∇f and ∇g _j are the gradients of the objective function and the jth constraint, respectively:

Equation (2.69) indicates that the negative of the gradient of the objective function can be expressed as a linear combination of the gradients of the active constraints at the optimum point.

Further, we can show that in the case of a minimization problem, the λ_j values (j ∈ J ₁) have to be positive. For simplicity of illustration, suppose that only two constraints are active (p = 2) at the optimum point. Then Eq. (2.69) reduces to

(2.70)

Let S be a feasible direction3 at the optimum point. By premultiplying both sides of Eq. (2.70) by S ^T, we obtain

(2.71)

where the superscript T denotes the transpose. Since S is a feasible direction, it should satisfy the relations

(2.72)

c02f008 — Figure 2.8 Feasible direction S.

Thus, if λ₁ > 0 and λ₂ > 0, the quantity S ^T ∇ f can be seen always to be positive. As ∇f indicates the gradient direction, along which the value of the function increases at the maximum rate,4 S ^T ∇ f represents the component of the increment of f along the direction S. If S ^T ∇ f > 0, the function value increases as we move along the direction S. Hence, if λ₁ and λ₂ are positive, we will not be able to find any direction in the feasible domain along which the function value can be decreased further. Since the point at which Eq. (2.72) is valid is assumed to be optimum, λ₁ and λ₂ have to be positive. This reasoning can be extended to cases where there are more than two constraints active. By proceeding in a similar manner, one can show that the λ_j values have to be negative for a maximization problem.

Example 2.12

Consider the following optimization problem:

Derive the conditions to be satisfied at the point X ₁ = {1, 7}^T by the search direction S = {s ₁, s ₂}^T if it is a (a) usable direction, and (b) feasible direction.

Solution

The objective function and the constraints can be stated as

At the given point X ₁ = {1, 7}^T, all the constraints can be seen to be satisfied with g ₁ and g ₂ being active. The gradients of the objective and active constraint functions at point X ₁ = {1, 7}^T are given by

For the search direction S = {s ₁, s ₂}^T, the usability and feasibility conditions can be expressed as

(a) Usability condition:
(E1)
(b) Feasibility conditions:
(E2)

(E3)

Note: Any two numbers for s ₁ and s ₂ that satisfy the inequality (E1) will constitute a usable direction S. For example, s ₁ = 1 and s ₂ = −1 gives the usable direction S = {1, −1}^T. This direction can also be seen to be a feasible direction because it satisfies the inequalities (E2) and (E3).

2.5.1 Kuhn–Tucker Conditions

As shown above, the conditions to be satisfied at a constrained minimum point, X*, of the problem stated in Eq. (2.58) can be expressed as

(2.73)

(2.74)

These are called Kuhn–Tucker conditions after the mathematicians who derived them as the necessary conditions to be satisfied at a relative minimum of f (X) [11]. These conditions are, in general, not sufficient to ensure a relative minimum. However, there is a class of problems, called convex programming problems,5 for which the Kuhn–Tucker conditions are necessary and sufficient for a global minimum.

If the set of active constraints is not known, the Kuhn–Tucker conditions can be stated as follows:

(2.75)

Note6 that if the problem is one of maximization or if the constraints are of the type g _j ≥ 0, the λ_j have to be nonpositive in Eq. (2.75). On the other hand, if the problem is one of maximization with constraints in the form g _j ≥ 0, the λ_j have to be nonnegative in Eq. (2.75).

2.5.2 Constraint Qualification

When the optimization problem is stated as

subject to

(2.76)

the Kuhn–Tucker conditions become

(2.77)

where λ_j and β _k denote the Lagrange multipliers associated with the constraints g _j ≤ 0 and h _k = 0, respectively. Although we found qualitatively that the Kuhn–Tucker conditions represent the necessary conditions of optimality, the following theorem gives the precise conditions of optimality.

Example 2.13

Consider the following problem:

(E1)

subject to

(E2)

(E3)

Determine whether the constraint qualification and the Kuhn–Tucker conditions are satisfied at the optimum point.

SOLUTION

The feasible region and the contours of the objective function are shown in Figure 2.9. It can be seen that the optimum solution is (0, 0). Since g ₁ and g ₂ are both active at the optimum point (0, 0), their gradients can be computed as

c02f009 — Figure 2.9 Feasible region and contours of the objective function.

It is clear that ∇g ₁(X*) and ∇g ₂(X*) are not linearly independent. Hence the constraint qualification is not satisfied at the optimum point. Noting that

the Kuhn–Tucker conditions can be written, using Eqs. (2.73) and (2.74), as

(E4)

(E5)

(E6)

(E7)

Since Eq. (E4) is not satisfied and Eq. (E5) can be satisfied for negative values of λ₁ = λ₂ also, the Kuhn–Tucker conditions are not satisfied at the optimum point.

Example 2.14

A manufacturing firm producing small refrigerators has entered into a contract to supply 50 refrigerators at the end of the first month, 50 at the end of the second month, and 50 at the end of the third. The cost of producing x refrigerators in any month is given by $(x ² + 1000). The firm can produce more refrigerators in any month and carry them to a subsequent month. However, it costs $20 per unit for any refrigerator carried over from one month to the next. Assuming that there is no initial inventory, determine the number of refrigerators to be produced in each month to minimize the total cost.

SOLUTION

Let x ₁, x ₂, and x ₃ represent the number of refrigerators produced in the first, second, and third month, respectively. The total cost to be minimized is given by

The constraints can be stated as

The Kuhn–Tucker conditions are given by

that is,

(E1)

(E2)

(E3)

that is,

(E4)

(E5)

(E6)

that is,

(E7)

(E8)

(E9)

that is,

(E10)

(E11)

(E12)

The solution of Eqs. (E1)–(E12) can be found in several ways. We proceed to solve these equations by first nothing that either λ₁ = 0 or x ₁ = 50 according to Eq. (E4). Using this information, we investigate the following cases to identify the optimum solution of the problem.

Case 1: λ₁ = 0.

Equations (E1)–(E3) give
(E13)

Substituting Eq. (E13) in Eqs. (E5) and (E6), we obtain

(E14)

The four possible solutions of Eq. (E14) are
1. . These equations, along with Eq. (E13), yield the solution
  
  This solution satisfies Eqs. (E10)–(E12) but violates Eqs. (E7) and (E8) and hence cannot be optimum.
2. λ₃ = 0, −130 − λ₂ − λ₃ = 0. The solution of these equations leads to
  
  This solution can be seen to satisfy Eqs. (E10)–(E12) but violate Eqs. (E7) and (E9).
3. λ₂ = 0, λ₃ = 0. Equations (E13) give
  
  This solution satisfies Eqs. (E10)–(E12) but violates the constraints, Eqs. (E7)–(E9).
4. −130 − λ₂ − λ₃ = 0, −180 − λ₂ − = 0. The solution of these equations and Eq. (E13) yields
  
  This solution satisfies Eqs. (E10)–(E12) but violates the constraint, Eq. (E7).

Case 2: x ₁ = 50.

In this case, Eqs. (E1)–(E3) give
(E15)

Substitution of Eq. (E15) in Eqs. (E5) and (E6) leads to

(E16)

Once again, it can be seen that there are four possible solutions to Eq. (E16), as indicated below:
1. −20 − 2x ₂ + 2x ₃ = 0, x ₁ + x ₂ + x ₃ − 150 = 0: The solution of these equations yields
  
  This solution can be seen to violate Eq. (E8).
2. −20 − 2x ₂ + 2x ₃ = 0, −2x ₃ = 0: These equations lead to the solution
  
  This solution can be seen to violate Eqs. (E8) and (E9).
3. x ₁ + x ₂ − 100 = 0, −2x ₃ = 0: These equations give
  
  This solution violates the constraint Eq. (E9).
4. x ₁ + x ₂ − 100 = 0, x ₁ + x ₂ + x ₃ − 150 = 0: The solution of these equations yields
  
  This solution can be seen to satisfy all the constraint Eqs. (E7)–(E9). The values of λ₁, λ₂, and λ₃ corresponding to this solution can be obtained from Eq. (E15) as
  
  Since these values of λ_i satisfy the requirements (Eqs. (E10)–(E12)), this solution can be identified as the optimum solution. Thus

2.6 Convex Programming Problem

The optimization problem stated in Eq. (2.58) is called a convex programming problem if the objective function f (X) and the constraint functions g _j(X) are convex. The definition and properties of a convex function are given in Appendix A. Suppose that f (X) and g _j (X), j = 1, 2, …, m, are convex functions. The Lagrange function of Eq. (2.61) can be written as

(2.78)

If λ_j ≥ 0, then λ_j g _j (X) is convex, and since λ_j y _j = 0 from Eq. (2.64), L (X, Y, λ) will be a convex function. As shown earlier, a necessary condition for f (X) to be a relative minimum at X* is that L (X, Y, λ) have a stationary point at X*. However, if L (X, Y, λ) is a convex function, its derivative vanishes only at one point, which must be an absolute minimum of the function f (X). Thus, the Kuhn–Tucker conditions are both necessary and sufficient for an absolute minimum of f (X) at X*.

Notes:

If the given optimization problem is known to be a convex programming problem, there will be no relative minima or saddle points, and hence the extreme point found by applying the Kuhn–Tucker conditions is guaranteed to be an absolute minimum of f (X). However, it is often very difficult to ascertain whether the objective and constraint functions involved in a practical engineering problem are convex.
The derivation of the Kuhn–Tucker conditions was based on the development given for equality constraints in Section 2.4. One of the requirements for these conditions was that at least one of the Jacobians composed of the m constraints and m of the n + m variables (x ₁, x ₂, …, x _n; y ₁, y ₂, …, y _m) be nonzero. This requirement is implied in the derivation of the Kuhn–Tucker conditions.

References and Bibliography

1 Hancock, H. (1960). Theory of Maxima and Minima. Dover, NY: Dover Publications.
2 Levenson, M.E. (1967). Maxima and Minima. New York: Macmillan.
3 Thomas, G.B. Jr. (1967). Calculus and Analytic Geometry. Reading, MA: Addison‐Wesley.
4 Richmond, A.E. (1972). Calculus for Electronics. New York: McGraw‐Hill.
5 Howell, J.R. and Buckius, R.O. (1992). Fundamentals of Engineering Thermodynamics, 2e. New York: McGraw‐Hill.
6 Kolman, B. and Trench, W.F. (1971). Elementary Multivariable Calculus. New York: Academic Press.
7 Beveridge, G.S.G. and Schechter, R.S. (1970). Optimization: Theory and Practice. New York: McGraw‐Hill.
8 Gue, R. and Thomas, M.E. (1968). Mathematical Methods of Operations Research. New York: Macmillan.
9 Ayres, F. Jr. (1962). Theory and Problems of Matrices, Schaum's Outline Series. New York: Schaum.
10 Panik, M.J. (1976). Classical Optimization: Foundations and Extensions. North‐Holland, Amsterdam.
11 Kuhn, H.W. and Tucker, A. (1951). Nonlinear programming. In: Proceedings of the 2nd Berkeley Symposium on Mathematical Statistics and Probability. Berkeley: University of California Press.
12 Bazaraa, M.S. and Shetty, C.M. (1979). Nonlinear Programming: Theory and Algorithms. New York: Wiley.
13 Simmons, D.M. (1975). Nonlinear Programming for Operations Research. Engle‐wood Cliffs, NJ: Prentice Hall.

Review Questions

1 1 State the necessary and sufficient conditions for the minimum of a function f (x).
2 2 Under what circumstances can the condition df (x)/dx = 0 not be used to find the minimum of the function f (x)?
3 3 Define the rth differential, d ^r f (X), of a multivariable function f (X).
4 4 Write the Taylor's series expansion of a function f (X).
5 5 State the necessary and sufficient conditions for the maximum of a multivariable function f (X).
6 6 What is a quadratic form?
7 7 How do you test the positive, negative, or indefiniteness of a square matrix [A]?
8 8 Define a saddle point and indicate its significance.
9 9 State the various methods available for solving a multivariable optimization problem with equality constraints.
10 10 State the principle behind the method of constrained variation.
11 11 What is the Lagrange multiplier method?
12 12 What is the significance of Lagrange multipliers?
13 13 Convert an inequality constrained problem into an equivalent unconstrained problem.
14 14 State the Kuhn–Tucker conditions.
15 15 What is an active constraint?
16 16 Define a usable feasible direction.
17 17 What is a convex programming problem? What is its significance?
18 Answer whether each of the following quadratic forms is positive definite, negative definite, or neither:
1. (18) (18)
2. (18) (18) f = 4x ₁ x ₂
3. (18) (18)
4. (18) (18)
5. (18) (18)
19 State whether each of the following functions is convex, concave, or neither:
1. (19) (19) f = − 2x ² + 8x + 4
2. (19) (19) f = x ² + 10x + 1
3. (19) (19)
4. (19) (19)
5. (19) (19) f = e ^−x, x > 0
6. (19) (19)
7. (19) (19) f = x ₁ x ₂
8. (19) (19) f = (x ₁ − 1)² + 10(x ₂ − 2)²

20 20 Match the following equations and their characteristics:

(a)	f = 4x ₁ − 3x ₂ + 2	Relative maximum at (1, 2)
(b)	f = (2x ₁ − 2)² + (x ₂ − 2)²	Saddle point at origin
(c)	f = − (x ₁ − 1)² − (x ₂ − 2)²	No minimum
(d)	f = x ₁ x ₂	Inflection point at origin
(e)	f = x ³	Relative minimum at (1, 2)

Problems

2.1 A dc generator has an internal resistance R ohms and develops an open‐circuit voltage of V volts (Figure 2.10). Find the value of the load resistance r for which the power delivered by the generator will be a maximum.

Figure 2.10 Electric generator with load.
2.2 Find the maxima and minima, if any, of the function
2.3 Find the maxima and minima, if any, of the function
2.4 The efficiency of a screw jack is given by

where α is the lead angle and ϕ is a constant. Prove that the efficiency of the screw jack will be maximum when α = 45° − ϕ/2 with η _max = (1 − sin ϕ)/(1 + sin ϕ).
2.5 Find the minimum of the function
2.6 Find the angular orientation of a cannon to maximize the range of the projectile.
2.7 In a submarine telegraph cable the speed of signaling varies as x ² log (1/x), where x is the ratio of the radius of the core to that of the covering. Show that the greatest speed is attained when this ratio is .
2.8 The horsepower generated by a Pelton wheel is proportional to u (V − u), where u is the velocity of the wheel, which is variable, and V is the velocity of the jet, which is fixed. Show that the efficiency of the Pelton wheel will be maximum when u = V/2.
2.9 A pipe of length l and diameter D has at one end a nozzle of diameter d through which water is discharged from a reservoir. The level of water in the reservoir is maintained at a constant value h above the center of nozzle. Find the diameter of the nozzle so that the kinetic energy of the jet is a maximum. The kinetic energy of the jet can be expressed as

where ρ is the density of water, f the friction coefficient and g the gravitational constant.
2.10 An electric light is placed directly over the center of a circular plot of lawn 100 m in diameter. Assuming that the intensity of light varies directly as the sine of the angle at which it strikes an illuminated surface, and inversely as the square of its distance from the surface, how high should the light be hung in order that the intensity may be as great as possible at the circumference of the plot?
2.11 If a crank is at an angle θ from dead center with θ = ωt, where ω is the angular velocity and t is time, the distance of the piston from the end of its stroke (x) is given by

where r is the length of the crank and l is the length of the connecting rod. For r = 1 and l = 5, find (a) the angular position of the crank at which the piston moves with maximum velocity, and (b) the distance of the piston from the end of its stroke at that instant.

Determine whether each of the matrices in Problems 2.12–2.14 is positive definite, negative definite, or indefinite by finding its eigenvalues.
2.12
2.13
2.14
Determine whether each of the matrices in Problems 2.15–2.17 is positive definite, negative definite, or indefinite by evaluating the signs of its submatrices.
2.15
2.16
2.17
2.18 Express the function

in matrix form as

and determine whether the matrix [A] is positive definite, negative definite, or indefinite.
2.19 Determine whether the following matrix is positive or negative definite:
2.20 Determine whether the following matrix is positive definite:
2.21 The potential energy of the two‐bar truss shown in Figure 2.11 is given by

where E is Young's modulus, A the cross‐sectional area of each member, l the span of the truss, s the length of each member, h the height of the truss, P the applied load, θ the angle at which the load is applied, and x ₁ and x ₂ are, respectively, the horizontal and vertical displacements of the free node. Find the values of x ₁ and x ₂ that minimize the potential energy when E = 207 × 10⁹ Pa, A = 10⁻⁵ m², l = 1.5 m, h = 4.0 m, P = 10⁴ N, and θ = 30°

Figure 2.11 Two‐bar truss.
2.22 The profit per acre of a farm is given by

where x ₁ and x ₂ denote, respectively, the labor cost and the fertilizer cost. Find the values of x ₁ and x ₂ to maximize the profit.
2.23 The temperatures measured at various points inside a heated wall are as follows:

Distance from the heated surface as a percentage of wall thickness, d 0 25 50 75 100

Temperature, t(°C) 380 200 100 20 0

It is decided to approximate this table by a linear equation (graph) of the form t = a + bd, where a and b are constants. Find the values of the constants a and b that minimize the sum of the squares of all differences between the graph values and the tabulated values.
2.24 Find the second‐order Taylor's series approximation of the function

at the points (a) (0,0) and (b) (1,1).
2.25 Find the third‐order Taylor's series approximation of the function

at point (1, 0, −2).
2.26 The volume of sales (f) of a product is found to be a function of the number of newspaper advertisements (x) and the number of minutes of television time (y) as

Each newspaper advertisement or each minute on television costs $1000. How should the firm allocate $48 000 between the two advertising media for maximizing its sales?
2.27 Find the value of x* at which the following function attains its maximum:
2.28 (a) (b) (c) (d) It is possible to establish the nature of stationary points of an objective function based on its quadratic approximation. For this, consider the quadratic approximation of a two‐variable function as

where

If the eigenvalues of the Hessian matrix, [c], are denoted as β ₁ and β ₂, identify the nature of the contours of the objective function and the type of stationary point in each of the following situations.
1. (a) β ₁ = β ₂; both positive
2. (b) β ₁ > β ₂; both positive
3. (c) |β ₁| = |β ₂|; β ₁ and β ₂ have opposite signs
4. (d) β ₁ > 0, β ₂ = 0
Plot the contours of each of the following functions and identify the nature of its stationary point.
2.29 f = 2 − x ² − y ² + 4xy
2.30 f = 2 + x ² − y ²
2.31 f = xy
2.32 f = x ³ − 3xy ²
2.33 Find the admissible and constrained variations at the point X = {0, 4}^T for the following problem:

subject to
2.34 Find the diameter of an open cylindrical can that will have the maximum volume for a given surface area, S.
2.35 A rectangular beam is to be cut from a circular log of radius r. Find the cross‐sectional dimensions of the beam to (a) maximize the cross‐sectional area of the beam, and (b) maximize the perimeter of the beam section.
2.36 Find the dimensions of a straight beam of circular cross section that can be cut from a conical log of height h and base radius r to maximize the volume of the beam.
2.37 The deflection of a rectangular beam is inversely proportional to the width and the cube of depth. Find the cross‐sectional dimensions of a beam, which corresponds to minimum deflection, that can be cut from a cylindrical log of radius r.
2.38 A rectangular box of height a and width b is placed adjacent to a wall (Figure 2.12). Find the length of the shortest ladder that can be made to lean against the wall.

Figure 2.12 Ladder against a wall.
2.39 Show that the right circular cylinder of given surface (including the ends) and maximum volume is such that its height is equal to the diameter of the base.
2.40 Find the dimensions of a closed cylindrical soft drink can that can hold soft drink of volume V for which the surface area (including the top and bottom) is a minimum.
2.41 An open rectangular box is to be manufactured from a given amount of sheet metal (area S). Find the dimensions of the box to maximize the volume.
2.42 Find the dimensions of an open rectangular box of volume V for which the amount of material required for manufacture (surface area) is a minimum.
2.43 A rectangular sheet of metal with sides a and b has four equal square portions (of side d) removed at the corners, and the sides are then turned up in order to form an open rectangular box. Find the depth of the box that maximizes the volume.
2.44 Show that the cone of the greatest volume that can be inscribed in a given sphere has an altitude equal to two‐thirds of the diameter of the sphere. Also prove that the curved surface of the cone is a maximum for the same value of the altitude.
2.45 Prove Theorem 2.6.
2.46 A log of length l is in the form of a frustum of a cone whose ends have radii a and b(a > b). It is required to cut from it a beam of uniform square section. Prove that the beam of greatest volume that can be cut has a length of al/[3(a − b)].
2.47 It has been decided to leave a margin of 30 mm at the top and 20 mm each at the left side, right side, and the bottom on the printed page of a book. If the area of the page is specified as 5 × 10⁴ mm², determine the dimensions of a page that provide the largest printed area.
2.48

subject to

by (a) direct substitution, (b) constrained variation, and (c) Lagrange multiplier method.
2.49

subject to

by (a) direct substitution, (b) constrained variation, and (c) Lagrange multiplier method.
2.50 Find the values of x, y, and z that maximize the function

when x, y, and z are restricted by the relation xyz = 16.
2.51 A tent on a square base of side 2a consists of four vertical sides of height b surmounted by a regular pyramid of height h. If the volume enclosed by the tent is V, show that the area of canvas in the tent can be expressed as

Also, show that the least area of the canvas corresponding to a given volume V, if a and h can both vary, is given by
2.52 A department store plans to construct a one‐story building with a rectangular planform. The building is required to have a floor area of 22 500 ft² and a height of 18 ft. It is proposed to use brick walls on three sides and a glass wall on the fourth side. Find the dimensions of the building to minimize the cost of construction of the walls and the roof, assuming that the glass wall costs twice as much as that of the brick wall and the roof costs three times as much as that of the brick wall per unit area.
2.53 Find the dimensions of the rectangular building described in Problem 2.52 to minimize the heat loss, assuming that the relative heat losses per unit surface area for the roof, brick wall, glass wall, and floor are in the proportion 4 : 2 : 5 : 1.
2.54 A funnel, in the form of a right circular cone, is to be constructed from a sheet metal. Find the dimensions of the funnel for minimum lateral surface area when the volume of the funnel is specified as 200 in.³
2.55 Find the effect on f* when the value of A ₀ is changed to (a) 25π and (b) 22π in Example 2.10 using the property of the Lagrange multiplier.
56
1. (56) (56) Find the dimensions of a rectangular box of volume V = 1000 in.³ for which the total length of the 12 edges is a minimum using the Lagrange multiplier method.
2. (56) (56) Find the change in the dimensions of the box when the volume is changed to 1200 in.³ by using the value of λ* found in part (a).
3. (56) (56) Compare the solution found in part (b) with the exact solution.
2.57 Find the effect on f* of changing the constraint to (a) x + x ₂ + 2x ₃ = 4 and (b) x + x ₂ + 2x ₃ = 2 in Problem 2.48. Use the physical meaning of Lagrange multiplier in finding the solution.
2.58 A real estate company wants to construct a multistory apartment building on a 500 × 500‐ft lot. It has been decided to have a total floor space of 8 × 10⁵ ft². The height of each story is required to be 12 ft, the maximum height of the building is to be restricted to 75 ft, and the parking area is required to be at least 10% of the total floor area according to the city zoning rules. If the cost of the building is estimated at $(500, 000h + 2000F + 500P), where h is the height in feet, F is the floor area in square feet, and P is the parking area in square feet. Find the minimum cost design of the building.
2.59 The Brinell hardness test is used to measure the indentation hardness of materials. It involves penetration of an indenter, in the form of a ball of diameter D (mm), under a load P (kg_f), as shown in Figure 2.13a. The Brinell hardness number (BHN) is defined as
(2.79)

where A (in mm²) is the spherical surface area and d (in mm) is the diameter of the crater or indentation formed. The diameter d and the depth h of indentation are related by (Figure 2.13b)

(2.80)

Figure 2.13 Brinell hardness test.

It is desired to find the size of indentation, in terms of the values of d and h, when a tungsten carbide ball indenter of diameter 10 mm is used under a load of P = 3000 kg_f on a stainless steel test specimen of BHN 1250. Find the values of d and h by formulating and solving the problem as an unconstrained minimization problem.

Hint: Consider the objective function as the sum of squares of the equations implied by Eqs. (2.79) and (2.80).
2.60 A manufacturer produces small refrigerators at a cost of $60 per unit and sells them to a retailer in a lot consisting of a minimum of 100 units. The selling price is set at $80 per unit if the retailer buys 100 units at a time. If the retailer buys more than 100 units at a time, the manufacturer agrees to reduce the price of all refrigerators by 10 cents for each unit bought over 100 units. Determine the number of units to be sold to the retailer to maximize the profit of the manufacturer.
2.61 Consider the following problem:

subject to

Using Kuhn–Tucker conditions, find which of the following vectors are local minima:
2.62 Using Kuhn–Tucker conditions, find the value(s) of β for which the point will be optimal to the problem:

subject to

Verify your result using a graphical procedure.
63 Consider the following optimization problem:

subject to
1. (63) (63) Find whether the design vector X = {1, 1}^T satisfies the Kuhn–Tucker conditions for a constrained optimum.
2. (63) (63) What are the values of the Lagrange multipliers at the given design vector?
2.64 Consider the following problem:

subject to

Determine whether the Kuhn–Tucker conditions are satisfied at the following points:
2.65 Find a usable and feasible direction S at (a) X ₁ = {−1, 5}^T and (b) X ₂ = {2, 3}^T for the following problem:

subject to
2.66 Consider the following problem:

subject to

Determine whether the following search direction is usable, feasible, or both at the design vector :
2.67 Consider the following problem:

subject to

Determine whether the following vector represents an optimum solution:
2.68

subject to the constraints

using Kuhn–Tucker conditions.
2.69

subject to

by (a) the graphical method and (b) Kuhn–Tucker conditions.
2.70

subject to

by applying Kuhn–Tucker conditions.
2.71 Consider the following problem:

subject to

Determine whether the constraint qualification and Kuhn–Tucker conditions are satisfied at the optimum point.
2.72 Consider the following problem:

subject to

Determine whether the constraint qualification and the Kuhn–Tucker conditions are satisfied at the optimum point.
2.73 Verify whether the following problem is convex:

subject to
74 Check the convexity of the following problems.
1. (74) (74)
  
  subject to
2. (74) (74)
  
  subject to
2.75 Identify the optimum point among the given design vectors, X ₁, X ₂, and X ₃, by applying the Kuhn–Tucker conditions to the following problem:

subject to
2.76 Consider the following optimization problem:

subject to

Find a usable feasible direction at each of the following design vectors:

Notes

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Distance from the heated surface as a percentage of wall thickness, d	0	25	50	75	100
Temperature, t(°C)	380	200	100	20	0

Table of Contents for 2 Classical Optimization Techniques

Create new playlist

Sign In

Sign Up

2.1 Introduction

2.2 Single‐Variable Optimization

2.3 Multivariable Optimization with no Constraints

2.3.1 Definition: rth Differential of f

2.3.2 Semidefinite Case

2.3.3 Saddle Point

2.4 Multivariable Optimization with Equality Constraints

2.4.1 Solution by Direct Substitution

2.4.2 Solution by the Method of Constrained Variation

Necessary Conditions for a General Problem

Sufficiency Conditions for a General Problem

2.4.3 Solution By the Method of Lagrange Multipliers

Problem with Two Variables and One Constraint

Necessary Conditions for a General Problem

Sufficiency Conditions for a General Problem

Interpretation of the Lagrange Multipliers

2.5 Multivariable Optimization with Inequality Constraints

2.5.1 Kuhn–Tucker Conditions

2.5.2 Constraint Qualification

2.6 Convex Programming Problem

Notes:

References and Bibliography

Review Questions

Problems

Notes

Table of Contents for
2 Classical Optimization Techniques