13
Simplex‐Based Methods of Optimization

13.1 Introduction

In this chapter the standard simplex method, or a slight modification thereof, will be used to solve an assortment of specialized linear as well as essentially or structurally nonlinear decision problems. In this latter instance, a set of transformations and/or optimality conditions will be introduced that linearizes the problem so that the simplex method is only indirectly applicable. An example of the first type of problem is game theory, while the second set of problems includes quadratic and fractional functional programming.

13.2 Quadratic Programming

The quadratic programming problem under consideration has the general form

(13.1)  maximize f(X) = CX + XQX  subject to  AX ≤ b, X ≥ O.

Here, the objective function is the sum of a linear form and a quadratic form and is to be optimized in the presence of a set of linear inequalities. Additionally, both C and X are of order (p × 1), Q is a (p × p) symmetric coefficient matrix, A is the (m × p) coefficient matrix of the linear structural constraint system, and b (assumed ≥ O) is the (m × 1) requirements vector. If Q is not symmetric, then it can be transformed, by a suitable redefinition of coefficients, into a symmetric matrix without changing the value of XQX. For instance, if a matrix A from XAX is not symmetric, then it can be replaced by a symmetric matrix B if bij = (aij + aji)/2 for all i, j or B = (A + A′)/2. Thus, bij + bji = 2bij = aij + aji is the coefficient of xixj, i ≠ j, in

images

So if

images

then, under this transformation,

images

In this regard, for the quadratic function images the coefficient matrix may simply be assumed symmetric and written

images

or if

images

the associated quadratic function may be expressed as images For additional details on quadratic forms, see Appendix 13.A.
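The symmetrization rule is easy to verify numerically. The short sketch below (ours, with arbitrary illustrative data) confirms that replacing a non‐symmetric A by B = (A + A′)/2 leaves the value of the quadratic form unchanged:

```python
import numpy as np

# Replacing a non-symmetric A by B = (A + A')/2 leaves X'AX unchanged.
A = np.array([[1.0, 4.0],
              [2.0, 3.0]])            # not symmetric: a12 != a21
B = (A + A.T) / 2                     # b_ij = (a_ij + a_ji)/2
X = np.array([3.0, -2.0])
print(X @ A @ X, X @ B @ X)           # both print -15.0
```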

We mentioned in earlier chapters that an optimal solution to a linear programming problem always occurs at a vertex (extreme point) of the feasible region K = {X|AX ≤ b, X ≥ O} or at a convex combination of a finite number of extreme points of K (i.e. along an edge of K). This is not the case for a quadratic programming problem. An optimal solution to a quadratic program may occur at an extreme point or along an edge or at an interior point of K (see Figure 13.1a–c, respectively, wherein K and the contours of f are presented).


Figure 13.1 (a) Constrained maximum of f at extreme point A. (b) Constrained maximum of f occurs along the edge BC. (c) Constraints not binding at A.

To further characterize the attainment of an optimal solution to (13.1), let us note that with K a convex set, any local maximum of f will also be the global maximum if f is concave. Clearly CX is concave since it is linear, while XQX is concave if it is negative semi‐definite or negative definite. Thus, f = CX + XQX is concave (since the sum of a finite number of concave functions is itself a concave function) on K so that the global maximum of f is attained. In what follows, we shall restrict Q to the negative definite case. Then f may be characterized as strictly concave so that it takes on a unique or strong global maximum on K.1 If XQX is negative semi‐definite, then it may be transformed to a negative definite quadratic form by making an infinitesimally small change in the diagonal elements of Q, i.e. XQX is replaced by X′(Q − εIp)X = XQX − εXIpX < 0 since XQX ≤ 0 for any X and −εXIpX < 0 for any X ≠ O.
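The ε‐perturbation argument can likewise be illustrated numerically; in the sketch below (our example data) Q is negative semi‐definite with a zero eigenvalue, and subtracting εIp makes every eigenvalue strictly negative:

```python
import numpy as np

# A negative semi-definite Q (eigenvalues 0 and -2) becomes negative
# definite once the diagonal is perturbed by a small eps > 0.
Q = np.array([[-1.0,  1.0],
              [ 1.0, -1.0]])
eps = 1e-6
print(np.linalg.eigvalsh(Q))                    # [-2., 0.]
print(np.linalg.eigvalsh(Q - eps * np.eye(2)))  # both strictly negative
```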

From (13.1) we may form the Lagrangian function

L(X, U1) = CX + XQX + U1′(b − AX),

where U1 is an (m × 1) vector of Lagrange multipliers.

Then from the Karush‐Kuhn‐Tucker necessary and sufficient conditions for a constrained maximum (12.4),

(13.2)
  (a) ∂L/∂X = C + 2QX − A′U1 ≤ O;
  (b) X′(∂L/∂X) = X′(C + 2QX − A′U1) = 0;
  (c) ∂L/∂U1 = b − AX ≥ O;
  (d) U1′(∂L/∂U1) = U1′(b − AX) = 0;
  (e) X ≥ O, U1 ≥ O.

Before proceeding to the development of a solution algorithm for the quadratic programming problem, let us examine an important interpretation of (13.2). Specifically, if Xo solves (13.1), then there exists a vector U1o ≥ O such that (Xo, U1o) is a saddle point (see Chapter 12 for a review of this concept) of the Lagrangian L(X, U1), in which case (13.2) holds. To see this, we need only note that if L(X, U1o) has a local maximum in the X direction at Xo, then it follows from (13.2a, b, e) that

L(X, U1o) ≤ L(Xo, U1o) for all X ≥ O,

while if L(Xo, U1) attains a minimum in the U1 direction at U1o, then, from (13.2d, c, e),

L(Xo, U1o) ≤ L(Xo, U1) for all U1 ≥ O.

Upon examining (13.2) we see that solving the quadratic programming problem amounts to finding a solution in 2(m + p) nonnegative variables to the linear system

(13.2.1)  −2QX + A′U1 − U2 = C,  AX + Y = b,  X, Y, U1, U2 ≥ O

(here U2 and Y are, respectively, the surplus and slack variables that convert (13.2a) and (13.2c) to equalities) that satisfies the m + p (nonlinear) complementary slackness conditions X′U2 = 0, U1′Y = 0. Under this latter set of restrictions, at least p of the 2p  xjs, u2js must be zero while at least m of the 2m  u1is, yis must equal zero. Thus, at least m + p of the 2(m + p) variables xj, u2j, u1i, yi must vanish at the optimal solution. In this regard, an optimal solution to (13.2.1) must be a basic feasible solution (Barankin and Dorfman 1958). So, if the (partitioned) vector (Xo, Yo, U1o, U2o) is a feasible solution to (13.2.1) and the complementary slackness conditions are satisfied (no more than m + p of the mutually complementary variables are positive), then the resulting solution is also a basic feasible solution to (13.2.1) with the Xo component solving the quadratic programming problem.

To actually obtain an optimal basic feasible solution to (13.2.1) we shall use the complementary pivot method presented in Section 13.4. But first we look to dual quadratic programs.

13.3 Dual Quadratic Programs

A well‐known result from duality theory in generalized nonlinear programming that is applicable for our purposes here is that if the Lagrangian for the primal quadratic program (13.1) is written as L(X, U1) = CX + XQX + U1′(b − AX), then the dual quadratic program appears as

(13.3)  minimize g(X, U1) = b′U1 − XQX  subject to  A′U1 − 2QX ≥ C, X ≥ O, U1 ≥ O.

In what follows, a few important theorems for quadratic programming problems will be presented. The first two demonstrate the similarity between linear and quadratic programming as far as the relationship of the primal and dual objective functions is concerned. To this end, we state Theorem 13.1 as follows.

Next,

Finally, the principal duality result of this section is presented by

13.4 Complementary Pivot Method

(Lemke 1965)

A computational routine that explicitly requires that a complementarity condition holds at each iteration is Lemke’s complementary pivot method. This technique is designed to generate a solution to the linear complementarity problem (LCP): given a vector q ∈ ℛn and an nth order matrix M, find a vector Z ∈ ℛn such that

(13.7)  q + MZ ≥ O, Z ≥ O, Z′(q + MZ) = 0.

If W = q + MZ, then (13.7) becomes: find vectors W, Z ∈ ℛn, such that

(13.8)
  (a) W − MZ = q, W ≥ O, Z ≥ O;
  (b) W′Z = 0.

Here W′Z = 0 (or wizi = 0, i = 1, …, n) is called the complementarity condition, where the variables wi and zi, called a complementary pair, are said to be complements of each other, i.e. at least one variable in each pair (wi, zi) must be zero.

A nonnegative solution (W, Z) to the linear system (13.8a) is called a feasible solution to the LCP. Moreover, a feasible solution (W, Z) to the LCP that also satisfies the complementarity condition W′Z = 0 is termed a complementary solution. (Note that (13.7) and (13.8) contain no objective function, and there is only a single structural constraint in the variables W, Z, namely the complementarity restriction W′Z = 0.)

If q ≥ O, then there exists a trivial complementary basic solution given by W = q, Z = O. Hence (13.8) has a nontrivial solution only when q ≱ O. In this circumstance, the initial basic solution given by W = q, Z = O is infeasible even though the complementarity condition W′Z = 0 holds.

For the general quadratic program

(13.1)  maximize f(X) = CX + XQX  subject to  AX ≤ b, X ≥ O,

let us repeat the Karush‐Kuhn‐Tucker conditions for (13.1) as

(13.2)  C + 2QX − A′U1 ≤ O, X′(C + 2QX − A′U1) = 0, b − AX ≥ O, U1′(b − AX) = 0, X ≥ O, U1 ≥ O,

or

(13.9)  U2 = −C − 2QX + A′U1 ≥ O, Y = b − AX ≥ O, X′U2 = 0, U1′Y = 0, X ≥ O, U1 ≥ O.

Let us define

W = (U2′, Y′)′, Z = (X′, U1′)′, q = (−C′, b′)′, and

M =
[−2Q  A′]
[−A   O ],

where W, Z, q are of order (n × 1) and M is of order (n × n), with n = p + m.

Then (13.9) can be rewritten in condensed form as

(13.10)  W − MZ = q, W ≥ O, Z ≥ O, W′Z = 0.

As we shall now see, these conditions are solved by a pivoting process. This will be accomplished by introducing an auxiliary (artificial) variable zo ∈ ℛ and a sum vector 1 ∈ ℛn, n = p + m, into (13.10) so as to obtain

(13.11)  W − MZ − 1zo = q, W ≥ O, Z ≥ O, with wizi = 0 for at least n − 1 of the i values given zo ≥ 0.

The algorithm starts with an initial basic solution to (13.8) of the form W = q, Z = O, where q ≱ O. To avoid having at least one wi < 0, the nonnegative artificial variable zo will be introduced at a sufficiently positive level (take zo = −mini{qi}) into the left‐hand side of W − MZ = q. Hence our objective is to find vectors W, Z and a variable zo such that (13.11) holds.

With each new right‐hand side value qi + zo nonnegative in (13.11), a basic solution to the same amounts to W = q + 1zo ≥ O, Z = O. While this basic solution is nonnegative and satisfies all the relationships in (13.11) (including complementarity), it is infeasible for the original LCP (13.8) since zo > 0. A solution such as this will be termed an almost complementary basic solution. So while a complementary basic solution to (13.8) is one that contains exactly one basic variable from each complementary pair of variables (wi, zi), i = 1, …, n, an almost complementary basic solution of (13.8) is a basic solution for (13.11) in which zo is a basic variable and there is exactly one basic variable from each of only n − 1 complementary pairs of variables.

The complementary pivot method has us move from one almost complementary basic solution of (13.8) to another until we reach a complementary basic solution to (13.8), in which case we have wizi = 0 for all i = 1, …, n. At this point, the algorithm is terminated. To see this, let us rewrite (13.11) as

InW − MZ − 1zo = q,

with associated simplex matrix

(13.12)  [In  −M  −1 | q].

To find an initial almost complementary basic solution, the artificial variable zo is made basic by having it replace the current basic variable with the most negative qi value. Suppose qr = mini{qi} < 0. Hence, zo replaces wr in the set of basic variables. To accomplish this we pivot on the (circled) element −1 in (13.12). This yields

where

images

As (13.13) reveals, an almost complementary basic solution to (13.8) is zo = −qr, wi = qi − qr for i ≠ r, wr = 0, and Z = O.

The complementary pivot algorithm generates a sequence of almost complementary basic solutions until zo becomes nonbasic or zero. Moreover, pivoting must be done in a fashion such that: (i) complementarity between the variables wi, zi is maintained at each iteration; and (ii) each successive basic solution is nonnegative.

Now, at the moment, both wr and zr are nonbasic (wrzr = 0 is satisfied). Since wr turned nonbasic in (13.13), the appropriate variable to choose for entry into the set of basic variables is its complement zr. In fact, this selection criterion for determining the nonbasic variable to become basic is referred to as the complementary pivot rule: choose as the incoming basic variable the one complementary to the basic variable that just turned nonbasic.

Once the entering variable is selected, the outgoing variable can readily be determined from the standard simplex exit criterion, i.e. the pivot row k (and thus the outgoing basic variable wk) is chosen according to

(13.14)  q̄k/ȳk = mini {q̄i/ȳi | ȳi > 0},

where ȳi is the ith component of the updated entering column and q̄i is the ith updated right‐hand side value. A pivot operation is then performed using ȳk as the pivot element. Once wk turns nonbasic, the complementary pivot rule next selects zk for entry into the set of basic variables. Complementary pivoting continues until one of two possible outcomes obtains, at which point the complementary pivot algorithm terminates:

  1. The simplex exit criterion selects row r as the pivot row and zo turns nonbasic. The resulting solution is a complementary basic solution to (13.8).
  2. No element of the pivot column in (13.14) is positive, so the ratio test fails (ray termination). (In this latter instance, the problem either has no feasible solution or, if a primal (dual) feasible solution exists, the primal (dual) objective function is unbounded.)
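To make the routine concrete, here is a compact sketch of the complementary pivot method as just described. It is ours, not the text’s: it assumes a nondegenerate problem (the tie‐breaking needed when the ratio test (13.14) produces ties, in particular ties involving the zo row, is omitted), and it stores the tableau densely.

```python
import numpy as np

def lemke(M, q, max_iter=200):
    """Complementary pivot sketch for the LCP (13.8): find W, Z >= O with
    W - MZ = q and W'Z = 0. Nondegeneracy is assumed."""
    M, q = np.asarray(M, float), np.asarray(q, float)
    n = len(q)
    if np.all(q >= 0):                       # trivial solution: W = q, Z = O
        return np.zeros(n)
    # Tableau columns: w1..wn | z1..zn | z0 | rhs, as in (13.12)
    T = np.hstack([np.eye(n), -M, -np.ones((n, 1)), q.reshape(-1, 1)])
    basis = list(range(n))                   # W starts basic

    def pivot(row, col):
        T[row] /= T[row, col]
        for i in range(n):
            if i != row:
                T[i] -= T[i, col] * T[row]

    r = int(np.argmin(q))                    # most negative q_i
    pivot(r, 2 * n)                          # z0 enters, w_r leaves
    leaving, basis[r] = basis[r], 2 * n
    entering = leaving + n                   # complement of w_r is z_r
    for _ in range(max_iter):
        col = T[:, entering]
        if np.all(col <= 1e-12):             # termination outcome 2 (ray)
            raise RuntimeError("no complementary solution found")
        ratios = np.full(n, np.inf)
        mask = col > 1e-12
        ratios[mask] = T[mask, -1] / col[mask]
        row = int(np.argmin(ratios))         # simplex exit criterion (13.14)
        pivot(row, entering)
        leaving, basis[row] = basis[row], entering
        if leaving == 2 * n:                 # z0 turned nonbasic: outcome 1
            Z = np.zeros(n)
            for i, b in enumerate(basis):
                if n <= b < 2 * n:
                    Z[b - n] = T[i, -1]
            return Z
        entering = (leaving + n) % (2 * n)   # complementary pivot rule
    raise RuntimeError("iteration limit reached")
```

For instance, lemke(np.eye(2), np.array([-1.0, -1.0])) returns Z = (1, 1), so that W = q + MZ = O and the complementarity condition holds.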

13.5 Quadratic Programming and Activity Analysis

In Chapter 7 we considered a generalized linear activity analysis model in which a perfectly competitive firm produces p different products, each corresponding to a separate activity operated at the level xj, j = 1, …, p. The exact profit maximization model under consideration was given by (7.27), i.e.

images

or, in matrix form,

images

In this formulation, total revenue is images total variable input cost is images and total conversion cost is images Here pj (the jth component of P) is the constant price per unit of output of activity j, qi (the ith component of Q) is the constant unit price of the ith variable input, and images is the cost of converting the ml fixed inputs to the operation of the jth activity at the unit level, where rij (the ith component of rj) is the cost of converting one unit of ith fixed factor to the jth activity.

Let us now relax the assumption of given product prices pj and assume, instead, that the firm is a monopolist in the markets for the p outputs. In this regard, there exists a downward sloping demand curve for each of the p products so that an inverse relationship exists between pj and xj, e.g. pj = aj − bjxj, dpj/dxj = −bj < 0. Then total revenue from sales of the output of the jth activity is TRj = pjxj = ajxj − bjxj². Hence total revenue is now TR = a′X − XBX, where a′ = (a1, …, ap) and B is a (p × p) symmetric positive definite diagonal matrix of the form

B =
[b1 0  ⋯ 0 ]
[0  b2 ⋯ 0 ]
[⋮       ⋮ ]
[0  0  ⋯ bp].

Clearly total revenue under this new specification of price behavior is the sum of a linear and a quadratic form in X. Moreover, the derivative of total revenue TR with respect to X is marginal revenue MR or

MR = dTR/dX = a − 2BX,

i.e. the jth component of this vector is the marginal revenue from the sale of xj or MRj = dTRj/dxj = aj − 2bjxj, j = 1, …, p.

In view of this respecification of total revenue, the above profit maximization model appears as

images

or

Additionally, the Karush‐Kuhn‐Tucker necessary and sufficient conditions for an optimum appear (using (13.2)) as:

Upon examining (13.20) we see that images or, in terms of the individual components of this inequality, the imputed cost of operating activity j at the unit level is at least as great as the gross profit margin for activity j, the latter being expressed as the marginal revenue of activity j less the total cost of operating activity j at the unit level less the cost of converting the ml fixed factors to the operation of the jth activity at the unit level. (The interpretation of (13.20b, c, d, e) is similar to the one advanced for (12.28) and will not be duplicated here.)

Turning next to the dual program associated with (13.19) we have

or, for X fixed at Xo (the optimal solution vector to the primal quadratic program),

Either (13.21) or (13.21.1) is appropriate for considering the dual program from a computational viewpoint. However, for purposes of interpreting the dual program in an activity analysis context, let us first examine the generalized nonlinear dual objective from which g (X, U) in (13.21) was derived, namely

The first term on the right‐hand side of (13.22) is simply the total imputed value of the firm’s supply of fixed or scarce resources (the dual objective in a linear activity analysis model). The second term can be interpreted as economic rent (the difference between total profit and the total imputed cost of all inputs used).3 Finally, if the square‐bracketed portion of the third term (which is nonpositive by virtue of (13.20a)) is thought of as a set of accounting or opportunity losses generated by a marginal increase in the level of operation of activity j, then the entire third term is a weighted sum of these losses (the weights being the various activity levels) and thus amounts to the marginal opportunity loss of all outputs. At an optimal solution, however, (13.20b) reveals that this third term is zero (Balinski and Baumol 1968). Next, upon examining the dual structural constraint in (13.21) we see that images so that, as in the linear activity model, the imputed cost of operating each activity at the unit level must equal or exceed its gross profit margin. In sum, the dual problem seeks a constrained minimum to the total imputed value of all scarce resources plus payments to economic rent plus losses due to unprofitable activities.
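To tie Sections 13.4 and 13.5 together, the sketch below pushes a small quadratic program of the form (13.1) through the Karush‐Kuhn‐Tucker system (13.9)–(13.10) and the lemke routine of Section 13.4. The data are hypothetical (a single net linear coefficient vector c stands in for whatever revenue, input cost, and conversion cost terms the model specifies); this is an illustration, not the text’s own example.

```python
import numpy as np

# Hypothetical two-activity data for max c'X + X'QX, Q = -B (B diagonal,
# positive definite), subject to AX <= b, X >= O.
c = np.array([8.0, 6.0])
B = np.diag([1.0, 1.0])
A = np.array([[1.0, 1.0]])
b = np.array([4.0])

# LCP data per (13.9)-(13.10): W = (U2, Y), Z = (X, U1),
# q = (-C, b), M = [[-2Q, A'], [-A, O]] = [[2B, A'], [-A, O]].
M = np.block([[2.0 * B, A.T],
              [-A, np.zeros((1, 1))]])
q = np.concatenate([-c, b])
Z = lemke(M, q)                  # routine sketched in Section 13.4
X, U1 = Z[:2], Z[2:]             # primal solution and dual variable
```

For these data the routine returns X = (2.5, 1.5) and U1 = (3), which can be checked directly against (13.2): 8 − 2(2.5) = 3 = 6 − 2(1.5), with the constraint x1 + x2 = 4 binding.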

13.6 Linear Fractional Functional Programming

(Charnes and Cooper 1962; Lasdon 1970; Martos 1964; Craven 1978)

In what follows we shall employ the simplex algorithm to solve an optimization problem, known as a linear fractional programming problem, in which the objective function is nonlinear in that it is expressed as the ratio of two linear functions and the variables must satisfy a system of linear inequalities and nonnegativity conditions. Specifically, let us

(13.23)  maximize f(X) = (co + CX)/(do + DX)  subject to  AX ≤ b, X ≥ O,

where co, do are scalars, C and D are (p × 1) vectors with components cj, dj respectively, j = 1, …, p, and A is of order (m × p). Although f is neither convex nor concave, its contours (co + CX)/(do + DX) = constant are hyperplanes. Moreover, any finite constrained local maximum of f is also global in character and occurs at an extreme point of the feasible region K. That is,

Let us next examine a couple of direct (in the sense that no variable transformation or additional structural constraints or variables are introduced) approaches to solving the above fractional program, which mirror the standard simplex method. For the first technique (Swarup 1965), let us start from an initial basic feasible solution and, under the assumption that do + DX > 0 for all X ∈ K, demonstrate the conditions under which the objective value can be improved. Doing so will ultimately provide us with a set of conditions that an optimal basic feasible solution must satisfy.

Upon introducing the slack variables xp + 1, …, xn into the structural constraint system, we may write the (m × n) coefficient matrix of the same as A = [B, R], where the (m × m) matrix B = [b1, …, bm] has as its columns the columns of A corresponding to basic variables and the (m × (n − m)) matrix R contains all remaining columns of A. Then solving for the basic variables in terms of the nonbasic variables we have the familiar equality XB = B−1b − B−1RXR, where X = (XB′, XR′)′ is partitioned conformably. For XR = O, we have the basic solution XB = B−1b. If XB ≥ O, the solution is deemed feasible. Let us partition the vectors C, D as

C = (CB′, CR′)′,  D = (DB′, DR′)′,

respectively, where the (m × 1) vectors CB, DB contain as their elements the objective function coefficients corresponding to the basic variables while the components of the ((n − m) × 1) vectors CR, DR correspond to the objective function coefficients associated with nonbasic variables.

Let

zB1 = co + CB′XB,  zB2 = do + DB′XB,

and, for those nonbasic columns rj, j = 1, …, n − m, of R, let rj = BYj so that Yj = B−1rj, i.e. Yj is the jth column of B−1R. If z1j = CB′Yj and z2j = DB′Yj, then the optimality evaluators may be expressed as

z1j − cRj = CB′Yj − cRj,  z2j − dRj = DB′Yj − dRj,

where cRj, dRj are, respectively, the jth component of CR, DR, j = 1, …, n − m.

Our goal is now to find an alternative basic feasible solution that exhibits an improved value of f = zB1/zB2. If we change the basis one vector at a time by replacing bj by rj, we obtain a new basis matrix images where images Then the values of the new basic variables are, from (4.11.1),

images

And from (4.12), the new value of the objective function is

images

Clearly the value of f will improve if

images

since ẑB2, zB2 > 0 given that do + DX > 0. Simplifying the preceding inequality yields

(13.28)  Δj = zB2(cRj − z1j) − zB1(dRj − z2j) > 0.

(If Δj = 0, the value of f is unchanged and the degenerate case emerges.)

If for any rj we find that Δj > 0 and if at least one component yij of Yj is positive (here Yj corresponds to the first m elements of the jth nonbasic column of the simplex matrix), then it is possible to obtain a new basic feasible solution from the old one by replacing one of the columns of B by rj with the result that f̂ > f. For the entry criterion, let us adopt the convention that the incoming basic variable is chosen according to

Δr = maxj {Δj | Δj > 0},

i.e. xBr enters the set of basic variables. (Note that this choice criterion gives us the largest increase in f.) In addition, the exit criterion is the same as the one utilized by the standard simplex routine. This procedure may be repeated until, in the absence of degeneracy, the process converges to an optimal basic feasible solution. Termination occurs when Δj ≤ 0 for all nonbasic columns rj, j = 1, …, n − m. It is imperative to note that this method cannot be used if both co, do are zero, i.e. if we start from an initial basic feasible solution with CB = DB = O and co = do = 0, then zB1 = zB2 = 0 also, and thus Δj = 0 for all j, indicating (incorrectly) that the initial basic feasible solution is optimal. Moreover, with do = 0 and DB = O, such a starting point violates the requirement that do + DX > 0.

Relative to the second direct technique (Bitran and Novaes 1973) for solving the fractional functional program, let us again assume that do + DX > 0 for all X ∈ K and use the standard simplex method to solve a related problem that is constructed from the former and appears as

(13.29)  maximize λ′X  subject to  AX ≤ b, X ≥ O,  where λ = C − [(C′D)/(D′D)]D.

Here λ equals C minus the vector projection of C onto D (this latter notion is defined as a vector in the direction of D obtained by projecting C perpendicularly onto D). Upon solving (13.29), we obtain the suboptimal solution point X*.

The next step in the algorithm involves utilizing X* to construct a new objective function to be optimized subject to the same constraints, i.e. we now

(13.30)  maximize [(do + DX*)C − (co + CX*)D]′X  subject to  AX ≤ b, X ≥ O.

The starting point for solving (13.30) is the optimal simplex matrix for problem (13.29); all that needs to be done to initiate (13.30) is to replace the objective function row of the optimal simplex matrix associated with (13.29) by the objective in (13.30). The resulting basic feasible solution will be denoted as X**.

If X** = X*, then X* represents the global maximal solution to the original fractional program; otherwise return to (13.30) with X* = X** and repeat the process until the solution vector remains unchanged. (For a discussion on the geometry underlying this technique, see Bitran and Novaes 1973, pp. 25–26.)

13.7 Duality in Linear Fractional Functional Programming

(Craven and Mond 1973, 1976; Schnaible 1976; Wagner and Yuan 1968; Chadha 1971; Kydland 1962; and Kornbluth and Salkin 1972)

To obtain the dual of a linear fractional functional program, let us dualize its equivalent linear program. That is, under the variable transformation Y = tX, t = (do + DX)−1, the linear fractional programming problem (13.23) or

maximize f(X) = (co + CX)/(do + DX)  subject to  AX ≤ b, X ≥ O,

is equivalent to the linear program

(13.32)  maximize cot + CY  subject to
  (a) dot + DY = 1;
  (b) AY − bt ≤ O, Y ≥ O, t ≥ 0

if do + DX > 0 for all X ∈ K and for each vector Y ≥ O, the point (t, Y) = (0, Y) is not feasible, i.e. every feasible point (t, Y) has t > 0 (Charnes and Cooper 1962). (This latter restriction holds if K is bounded. For if (0, Ȳ) were a feasible solution to the equivalent linear program (13.32), then AȲ ≤ O, Ȳ ≥ O; and if X ∈ K, then X + θȲ ∈ K for all θ ≥ 0, thus contradicting the boundedness of K.) In this regard, if (to, Yo) represents an optimal solution to the equivalent linear program, then Xo = Yo/to is an optimal solution to the original fractional problem.

To demonstrate this equivalence let

Y = tX, t = (do + DX)−1 > 0, so that cot + CY = t(co + CX) = f(X).

Moreover, AX = A(Y/t) ≤ b or AY − tb ≤ O. If dot + DY = t(do + DX) = 1, then (13.32) immediately follows.
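The transformation is easy to mechanize. The sketch below (ours) builds (13.32) and hands it to a standard LP solver, then recovers Xo = Yo/to; it presumes, exactly as above, that do + DX > 0 on a bounded K.

```python
import numpy as np
from scipy.optimize import linprog

def solve_fractional(c0, C, d0, D, A, b):
    """Charnes-Cooper sketch: max (c0 + C'X)/(d0 + D'X) s.t. AX <= b, X >= O,
    assuming d0 + D'X > 0 on a bounded feasible region K."""
    m, p = A.shape
    # Variables (Y, t): maximize c0*t + C'Y subject to (13.32)
    obj  = -np.concatenate([C, [c0]])                # linprog minimizes
    A_ub = np.hstack([A, -b.reshape(-1, 1)])         # AY - bt <= O
    A_eq = np.concatenate([D, [d0]]).reshape(1, -1)  # d0*t + D'Y = 1
    res  = linprog(obj, A_ub=A_ub, b_ub=np.zeros(m),
                   A_eq=A_eq, b_eq=[1.0], bounds=(0, None))
    Y, t = res.x[:p], res.x[p]
    return Y / t, -res.fun                           # X0 = Y0/t0 and f(X0)

# Tiny illustration: max (1 + 2x1 + x2)/(1 + x1 + x2) s.t. x1 + x2 <= 4
X0, f0 = solve_fractional(1.0, np.array([2.0, 1.0]), 1.0, np.array([1.0, 1.0]),
                          np.array([[1.0, 1.0]]), np.array([4.0]))
```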

It now remains to dualize (13.32). Let

images

Then (13.32) may be rewritten as

images

Upon replacing images the preceding problem becomes

images

The symmetric dual to this problem is then

images

or, for λ = w − v,

(13.33)  minimize λ  subject to
  (a) doλ − b′U ≥ co;
  (b) λD + A′U ≥ C, U ≥ O, λ unrestricted.

As was the case with the primal and dual problems under linear programming, we may offer a set of duality theorems (Theorems 13.5 and 13.6) that indicate the relationship between (13.23), (13.33). To this end, we have

We next have

While (13.33) represents the dual of the equivalent linear program (13.32), what does the general nonlinear dual (Panik 1976) to (13.23) look like? Moreover, what is the connection between the dual variables in (13.33) and those found in the general nonlinear dual to (13.23)? To answer these questions, let us write the said dual to (13.23) as

(13.36)  minimize L(X, U)  subject to  ∂L/∂X ≤ O, X ≥ O, U ≥ O,

where L(X, U) = (co + CX)/(do + DX) + U′(b − AX) is the Lagrangian associated with the primal problem (13.23). In this regard, this dual becomes

(13.37)  minimize (co + CX)/(do + DX) + U′(b − AX)  subject to  (do + DX)−1C − (do + DX)−2(co + CX)D − A′U ≤ O, X ≥ O, U ≥ O.

Since t = (do + DX)−1, the structural constraint in (13.37) becomes

(13.37.1)  tC − t²(co + CX)D − A′U ≤ O.

Since at an optimal solution to the primal‐dual pair of problems we must have λ = (co + CX)/(do + DX) (see Theorem 13.6), (13.37.1) becomes

(13.37.2)  tC − tλD − A′U ≤ O  or  λD + A′(U/t) ≥ C.

Then the structural constraint in (13.37) is equivalent to (13.33b) if the dual vector appearing in (13.33) is identified with U/t. Moreover, when U′(b − AX) = 0, the objective in (13.37) simplifies to λ, the same objective as found in (13.33). To summarize, solving (13.32) gives us m + 1 dual variables, one corresponding to each structural constraint. The dual variable for (13.32a) (namely λ) has the value attained by f at its optimum. The dual variables for (13.23) are then found by multiplying the dual variables for the constraints (13.32b) by t.

13.8 Resource Allocation with a Fractional Objective

We previously specified a generalized multiactivity profit‐maximization model (7.27.1) as

images

An assumption that is implicit in the formulation of this model is that all activities can be operated at the same rate per unit of time. However, some activities may be operated at a slower pace than others so that the time needed to produce a unit of output by one activity may vary substantially between activities. Since production time greatly influences the optimum activity mix, the model should explicitly reflect the importance of this attribute. Let us assume, therefore, that activity j is operated at an average rate of tj units per minute (hour, etc.). Then total production time is T(X) = Σj xj/tj = T′X, where T = (1/t1, …, 1/tp)′ is of order (p × 1). Note that dT/dtj < 0 for each j, i.e. total production time is inversely related to the speed of operation of each activity.

If C denotes the (constant) rate per unit of time of operating the production facility and K represents fixed overhead cost, then the total overhead cost of operating the facility is h(X) = CTX + K. In view of this discussion, total profit is images We cannot directly maximize this function subject to the above constraints since two different activity mixes do not necessarily yield the same production time so that we must divide images to get average profit per unit of time

(13.39) images

Hence, the adjusted multiactivity profit‐maximization model involves maximizing (13.39) subject to the structural constraints and nonnegativity conditions of the original model.
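Under the simplifying (and here purely illustrative) assumption that gross profit is linear in X, say P′X, the objective (13.39) takes the linear fractional form (P′X − C(T′X) − K)/T′X, which the solve_fractional sketch of Section 13.7 handles directly. The numbers below are hypothetical and are not those of Example 7.3.

```python
import numpy as np

# Illustrative data: unit profits P, facility rate Cfac, overhead K,
# activity speeds, and a single resource constraint AX <= b.
P = np.array([5.0, 4.0])
Cfac, K = 10.0, 100.0
speed = np.array([2.0, 8.0])
T = 1.0 / speed                  # T'X = total production time
A = np.array([[1.0, 2.0]])
b = np.array([80.0])

# f(X) = (P'X - Cfac*(T'X) - K)/(T'X): c0 = -K, C = P - Cfac*T, d0 = 0, D = T
# (d0 = 0 requires the optimum to have positive production time T'X > 0)
X0, f0 = solve_fractional(-K, P - Cfac * T, 0.0, T, A, b)
```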

For instance, from Example 7.3 the problem

images

has as its optimal basic feasible solution x1 = 10/19, x3 = 110/19, and f = 1350/19. Moreover, the shadow prices are u1 = 24/19, u2 = 13/19 while the accounting loss figures for products x2, x4 are, respectively, us2 = 3/19, us4 = 10/19. If we now explicitly introduce the assumption that some activities are operated at a faster rate than others, say t1 = 2, t2 = 10, t3 = 8, and t4 = 2, then the preceding problem becomes, for C = 10, K = 100,

images

Upon converting this problem to the form provided by (13.32), i.e. to

images

and solving (via the two‐phase method) yields the optimal simplex matrix (with the artificial column deleted)

images

Thus images With Xo = Yo/to, the optimal solution, in terms of the original variables, is images In addition, the dual variables obtained from the above matrix are images Then the original dual variables are images Moreover, the computed accounting loss figures for activities one and four are, respectively, images Upon transforming these values to the original accounting loss values yields images

We noted earlier that for the standard linear programming resource allocation problem the dual variable ui represents the change in the objective (gross profit) function “per unit change” in the ith scarce resource bi. However, this is not the case for linear fractional programs, the reason being that the objective function in (13.23) is nonlinear, i.e. the dual variables evaluate the effect on f precipitated by infinitesimal changes in the components of the requirements vector b and not unit changes. To translate per unit changes in the bi (which do not change the optimal basis) into changes in f for fractional objectives so that the “usual” shadow price interpretation of dual variables can be offered, let us write the change in f as

(13.40) images

where the (m × 1) vector DB contains the coefficients of D corresponding to basic variables and ei is the ith unit column vector (Martos 1964; Kydland 1962; Bitran and Magnanti 1976).

In view of this discussion, the change in f per unit change in b1 is

images

while the change in f per unit change in b2 is

images

13.9 Game Theory and Linear Programming

13.9.1 Introduction

We may view the notion of a game of strategy as involving a situation in which there is a contest or conflict situation between two or more players, where it is assumed that the players can influence the final outcome of the game, and the said outcome is not governed entirely by chance factors. The players can be individuals (two people engaging in a chess match, or multiple individuals playing a game of bridge) or, in a much broader sense, adversarial situations can emerge in a social, political, economic, or military context. Simply stated, a game is a set of rules for executing plays or moves. For example, the rules state what moves can be made, when they are to be made and by whom; what information is available to each participant; what are the termination criteria for a play or move, and so on. Moreover, after each play ends, we need to specify the reward or payoff to each player.

If the game or contest involves two individuals, organizations, or countries, it is called a two‐person game. And if the sum of the payoffs to all players at the termination of the game is zero, then the game is said to be a zero‐sum game, i.e. what one player wins, the other player loses. Hence a zero‐sum, two‐person game is one for which, at the end of each play, one player gains what the other player loses.

Suppose we denote the two players of a game as P1 and P2, respectively. Each player posits certain courses of action called strategies, i.e. these strategies indicate what one player will do for each possible move that his opponent might make. In addition, each player is assumed to have a finite number of strategies, where the number of strategies formulated by one player need not be the same for the other player. Hence the set of strategies for, say P1 covers all possible alternative ways of executing a play of the game given all possible moves that P2 might make.

It must be noted that for two‐person games, one of the opponents can be nature so that chance influences certain moves (provided, of course, that the players themselves ultimately control the outcome). However, if nature does not influence any moves and both parties select a strategy, then we say that the outcome of the game is strictly determined. But if nature has a hand in affecting plays and the outcome of a game is not strictly determined, then it makes sense to consider the notion of expected outcome (more on this point later on).

13.9.2 Matrix Games

Let us specify a two‐person game G as consisting of the nonempty sets R and C and a real‐valued function ϕ defined on a pair (r, c), where r ∈ R and c ∈ C. Here the elements r, c of sets R, C are the strategies for players P1 and P2, respectively, and the function ϕ is termed the payoff function. The number ϕ(r, c) is the amount that P2 pays P1 when P1 plays strategy r and P2 plays strategy c. If the sets R, C are finite, then the payoff ϕ can be represented by a matrix so that the game is called a matrix game.

For matrix games, if R, C contain strategies r1, r2, …, rm and c1, c2, …, cn, respectively, then the payoff to P1 if he chooses strategy i and P2 selects strategy j is ϕ (ri, cj) = aij, i = 1, …, m; j = 1, …, n. If some moves are determined by chance, then the expected payoff to P1 is also denoted by aij. So when P1 chooses strategy i and P2 selects strategy j, aij depicts the payoff to P1 if the game is strictly determined or if nature makes some of the plays. In general, aij may be positive, negative, or zero. Given that P1 has m strategies and P2 has n strategies, the payoffs aij can be arranged into an (m × n) payoff matrix:

A =
[a11 a12 ⋯ a1n]
[a21 a22 ⋯ a2n]
[ ⋮   ⋮      ⋮]
[am1 am2 ⋯ amn].

Clearly the rows represent the m strategies of P1 and the columns depict the n strategies of P2, where it is assumed that each player knows the strategies of his opponent. Thus, row i of A gives the payoff (or expected payoff) to P1 if he uses strategy i, with the actual payoff to P1 determined by the strategy selected by P2. A game is said to be in normal form if the strategies for each player are specified and the elements within A are given.

We can think of the aij elements of A as representing the payoffs to P1 while the payoffs to P2 are the negatives of these. The “conflict of interest” aspect of a game is easily understood by noting that P1 is trying to win as much as possible, with P2 trying to preclude P1 from winning more than is practicable. In this case, P1 will be regarded as the maximizing player while P2 will try to minimize the winnings of P1.

Given that P1 selects strategy i, he is sure of winning at least minj aij. (If “nature” impacts some of the moves, then P1’s expected winnings are at least minj aij.) Hence P1 should select the strategy that yields the maximum of these minima, i.e.

(13.41)  maxi minj aij.

Since it is reasonable for P2 to try to prevent P1 from getting any more than is necessary, selecting strategy j ensures that P1 will not get more than maxi aij no matter what P1 does. Hence, P2 attempts to minimize his maximum loss, i.e. P2 selects a strategy for which

(13.42)  minj maxi aij.

The strategies ri, i = 1, …, m, and cj, j = 1, …, n, are called pure strategies. Suppose P1 selects the pure strategy rl and P2 selects the pure strategy ck. If

(13.43)  maxi minj aij = minj maxi aij = alk,

then the game is said to possess a saddle point solution. Here rl turns out to be the optimal strategy for P1 while ck is the optimal strategy for P2 – with P2 selecting ck, P1 cannot get more than alk, and with P1 choosing rl, he is sure of winning at least alk.
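Checking (13.43) is straightforward once the payoff matrix is in hand; the helper below (ours) returns the optimal pure strategies whenever the maximin and minimax values coincide.

```python
import numpy as np

def pure_saddle_point(A):
    """Check (13.43): a saddle point in pure strategies exists when
    max_i min_j a_ij = min_j max_i a_ij; returns (l, k) or None."""
    A = np.asarray(A, dtype=float)
    lower = A.min(axis=1).max()       # P1's maximin
    upper = A.max(axis=0).min()       # P2's minimax
    if lower == upper:
        l = int(A.min(axis=1).argmax())
        k = int(A.max(axis=0).argmin())
        return l, k                    # optimal pure strategies r_l, c_k
    return None

print(pure_saddle_point([[4, 2], [3, 1]]))   # (0, 1): a12 = 2 is the value
```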

Suppose (13.43) does not hold and that, say,

maxi minj aij = arv < auk = minj maxi aij.

In this circumstance, P1 might be able to do better than arv, and P2 might be able to decrease the payoff to P1 below the auk level. For each player to pursue these adjustments, we must abandon the execution of pure strategies and look to the implementation of mixed strategies via a chance device. For P1, pure strategy i is randomly selected with probability ui ≥ 0, Σi ui = 1; and for P2, pure strategy j is randomly chosen with probability vj ≥ 0, Σj vj = 1. (More formally, a mixed strategy for P1 is a real‐valued function f on R such that f(ri) = ui ≥ 0, while a mixed strategy for P2 is a real‐valued function h on C such that h(cj) = vj ≥ 0.) Thus random selection determines the strategy each player will use, and neither is cognizant of the other's strategy or even of his own strategy until it is determined by chance. In sum:

U′ = (u1, …, um), ui ≥ 0, 1′U = 1,  or  V′ = (v1, …, vn), vj ≥ 0, 1′V = 1.

Under a regime of mixed strategies, we can no longer be sure what the outcome of the game will be. However, we can determine the expected payoff to P1. So if P1 uses mixed strategy U and P2 employs mixed strategy V, then the expected payoff to P1 is

(13.44)  ϕ(U, V) = U′AV = Σi Σj ui aij vj.

Using an argument similar to that used to rationalize (13.41) and (13.42), P1 seeks a U that maximizes his expected winnings. For any U chosen, he is sure that his expected winnings will be at least minV ϕ(U, V). P1 then maximizes his payoff relative to U so that his expected winnings are at least

(13.45)  maxU minV ϕ(U, V).

Likewise, P2 endeavors to find a V such that the expected winnings of P1 do not exceed

(13.46)  minV maxU ϕ(U, V).

Now, if there exist mixed strategies U*, V* such that maxU minV ϕ(U, V) = minV maxU ϕ(U, V) = ϕ(U*, V*), then there exists a generalized saddle point of ϕ(U, V). P1 should use the mixed strategy U* and P2 the mixed strategy V* so that the expected payoff to P1 is exactly ϕ(U*, V*) = U*′AV*, the value of the game. How do we know that mixed strategies U, V exist such that the value of the game is U*′AV*? The answer is provided by Theorem 13.7.

13.9.3 Transformation of a Matrix Game to a Linear Program

Let us express a matrix game for players P1 and P2 as

A = [a1, …, an],

where aj, j = 1, …, n, is the jth column of A. Suppose P2 chooses to play the pure strategy j. If P1 uses the mixed strategy U ≥ O, 1′U = 1, then the expected winnings of P1 is

(13.47)  U′aj.

Then P1’s expected payoff will be at least W1 if there exists a mixed strategy U such that

(13.48)  U′aj ≥ W1,

i.e. P1 cannot guarantee himself more than the largest value of W1 for which there exists a U ≥ O, 1′U = 1, such that (13.48) holds. Considering each j, (13.47) becomes

(13.49)  U′aj ≥ W1, j = 1, …, n.

With W1 assumed to be positive, we can set

(13.50)  yi = ui/W1, i = 1, …, m,

so that Y = U/W1 and 1′Y = 1/W1.

Clearly yi ≥ 0 since W1 > 0. Under this variable transformation, (13.49) can be rewritten as

(13.51)  Y′aj ≥ 1, j = 1, …, n, Y ≥ O.

Here P1’s decision problem involves solving the linear program

(13.52)  minimize 1′Y  subject to  Y′aj ≥ 1, j = 1, …, n, Y ≥ O.

Thus, minimizing 1′Y = 1/W1 will yield the maximum value of W1, the expected winnings for P1.

Next, suppose P1 decides to play pure strategy i. From the viewpoint of P2, he attempts to find a mixed strategy V ≥ O, 1′V = 1, which will give the smallest W2 such that, when P1 plays strategy i,

(13.53)  αiV ≤ W2,

where αi is the ith row of the payoff matrix A and E(αiV) is the expected payout of P2. Considering each i, (13.53) becomes

(13.54)  αiV ≤ W2, i = 1, …, m.

With W2 taken to be positive, let us define

(13.55)  xj = vj/W2, j = 1, …, n,

so that X = V/W2 and 1′X = 1/W2.

Hence (13.54) becomes

(13.56)  αiX ≤ 1, i = 1, …, m, or AX ≤ 1, X ≥ O.

Thus P2’s decision problem is a linear program of the form

(13.57)  maximize 1′X  subject to  AX ≤ 1, X ≥ O,

i.e. maximizing 1′X = 1/W2 will yield the minimum value of W2, the upper bound on the expected payout of P2.

A moment’s reflection reveals that (13.57) can be called the primal problem and (13.52) is its symmetric dual. Moreover, from our previous discussion of duality theory, there are always feasible solutions U, V since P1, P2 can execute pure strategies. And with W1, W2 each positive, there exist feasible vectors X, Y for the primal and dual problems, respectively, and thus the primal and dual problems have optimal solutions with max z′ = min z or

1′Xo = 1/W2 = 1/W1 = 1′Yo, so that W1 = W2.

These observations thus reveal that we have actually verified the fundamental theorem of game theory by constructing the primal‐dual pair of linear programs for P1 and P2.
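The construction just verified is directly computable. The sketch below (ours) solves the primal‐dual pair (13.52)/(13.57) with a standard LP solver; a constant shift first makes every payoff positive so that W1, W2 > 0, as the derivation requires, and the shift is then removed from the reported game value.

```python
import numpy as np
from scipy.optimize import linprog

def solve_matrix_game(A):
    """Solve a matrix game via the LP pair (13.52)/(13.57); rows are
    P1's strategies. Returns P1's and P2's optimal mixed strategies
    and the value of the game."""
    A = np.asarray(A, dtype=float)
    shift = 1.0 - A.min() if A.min() <= 0 else 0.0
    Ap = A + shift                         # value shifts by the same constant
    m, n = Ap.shape
    # P1 (13.52): min 1'Y  s.t.  Ap'Y >= 1, Y >= O, with Y = U/W1
    p1 = linprog(np.ones(m), A_ub=-Ap.T, b_ub=-np.ones(n), bounds=(0, None))
    # P2 (13.57): max 1'X  s.t.  Ap X <= 1, X >= O, with X = V/W2
    p2 = linprog(-np.ones(n), A_ub=Ap, b_ub=np.ones(m), bounds=(0, None))
    W = 1.0 / p1.fun                       # W1 = W2 by the duality argument
    U, V = p1.x * W, p2.x * W              # optimal mixed strategies
    return U, V, W - shift

# Matching pennies: value 0, both players mix 50:50
print(solve_matrix_game([[1, -1], [-1, 1]]))
```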

13.A Quadratic Forms

13.A.1 General Structure

Suppose Q is a function of the n variables x1, …, xn. Then Q is termed a quadratic form in x1, …, xn if

(13.A.1)  Q = Q(x1, …, xn) = Σi Σj aij xi xj, i, j = 1, …, n,

where at least one of the constant coefficients aij ≠ 0. More explicitly,

Q = a11x1² + a12x1x2 + ⋯ + a1nx1xn + a21x2x1 + a22x2² + ⋯ + annxn².

It is readily seen that Q is a homogeneous4 polynomial of the second degree since each term involves either the square of a variable or the product of two different variables. Moreover, Q contains n² distinct terms and is continuous for all values of the variables xi, i = 1, …, n, and equals zero when xi = 0 for all i = 1, …, n.

In matrix form Q equals, for X ∈ En, the scalar quantity

(13.A.2)  Q = X′AX,

where

X = (x1, …, xn)′ and A = [aij] is of order (n × n).

To see this, let us first write

AX = (Σj a1jxj, Σj a2jxj, …, Σj anjxj)′.

Then

Q = X′(AX) = Σi xi(Σj aijxj) = Σi Σj aij xi xj.

From (13.A.2) it can be seen that aij + aji is the coefficient of xixj since aij, aji are both coefficients of xixj = xjxi, i ≠ j.

13.A.2 Symmetric Quadratic Forms

Remember that if a matrix A is symmetric so that A = A′, then aij = aji, i ≠ j. In this regard, a quadratic form XAX is symmetric if A is symmetric or aij = aji, i ≠ j. Hence aij + aji = 2aij is the coefficient on xixj since aij = aji and aij, aji are both coefficients of xixj = xjxi, i ≠ j.

If A is not a symmetric matrix (aij ≠ aji), we can always transform it into a symmetric matrix B by defining new coefficients

(13.A.4)  bij = (aij + aji)/2, i, j = 1, …, n.

Then bij + bji = 2bij is the coefficient of xixj, i ≠ j, in

(13.A.5)  Q = X′BX = Σi Σj bij xi xj.

(Since bij + bji = aij + aji, this redefinition of coefficients clearly leaves the value of Q unchanged and thus, under (13.A.4), XAX = XBX, X ∈ En.) So given any quadratic form XAX, the matrix A may be assumed to be symmetric. If it is not, it can be readily transformed into a symmetric matrix.

13.A.3 Classification of Quadratic Forms

As we shall now see, there are, in all, five mutually exclusive and collectively exhaustive varieties of quadratic forms. Specifically:

  1. Definite Quadratic Form

    A quadratic form is said to be positive definite (negative definite) if it is positive (negative) at every point X ∈ En except X = O. That is:

    1. XAX is positive definite if XAX > 0 for every X ≠ O;
    2. XAX is negative definite if XAX < 0 for every X ≠ O.

      (Clearly a form that is either positive or negative definite cannot assume both positive and negative values.)

  2. Semi‐Definite Quadratic Form

    A quadratic form is said to be positive semi‐definite (negative semi‐definite) if it is nonnegative (nonpositive) at every point X ∈ En, and there exist points X ≠ O for which it is zero. That is:

    1. XAX is positive semi‐definite if XAX ≥ 0 for every X and XAX = 0 for some points X ≠ O;
    2. XAX is negative semi‐definite if XAX ≤ 0 for every X and XAX = 0 for some points X ≠ O.
  3. Indefinite Quadratic Forms

    A quadratic form is said to be indefinite if it is positive for some points X ∈ En and negative for others.

13.A.4 Necessary Conditions for the Definiteness and Semi‐Definiteness of Quadratic Forms

We now turn to an assortment of theorems (13.A.1–13.A.3) that will enable us to identify the various types of quadratic forms.

Note that the converse of this theorem does not hold since a quadratic form may have positive (negative) coefficients on all its terms involving second powers yet not be definite, e.g. images Similarly,

(Here, too, the converse of the theorem is not true, e.g. the quadratic form images has nonnegative coefficients associated with its second‐degree terms, yet is indefinite.)

Before stating our next theorem, let us define the kth naturally ordered principal minor of A, denoted Mk, as the determinant of the submatrix formed from the first k rows and columns of A, i.e.

Mk =
|a11 ⋯ a1k|
| ⋮       ⋮|
|ak1 ⋯ akk|,  k = 1, …, n,

or

M1 = a11,  M2 =
|a11 a12|
|a21 a22|,  …,  Mn = |A|.

In the light of this definition, we now look to

So, if any Mk = 0, k = 1, …, n, the form is not definite – but it may be semi‐definite or indefinite.
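Although the theorem statements themselves are not reproduced here, the classical determinantal tests built on the Mk are easily mechanized; the sketch below (ours) assumes A is symmetric.

```python
import numpy as np

def naturally_ordered_minors(A):
    """M_k = determinant of the submatrix built from the first k rows and
    columns of A, k = 1, ..., n."""
    A = np.asarray(A, dtype=float)
    return [np.linalg.det(A[:k, :k]) for k in range(1, A.shape[0] + 1)]

def classify(A):
    """Classical tests for symmetric A: X'AX is positive definite iff every
    M_k > 0, and negative definite iff the M_k alternate in sign beginning
    with M_1 < 0 (a zero M_k rules definiteness out)."""
    M = naturally_ordered_minors(A)
    if all(m > 0 for m in M):
        return "positive definite"
    if all((-1) ** k * m > 0 for k, m in enumerate(M, start=1)):
        return "negative definite"
    return "not definite (possibly semi-definite or indefinite)"

print(classify(np.array([[2.0, 1.0], [1.0, 3.0]])))   # positive definite
```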

13.A.5 Necessary and Sufficient Conditions for the Definiteness and Semi‐Definiteness of Quadratic Forms

Let us modify Theorem 13.A.3 so as to obtain Theorem 13.A.4.

We next have Theorem 13.A.5.

A similar set of theorems (13.A.6 and 13.A.7) holds for semi‐definite quadratic forms. Specifically,

Similarly,
