Notes2 PDF
Notes2 PDF
Notes2 PDF
> min
′
[f(x ′ ) − λT (h(x ′ ) − b)]
x ∈X
Equality in the first line holds because h(x ′ ) − b = 0 when x ′ ∈ X(b). The inequality
on the second line holds because the minimum is taken over a larger set. In the third
line we finally use that x minimizes L and that h(x) = b.
Two remarks are in order. First, a vector λ of Lagrange multipliers satisfying the
conditions of the theorem is not guaranteed to exist in general, but it does exist for a
large class of problems. Second, the theorem appears to be useful mainly for showing
that a given solution x is optimal. In certain cases, however, it can also be used to find
an optimal solution. Our general strategy in these cases will be to minimize L(x, λ) for
all values of λ, in order to obtain a minimizer x∗ (λ) that depends on λ, and then find
λ∗ such that x∗ (λ∗ ) satisfies the constraints.
5
6 2 · The Method of Lagrange Multipliers
minimize x1 − x2 − 2x3
subject to x1 + x2 + x3 = 5
x21 + x22 = 4.
For every λ ∈ Y, the unique optimum of L(x, λ) occurs at x∗ (λ) = (3/(2λ2 ), 1/(2λ2 ), x3 )T ,
and we need to find λ ∈ Y such that x∗ (λ) is feasible to be able to apply Theorem 2.1.
Therefore,
9 1
x21 + x22 = 2 + 2 = 4
4λ2 4λ2
p
and thus λ2 = − 5/8. We can now use Theorem p 2.1 to conclude
p that the minimization
p has an optimal solution at x1 = −3 2/5, x2 = − 2/5, and x3 = 5 − x1 − x2 =
problem
5 + 4 2/5.
2.3 · Complementary Slackness 7
Let us formalize the strategy we have used to find x and λ satisfying the conditions
of Theorem 2.1 for a more general problem. To
we proceed as follows:
1. Introduce a vector z of slack variables to obtain the equivalent problem
4. For each λ ∈ Y, minimize L(x, z, λ) subject only to the regional constraints, i.e.,
find x∗ (λ), z∗ (λ) satisfying
5. Find λ∗ ∈ Y such that (x∗ (λ∗ ), z∗ (λ∗ )) is feasible, i.e., such that x∗ (λ∗ ) ∈ X,
z∗ (λ∗ ) > 0, and h(x∗ (λ∗ )) + z∗ (λ∗ ) = b. By Theorem 2.1, x∗ (λ∗ ) is optimal
for (2.2).
Indeed, if the conditions were violated for some i, then the value of the Lagrangian
could be reduced by reducing (z∗ (λ))i , while maintaining that (z∗ (λ))i > 0. This
would contradict (2.3). Further note that λ ∈ Y requires for each i = 1, . . . , m either
that λi 6 0 or that λi > 0, depending on the sign of bi . In the case where where λi 6 0,
we for example get that
Slack in the corresponding inequalities (h(x∗ (λ∗ )))i 6 bi and λ∗i 6 0 has to be comple-
mentary, in the sense that it cannot occur simultaneously in both of them.
8 2 · The Method of Lagrange Multipliers
minimize x1 − 3x2
subject to x21 + x22 6 4
x1 + x2 6 2.
minimize x1 − 3x2
subject to x21 + x22 + z1 = 4
x1 + x2 + z2 = 2
z1 > 0, z2 > 0.
Since z1 > 0 and z2 > 0, the terms −λ1 z1 and −λ2 z2 have a finite minimum only if
λ1 6 0 and λ2 6 0. In addition, the complementary slackness conditions λ1 z1 = 0 and
λ2 z2 = 0 must hold at the optimum.
Minimizing L(x, z, λ) in x1 and x2 yields
∂L
= 1 − λ2 − 2λ1 x1 = 0 and
∂x1
∂L
= −3 − λ2 − 2λ1 x2 = 0,
∂x2
and we indeed obtain a minimum, because
∂2 L ∂2 L
∂x1 ∂x1 ∂x1 ∂x2
−2λ1 0
HL = =
∂2 L ∂2 L
∂x2 ∂x1 ∂x2 ∂x2
0 −2λ1