Calabria University - Pierre Apkarian

Convex quadratic constraints can be rewritten. (Ax + b). ′ .... convert family of constrained quadratic inequalities ..... Easily computed by solving linear equation.
382KB taille 7 téléchargements 344 vues
1

L INEAR M ATRIX I NEQUALITIES AND S EMI D EFINITE P ROGRAMMING I MPACTS ON C ONTROL S YSTEM D ESIGN P. Apkarian

Université Paul Sabatier & ONERA-CERT Mathématiques pour l’Industrie et la Physique CNRS UMR 5640

invitation by Professor Giuseppe Franze - Calabria University

– p. 1/129

Seminar Outline ♦ ♦ ♦ ♦ ♦ ♦ ♦ ♦

2

Linear Matrix Inequalities and SDP Tricks to reformulate into LMIs System concepts via LMIs Multi-channel/objective with LMIs Uncertain systems analysis Gain-scheduling and LPV synthesis Hard non-LMI problems Conclusions, perspectives.

– p. 2/129

Linear Matrix Inequalities and SDP

• • • • • •

3

Definitions, manipulations Schur’s complements Classes of convex optimization problems Semi- Definite Programming Algorithms to solve SDP, duality, complexity Software, links.

– p. 3/129

linear matrix inequalities

4

an LMI is a constraint on a vector x ∈ Rn : F (x) := F0 + x1 F1 + . . . + xn Fn  0, where F0 , F1 , . . . , Fn are symmetric matrices ➠

 is inequality on symmetric matrix cone



LMI equivalent to λmin (F (x)) ≥ 0



F (x)  0 iff η ′ F (x)η ≥ 0, ∀η



F (x)  0 iff det {ppal mat.} ≥ 0



F (x) ≻ 0 iff η ′ F (x)η > 0, ∀η 6= 0

– p. 4/129

geometry of LMIs ➠

5

an LMI de fine a convex set F (λx+(1−λ)y) = λF (x)+(1−λ)F (y)  0 whenever F (x)  0, F (y)  0





set with non necessarily smooth boundary (corners)

LMI plane and curved faces

describe wide variety of constraints

– p. 5/129

LMI - diagonal augmentation

6

LMI constraints F1 (x)  0, . . . , Fq (x)  0 are equivalent to single LMI constraint 

F1 (x)  0 .. .

0 ... 0



... 0 0 Fq (x)

LMI 2

LMI 1

LMI 3

– p. 6/129

linear constraints

7

finite set of scalar linear (affine) constraints a′i x ≤ bi , i = 1, . . . , m can be represented as LMI F (x)  0, with F (x) = diag(a′1 x − b1 , . . . , a′m x − bm )

polyhedral LMI

– p. 7/129

Schur complements

8

partitioned symmetric matrix   P1 P2 P := P2′ P3 S = P3 − P2′ P1−1 P2 is the Schur complement of P1 in P (provided P1 invertible) Schur complement lemmas ➠

P ≻ 0 if and only if P1 ≻ 0 and S ≻ 0



if P1 ≻ 0, then P  0 if and only if S  0

– p. 8/129

Schur complement consequence

9

complicate constraint in variable x P3 (x) − P2 (x)′ P1 (x)−1 P2 (x) ≻ 0 is turned into simpler one   P1 (x) P2 (x) ≻ 0. ′ P2 (x) P3 (x) provided that P1 (x) ≻ 0.

– p. 9/129

ellipsoidal constraints

10

an ellipsoid can be described in different ways • as kAx + bk ≤ 1, iff   I Ax + b 0 ′ (Ax + b) 1 • as (x−x0 )′ W (x−x0 ) ≤ 1, with W > 0 iff   1 (x − x0 )′ 0 −1 (x − x0 ) W

LMI

– p. 10/129

fractional constraints

11

consider fractional constraints (c′ x)2 d′ x

≤ t

Ax + b ≥ 0 (assume d′ x > 0, whenever Ax + b ≥ 0) can be represented as   ′ t cx  0 ′ ′ cx dx Ax + b ≥ 0 – p. 11/129

convex quadratic constraints

12

Convex quadratic constraints can be rewritten (Ax + b)′ (Ax + b) − c′ x − d ≤ 0 has the LMI representation   I Ax + b 0 ′ ′ (Ax + b) c x + d • can be used to show that convex quadratic programming can be solved via SDP

– p. 12/129

classes of convex optimization problems

13

• linear prog. (LP) minimize c′ x, Ax  b (componentwise) • convex quadratic prog. (CQP) Qj  0 minimize x′ Q0 x + b′0 x + c0 s.t. x′ Qi x + b′i x + ci ≤ 0 All (and others) are generalized by SDP !:

– p. 13/129

LMIs in control with P variable • Lyapunov inequality A′ P + P A ≺ 0 can be represented in canonical form F0 +

n X i=1

xi Fi ≺ 0

14

pick a basis (Pi )i of the symmetric matrices, X P = xi Pi i

hence recover the canonical form with F0 = 0,

Fi = A′ Pi +Pi A

– p. 14/129

symmetric matrix expressions are LMIs

15

• Any (symmetric) linear constraints in the variables X, Y AY B + (AY B)′ + X + . . .  0 can be represented in the canonical form F (x) = F0 + x1 F1 + . . . + xn Fn  0 by appropriate selection of the Fi ’s.

– p. 15/129

Riccati and quadratic matrix inequality

16

quadratic matrix inequality in P A′ P + P A + P BR−1 B ′ P + Q  0 where R > 0, is equivalent to LMI  ′  A P + PA + Q PB 0 ′ BP −R (proof by Schur complements) Riccati-based control method can be solved via LMIs

– p. 16/129

classes of semidefinite programs

17

• I feasibility problem: find x : F0 + x1 F1 + . . . + xn Fn  0 • II linear objective minimization subject to LMIs minimize c′ x, s.t. F0 + x1 F1 + . . . + xn Fn  0 • III generalized eigenvalue minimization minimize subject to

λ A(x) − λB(x)  0, B(x)  0, C(x)  0

(A, B, C affine symmetric expressions in x) – p. 17/129

solving LMI - a rich set of algorithms

18

much work and progress since 1990 ! ➠

primal interior-point method (method of centers)



primal-dual interior-point method



non-differentiable methods (bundle, ...)

Primal-dual methods very efficient. other fast algorithms under development (aug. Lagrangian)

– p. 18/129

central property

19

because of structure and convexity algorithms are guaranteed to find global solutions !

– p. 19/129

primal-dual IPMs

20

ideas: ➠

instead of working in primal space, formulate problem in “primal-dual” space



target objective is duality gap, and is zero at optimum



try to solve (Lagrange) optimality conditions

– p. 20/129

SDP duality • primal

21

• dual

min c′ x s.t. F (x)  0

max − Tr (F0 Z) s. t.Z ≻ 0, Tr Fi Z = ci • optimality cond. if (x, Z) is primal-dual feasible c′ x =

n X i=1

≥0

z }| { xi Tr ZFi =Tr ZF (x) −Tr ZF0 ≥ −Tr ZF0

hence global optimality pairs (x, Z) such that Tr ZF (x) = 0 since primal and dual obj. coincide at solution

– p. 21/129

mechanisms of primal-dual algorithms solve subject to

22

Tr ZF (x) = 0 Pn F0 + i=1 xi Fi  0

Z ≻ 0, Tr Fi Z = ci , i = 1, . . . , n • Actually, one tries to solve Tr ZF (x) = µI for decreasing value of µ (µ −→ 0) • Newton steps for the linearization of Tr ZF (x). • superlinear convergence can be guaranteed kxk+1 − xopt k ≤ kxk − xopt kq , q > 1 very efficient in practice ! – p. 22/129

SDP software ➠

MATLAB LMI toolbox by Gahinet, Chilali, Laub, Nemirovski



DSDP by Benson, Ye



SDPpack by Alizadeh, Haeberly, Nayakkankuppam, Overton



SeDuMi by Sturm



Imitool-2.0 by Boyd et al.



Cutting plane methods by Helmberg, Oustry, Kiwiel, etc.



Many others ...

23

– p. 23/129

ftp addresses for SDP

24

• ftp addresses, codes, papers, courses on SDP

http://orion.math.uwaterloo.ca:80/ hwolkowi/henry/software/readme.html# http://www.zib.de/helmberg/semidef.html http://rutcor.rutgers.edu/ alizadeh/sdp.html

– p. 24/129

tricks to turn hard problems into LMIs • • • • • • •

25

Schur’s complements (see previous) LMIs and quadratic forms multi-convexity, monotonicity, etc. Finsler’s lemmas Projection lemmas changes of variables augmentation by slack

– p. 25/129

S-Procedure and quadratic inequalities ➠

26

S-Procedure transforms quadratic problems into LMIs(possibly conservative)

given Qi ’s symmetric or hermitian matrices, define F0 (x) = x′ Q0 x, F1 (x) = x′ Q1 x, . . . , FL (x) = x′ QL x, F0 (x) < 0 over the set F1 (x) ≤ 0, . . . , FL (x) ≤ 0 whenever ∃s1 ≥ 0, . . . , sL ≥ 0 (slacks), such that F0 (x) −

L X i=1

si Fi (x) < 0 or LMI Q0 −

L X

si Qi  0

i=1

– p. 26/129

Finsler’s Lemma

27

• converts checking the sign of a quadratic form over a subspace into solving an LMI problem x′ Qx < 0, ∀x 6= 0, M x = 0 if and only there exists a scalar σ such that Q − σM ′ M ≺ 0 M x = 0 can also be formulated as x′ M ′ M x = 0 • proof via convexity of numerical ranges

– p. 27/129

generalized Finsler’s Lemma

28

• convert family of constrained quadratic inequalities into an LMI feasibility problem Q = Q′ and M given, and a compact subset of real matrices U we have the equivalence • for all U ∈ U, x′ Qx < 0, ∀x 6= 0 with U M x = 0, iff there exists Θ s.t. Q + M ′ ΘM ≺ 0 ∀U ∈ U NU′ ΘNU  0, where NU is basis of nullspace of U – p. 28/129

multi-convexity

29

given a function f (δ1 , . . . , δK ) • it is multi-convex function if separately convex along each direction δi • multi-convexity is weaker than convexity • convexity iff • multi-convexity iff ∂2 f (δ)]1≤i,j≤K  0 [ ∂δi δj

∂2 f (δ) ≥ 0, 2 ∂δi

i = 1, . . . , K

Turn parameter-dependent LMIs into finite set of LMIs.

– p. 29/129

projection Lemma - two-Sided

30

given Ψ = Ψ′ ∈ Rm×m , P, Q of column dim. m find X such that Ψ + P ′ X ′ Q + Q′ XP ≺ 0 let columns of NP , NQ form bases of the null spaces of P and Q inequality is solvable for X if and only if NP′ Ψ NP ≺ 0 NQ′ Ψ NQ ≺ 0 (Gahinet & Apkarian 1993)

– p. 30/129

System concepts via LMIs

• • • • •

31

Stability L2 gain or H∞ norm H2 norm Pole clustering ...

– p. 31/129

stability & equilibria

32

• Equilibrium points x˙ = f (x) are defined as the solutions x∗ of 0 = f (x∗ ). system has trajectory x(t) = x∗ , ∀t ≥ 0 if initialized at x∗ From now on, we assume x∗ = 0.

– p. 32/129

types of stability

33

• stability (simple) ∀R > 0, ∃r > 0, kx(0)k < r ⇒ ∀t ≥ 0, kx(t)k < R • asymptotic stability if it is stable and ∃r > 0, kx(0)k < r ⇒ x(t) → 0, as t → ∞ • exponentially stable if ∃ α > 0 and λ > 0 s. t. ∀t > 0, kx(t)k ≤ αkx(0)ke−λt in some ball. λ rate of conv. – p. 33/129

positive-definite functions of the state

34

Assume D is open region containing x∗ = 0. • A function V (x) from Rn into R is positive semi-definite on a domain D if (1) V (0) = 0 (2) V (x) ≥ 0, ∀x ∈ D • A function V (x) from Rn into R is positive definite on a domain D if (1) V (0) = 0 (2) V (x) > 0, ∀x ∈ D, x 6= 0

– p. 34/129

postive-definite functions: level curves

35

x2 V (x) = c2 V (x) = c1 O V (x) = c3

x1

c3 < c2 < c1

Typical level curves of positive-definite functions

– p. 35/129

Lyapunov derivatives

36

• if x is state of system x˙ = f (x), then V (x) is implicitly a function of time. Its time derivative is ′



∂V dV (x) ∂V ˙ = x˙ = f (x) V (x) = dt ∂x ∂x since x is constrained to satisfy x˙ = f (x). • it is referred to as derivative of V along the system trajectories (also Lyapunov’s derivative).

– p. 36/129

Lyapunov function: definition

37

• V (x) is a Lyapunov function of the system x˙ = f (x) if ➠

it is C 1 with respect to x on D



it is positive definite (see earlier) on D



its derivative on the system trajectories is negative semi-definite, that is, V˙ (x) ≤ 0, on D

as a function of x.

– p. 37/129

Lyapunov theorem for local stability

38

• if in a ball around the origin (= x∗ ), there exists V (x) in C 1 such that V (x) is positive definite ➠ V˙ (x) is negative semi-definite ➠

then the equilibrium point x∗ = 0 is (loc.) stable. It is asymptotically stable if V (x) is negative definite, i. e., V˙ (x) < 0, ∀x 6= 0, x ∈ ball as a function of x. • global stability if ball = Rn .

– p. 38/129

Lyapunov geometry

39

∂V (x) ∂x

x(t) ˙ x(t) 0

V˙ (x) < 0

angle between derivatives is greater than 90 deg.

– p. 39/129

stability for linear systems

40

the system d x = Ax dt is exponentially stable if and only if there exists X with X ≻ 0, A′ X + XA ≺ 0 why ? ′

V (x) = x Xx is a quadratic Lyapunov function

– p. 40/129

proof of exponential stability

41

perturb Lyapunov LMI to A′ X + XA + εX ≺ 0

for any state trajectory x(t), we infer x(t)′ (A′ X + XA)x(t) + εx(t)′ Xx(t) ≤ 0 and thus d ′ ′ x(t) Xx(t) + εx(t) Xx(t) ≤ 0 dt – p. 41/129

trick !

42

• note that solution of d x(t)′ Xx(t) + εx(t)′ Xx(t) = z(t) with z(t) ≤ 0 dt is V (x(t)) = x(0)′ Xx(0)e−εt +

Z

t

e−ε(t−τ ) z(τ )dτ

0

hence x(t)′ Xx(t) ≤ x(0)′ Xx(0)e−εt , ∀t ≥ 0 .

– p. 42/129

proof of exponential stability - continued

43

we have, with initial condition x(0) yields ′



x(t) Xx(t) ≤ x(0) Xx(0)e

−εt

finally, using ′

2

λmin (X)kxk ≤ x Xx ≤ λmax (X)kxk

2

gives kx(t)k ≤ kx(0)k

s

λmax (X) −εt/2 e for t ≥ 0 λmin (X)

system is exponentially stable ! – p. 43/129

necessity of Lyapunov inequalities

44

Assume A is stable (Re λi (A) < 0) and consider for Q ≻ 0, the (well-defined) integral Z ∞ d A′ t −Q = (e QeAt )dt dt 0 =

Z

∞ ′ A′ t

(A e

Qe

At

+e

A′ t

QeAt A)dt

0



= A P + P A with P :=

Z

∞ A′ t

e QeAt dt ≻ 0 0

– p. 44/129

necessity continued

45

finally, we have A′ P + P A = −Q ≺ 0,

P ≻ 0.

LMI problem has a solution whenever A is stable. • condition is iff • for linear systems quadratic Lyapunov functions are rich enough

– p. 45/129

energy gain or L2 gain

46

Energy gain not larger than γ: with w ∈ L2 and x(0) = 0, every trajectory of d x = Ax + Bw dt z = Cx + Dw should satisfy kzk2 ≤ γkwk2 , or

Z

∞ 0

z(t)′ z(t) dt ≤ γ 2

∀w ∈ L2 Z



w(t)′ w(t) dt

0

– p. 46/129

H∞ norm

47

• stable and the L2 gain w −→ z is smaller than γ if and only if there exists X ≻ 0 " ′ ′ # A X + XA XB C B ′X −γI D′ ≺ 0 C D −γI • freq. domain kC(sI − A)−1 + Dk∞ < γ via KYP. • similarly, H2 norm, LQ, LQG, many others ...

– p. 47/129

H∞ norm

48

• necessity call for general LQ theory. • we shall only prove sufficiency.

– p. 48/129

proof of sufficiency

49

• Note first that the (1, 1) block of the LMI implies that A is stable • By Schur complement, LMI is rewritten  ′   ′ A X + XA XB −1 C +γ D] ≺ 0 ′ ′ [C BX −γI D   x(t) Left- and right-multiply with yields ... w(t)

– p. 49/129

proof of sufficiency cont.

50

d V dt}| { z x′ (A′ X + XA)x + x′ XBw + w′ B ′ Xx −γw′ w + γ −1 z ′ z ≤ 0

– p. 50/129

proof of sufficiency cont.

51

integrate over [0, T ] and exploit x(0) = 0: Z T x(T )′ Xx(T ) + γ −1 kz(t)k2 − γkw(t)k2 dt ≤ 0 0

Recall X ≻ 0 and take T → ∞ (w ∈ L2 ): Z ∞ Z ∞ kz(t)k2 dt ≤ γ 2 kw(t)k2 dt ≤ 0 0

0

Can perturb γ to γ − ε to get strict inequality

– p. 51/129

relation to frequency domain

52

For ω ∈ R, left- and right multiply with   (jω − A)−1 B I to get γ −1 T (jω)∗ T (jω) − γI ≺ 0 hence kT (jω)k < γ,

∀ω ∈ R

From the right-lower block, we also get   ′ −γI D ≺ 0 or kDk < γ D −γI – p. 52/129

relation to frequency domain

53

finally, kT (jω)k < γ for ω ∈ R ∪ {∞} hence, kT k∞ :=

sup

kT (jω)k < γ.

ω∈R∪{∞}

– p. 53/129

H2 performance

54

• H2 norm of T defined as s Z ∞ 1 Tr kT k2 := T (jω)∗ T (jω) dω 2π −∞ • in the time domain (via Parseval) sZ ∞

Tr (CeAt B)′ (CeAt B) dt

kT k2 :=

0

– p. 54/129

H2 performance computation

55

Easily computed by solving linear equation AP0 + P0 A′ + BB ′ = 0 ⇒ kT k22 = Tr (CP0 C ′ ) A′ Q0 + Q0 A + C ′ C = 0 ⇒ kT k22 = Tr (B ′ Q0 B) Why ? see stability notes. • note that D = 0 for H2 norm to be well defined.

– p. 55/129

stochastic interpretation of H2 norm

56

w white noise, x˙ = Ax + Bw, x(0) = 0, z = Cx. Recall: with solution of P˙ (t) = AP (t) + P (t)A′ + BB ′ ,

P (0) = 0

we have E(x(t)x(t)′ ) = P (t). Hence lim E(z(t)′ z(t)) = lim E(x(t)′ C ′ Cx(t))

t→∞

t→∞

= lim Tr E(Cx(t)x(t)′ C ′ ) t→∞

= Tr (CP0 C ′ ) = kT k22 • asymptotic variance of output of system. – p. 56/129

deterministic interpretation of H2 norm

57

let zj be impulse response to Bej δ(t) with standard unit vector ej of x˙ = Ax, x(0) = x0 , z = Cx Z





zj (t) zj (t) dt = 0

P

Z

∞ 0

′ A′ t ′ Bj e C CeAt Bj

dt

v v = Tr (vv ) and j Bj Bj′ = BB ′ implies XZ ∞ kzj (t)k2 dt = kT k22 . ′



j

0

– p. 57/129

How to get LMI characterization ?

58

With A stable, it is easy to see that Tr (CP0 C ′ ) < γ 2 for AP0 + P0 A′ + BB ′ = 0 if and only if there exists X with Tr (CXC ′ ) < γ 2 and AX + XA′ + BB ′ ≺ 0 . • for ⇐ take difference of Lyapunov conditions

– p. 58/129

How to get LMI characterization ?

59

• for ⇒ since trace inequality is strict and by continuity there exists ε > 0 and X such that AX + XA′ + BB ′ + εI = 0,

Tr (CXC ′ ) < γ 2 .

Note that AX + XA′ + BB ′ ≺ 0 and Z ∞ At ′ A′ t X= e (BB + εI)e dt ≻ P0 0

Hence, kC(sI−A)−1 Bk2H2 := Tr (CP0 C ′ ) < Tr (CXC ′ ) < γ 2 .

– p. 59/129

LMI characterization for H2 norm

60

♦ A is stable and kT k22 < γ if and only if Y ≻ 0 with Tr (CY C ′ ) < γ,

AY + Y A′ + BB ′ ≺ 0

or if and only if X ≻ 0 with Tr (B ′ XB) < γ,

A′ X + XA + C ′ C ≺ 0

– p. 60/129

regional pole constraints

61

• to shape transient responses of closed-loop system • damping, settling time, rise time related to location of poles • useful regions: vertical strips, disks, conic sectors, etc • An LMI region R is defined as R = {z ∈ C : U + zV + z¯V ′ ≺ 0} .

– p. 61/129

LMI region intersections

62

• a large variety of regions can be represented this way • intersections of LMI regions are LMI regions

– p. 62/129

a short catalog of useful LMI regions LMI Regions

α

β

63

Characterization

fR (z) =

"

1 (z + z −α + 2 ¯)

0

0

(z + z ¯) β− 1 2

fR (z) =

"

i (z − z −α − 2 ¯)

0

0

¯) β + 2i (z − z

fR (z) =

»

−r q + z q+z ¯ −r

fR (z) =

»

sin θ(z + z ¯) cos θ(z − z ¯) cos θ(¯ z − z) sin θ(z + z ¯)

α

#

#

β

r −q

θ





– p. 63/129

Lyapunov theorem for LMI regions

64

• System d x = Ax has all its poles in LMI region R dt iff there exists X ≻ 0 s. t. U ⊗ X + V ⊗ (A′ X) + V ′ ⊗ (XA) ≺ 0 . is an LMI with respect to X. (⊗ is Kronecker product A ⊗ B := ((Aij B))) ➠

classical Lyapunov theorem with U = 0, V = 1



intersection by diagonal augmentation of U , V .

other specs. can be combined by just merging LMI constraints – p. 64/129

LMI regions- proof of sufficiency

65

condition is X ≻ 0,

U ⊗ X + V ⊗ (A′ X) + V ′ ⊗ (XA) ≺ 0 .

pick an eigenpair of A, (λ, v), Av = λv, and pre- and post-multiply inequality by I ⊗ v ∗ , I ⊗ v, gives >0

Hence,

z }| { (v ∗ Xv) (U + λ∗ V + λV ′ ) < 0 U + λ∗ V + λV ′ < 0.

Implies λ∗ , λ are in R. – p. 65/129

multi-objective/channel controller synthesis

• • • • •

66

formulation linearizing change of variables state-feedback synthesis output-feedback synthesis projected form.

– p. 66/129

controller synthesis

67

• synthesis structure z1 z2

.. .

.. .

P

y

w1 w2

u K

• given P (s), find K(s) to achieve a set of specifications for channels w1 → z1 , w2 → z2 , ...

– p. 67/129

example of multi-channel/objective problem 68 min kTw21 ←z21 k2 1 ←z 1 k∞ < γ1 , kTw∞ ∞

kTw22 ←z22 k2 < γ2

poles in LMI region R .

– p. 68/129

controller synthesis - data

69

• synthesis interconnection  d x = Ax + B w + B u, A ∈ Rn×n  1 2  dt P (s) z = C1 x + D11 w + D12 u   y = C2 x + D21 w • controller ( d x = A x + B y, K K K K d t K(s) u = C K xK + D K y

AK ∈ Rn×n

• Stability, Perfo.: H∞ , H2 , pole plac. on various channels – p. 69/129

derivation ➠

compute closed-loop data



write stability/performance (ineq.) conditions in closed loop



apply congruence transformations



use suitable linearizing transformations

70

– p. 70/129

controller synthesis - state-feedback

71

• turns out to be very simple problem  n×n  x ˙ = Ax + B w + B u, A ∈ R 1 2  P (s) z = C1 x + D11 w + D12 u   y = x ←− measurable state vector

and

u = Kx ←− state-feedback

closed-loop data are x˙ = (A + B2 K)x + B1 w z = (C1 + D12 K)x + D11 w – p. 71/129

state-feedback H∞ synthesis • characterization is X ≻ 0 and  ∗ (A + B2 K)′ X + ∗  −γI B1′ X C1 + D12 K D11

72



∗ ∗ ≺0 −γI

perform congruence transformation diag(Y = X −1 , I, I) to get Y ≻ 0 and  (A + B2 K)Y + Y (A + B2 K)′ ∗  (C1 + D12 K)Y −γI ′ D11 B1′



∗ ∗  ≺ 0, −γI – p. 72/129

state-feedback H∞ synthesis continued

73

note Y is invertible perform change of variable W = KY to get LMI !: Y ≻ 0 and   AY + Y A′ + B2 W + (B2 W )′ ∗ ∗  C1 Y + D12 W −γI ∗  ≺ 0. ′ −γI D11 B1′

• note change of variable is without loss (NSC) • when solved, deduce (state-feedback) controller using K = W Y −1 .

– p. 73/129

trick

74

• ⇐ (Y, KY ) solution → (Y, W ) easy • ⇒ (Y, W ) solution → (Y, KY ) note that term B2 W is B2 W Y −1 Y hence (Y, K = W Y

−1

) is a solution.

– p. 74/129

state-feedback H2 synthesis

75

• similar derivation • characterization (A + B2 K)′ X + ∗ + (C1 + D12 K)′ (C1 + D12 K) ≺ 0, Tr (B1′ XB1 ) < η 2 become via Schur complements   ′ (A + B2 K) X + ∗ ∗ ≺0 (C1 + D12 K) −I   ′ Z B1 2  0, Tr Z < η B1 X −1 – p. 75/129

state-feedback H2 synthesis

76

• perform congruence transformations diag(Y = X −1 , I) and diag(I, Y ) to get   AY + B2 KY + ∗ ∗ ≺0 C1 Y + D12 KY −I   Z B1′ Y  0, Tr Z < η 2 Y B1 Y

– p. 76/129

state-feedback H2 synthesis

77

• change of variable W = KY yields LMIs !   AY + B2 W + ∗ ∗ ≺ 0, C1 Y + D12 W −I with



Z Y B1

B1′ Y Y



 0, Tr Z < η 2

– p. 77/129

state-feedback pole clustering

78

• similarly Y ≻ 0 and U ⊗ Y + V ⊗ (A + B2 K)Y + V ′ ⊗ Y (A + B2 K)′ ≺ 0 . change of variable W = KY leads to LMI!: Y ≻0 U ⊗ Y + V ⊗ (AY + B2 W ) + V ′ ⊗ (AY + B2 W )′ ≺ 0 .

– p. 78/129

multiple constraints

79

• the Y ’s are not the same for all perfs. • hard problem is relaxed by taking a single Y for all perfs. • technique is constantly refined to exploit different Y ’s by spec. (active area).

– p. 79/129

output feedback case - closed-loop data

80

• 2 4

A

B1

C1

D11

3

2

6 5 := 6 4

A

0

B1

0

0

0

C1

0

D11

3 2 7 6 7 +6 5 4

0

B2

I

0

0

D12

3

»

7 AK 7 5 CK

BK DK



2 4

0

I

0

C2

0

D21

3

5,

• Above analysis condition must be satisfied in closed-loop. Synthesis conditions in 3 steps 1- introduce a single variable P common specification/channel (conservative step), 2- perform adequate congruence transformations, 3- use linearizing changes of variables to end up with LMI synthesis conditions. – p. 80/129

linearizing change of variable

81

Introduce notation 2

P=4

X N′

N ⋆

3

5,

P

−1

2

=4

Y

M

M′



3 5

From PP −1 = I infer 2

PΠY = ΠX with ΠY := 4

Y M′

I 0

3

2

5 , ΠX := 4

I

X

0

N′

3

5.

Define change of variable (wlog N , M are invertible) 8 bK < A

:=

N AK M ′ + N BK C2 Y + XB2 CK M ′ + X(A + B2 DK C2 )Y,

:=

b K := CK M ′ + DK C2 Y, D b K := DK . N BK + XB2 DK , C

(1)

: b BK

and, perform congruence transformations to get

bK , B bK, C bK, D bK ! linear terms in the new variables X, Y, A – p. 81/129

LMI for H∞ specification 2

L11 6 6 A b K C2 )′ 6 bK + (A + B2 D 6 6 b K D21 )′ (B1 + B2 D 4 bK C1 Y + D12 C

where

b′ + (A + B2 D b K C2 ) A K L22 b K D21 )′ (XB1 + B b K C2 C1 + D12 D

82









−γI



b K D21 D11 + D12 D

−γI

3

7 7 7 7≺0 7 5

b K + (B2 C b K )′ , L22 := A′ X+ XA + B b K C2 + (B b K C2 )′ . L11 := AY + YA′ + B2 C

• similarly for H2 and LMI region specs. • for multi- channel/objective just stack together various LMI specs.

– p. 82/129

LMI for H2 specification 2 6 6 4

b K + (B2 C b K )′ + B2 C ∗ b K C2 )′ b K C2 + (B b K C2 )′ + (A + B2 D A′ X+ XA + B bK b K C2 C1 Y + D12 C C1 + D12 D

83



YA′

AY + bK A 2 6 6 4

I

X b K D21 )′ (XB1 + B b K D21 )′ (B1 + B2 D b K D21 = 0. Tr (Q) < ν, D11 + D12 D

"

Y I I X

3

−I

b K D21 B1 + B2 D 7 b XB1 + BK D21 7 5 ≻ 0, Q

I

Y



#

3

7 7 ≺ 0, 5

≻0

– p. 83/129

LMI region constraint specification

84

• congruence diag(ΠY , . . . , ΠY ) yields h i i   h b b AY +B2 CK A+B2 DK C2 Y I + ∗ ≺ 0. λjk I X + µjk bK b K C2 A XA+ B "

Y I I X

#

≻0

– p. 84/129

multiple constraints

85

• again for multiple constraints take the same X, Y bK B b K , . . . for all LMIs. and A

• controller construction: just reverse the change of variables

– p. 85/129

pure H∞ synthesis: projected characterization86 For a single objective, LMI can be simplified, Projection Lemma yields 2 4 2 4

NY

0

0

I

NX

0

0

I

3′

2

6 5 6 4 3′

2

6 5 6 4

YC1′

B1

C1 Y

−γI

D11

B1′

′ D11

−γI

XB1

C1′ ′ D11

AY +

A′ X+

YA′

XA

B1′ X

−γI

C1

D11

−γI

3

2

7 N 74 Y 5 0 3

2

0

Y

I

I

X

4 NY and NX null spaces of

B2′

′ D12

i

and

h

C2

I

7 N 74 X 5 0 2

h

0

D21

I

3



0

3 5



0

3 5



0.

5

i ,

– p. 86/129

avantages of LMI formulations ➠

very general wrt DGKF, no assumptions required



singular problems



admits similar discrete-time counterpart



has educational value for students (shorter proofs)



See http://www.cert.fr/dcsd/cdin/apkarian/ for details



See M ATLAB LMI Control Toolbox for codes.

87

– p. 87/129

uncertain systems analysis

88

• Lyapunov technique • Time-invariant and time-varying parameters • Parameter-dependent Lyapunov functions.

– p. 88/129

analysis of uncertain systems - example

89

Consider the uncertain system d x(t) = A(δ) x(t); dt

x(0) = x0



δ = [δ1 , . . . , δL ]′ ∈ RL uncertain and possibly time-varying real parameters



A(δ) = A0 + δ1 A1 + . . . + δL AL

δ(t)

˙ δ(t)

is the system stable for all admissible δ(t) ? – p. 89/129

Affine Quadratic Stability (AQS)

90

The system is Affinely Quadratically Stable, if ∃ V (x, δ) := x′ P (δ)x,

P (δ) = P0 +δ1 P1 +. . .+δL PL

s. t. V (x, δ) > 0, dV /dt < 0 along all admissible parameter trajectories. • Lyapunov theory ⇒ (exponential) stability. P (δ) := P0 + δ1 P1 + . . . + δL PL L(δ, d δ) := A(δ)′ P (δ) + P (δ)A(δ) + dt

> 0 dP (δ) dt

< 0

• turned into LMIs ⇒ multi-convexity, S-procedure ,... ! – p. 90/129

analysis - continued • cases Time-Invariant Parameters Arbitrary rate of variation (quad. stab.)

• extensions H∞ , H2 , LMI regions,...

91

• components LFT uncertainties nonlinear components (IQC theory, (Rantzer & Megretsky) µ analysis

– p. 91/129

gain-scheduling and LPV control

92

• motivations and concepts • classes of LPV system • synthesis conditions for LFT systems

– p. 92/129

motivations #1

93



handle full operating range



gain-scheduled controllers exploit knowledge on the plant’s dynamics in real time knowledge on plant

measurement signal

controller dynamics

control signal

controller mechanism is changed during operation

– p. 93/129

motivations #2

94

Gain-Scheduling techniques are applicable to • Linear Parameter-Varying Systems (LPV): d x = A(θ)x + B(θ)u , dt y = C(θ)x + D(θ)u. where θ := θ(t) is an exogenous variable. • “ Quasi-Linear” Systems: d x = A(y )x + B(y )u , sche sche dt y = C(ysche )x + D(ysche )u. where ysche is a sub-vector of the plant’s output y. – p. 94/129

motivations #3 ➠

to get higher performance



some LPV system are not stabilizable via a fixed LTI controller



bypass critical phases of pointwise interpolation and switching



engineering insight is preserved (freeze scheduled variable for analysis).



nonlinear models can be handled by immersion into an LPV plant.

95

– p. 95/129

LPV systems in practice ➠ Aeronautics (longitudinal motion of aircraft)           0 α˙ −Zα 0 α az −Zα V + = δ, = q˙ −mα 0 q mδ q 0

96

  α 0 , q 1

where Zα , mα and mδ are functions of speed, altitude and angle of attack. ➠ Robotics (flexible two-link manipulator) M (θ2 )¨ q (t) + Dq(t) ˙ + Kq(t) = F u(t), where θ2 is the scheduled variable (conf. of 2nd beam). ➠ and many others

– p. 96/129

example: different control principles -d 6

-d 6

-d 6

- K

- P (θ)

97

y

-

Robust control

?

θ

- K(θ)

- P (θ)

y

-

LPV control ?

- K(y)

y - P (y)

y

-

Output gain-scheduling

– p. 97/129

description of LPV systems

98

• LPV systems x˙ = A(θ)x + B(θ)u , y = C(θ)x + D(θ)u. are characterized by





A() B() ➠ the functional dependence of on θ, C() D() ➠

the operating domain Θ of the system trajectories, θ(t) ∈ Θ,



the rate of variations of θ(t) (if available) in the form of bounds θ˙i (t) ∈ [θi ; θ¯i ]. – p. 98/129

LPV / LTV / LTI systems LPV

99

θ(t) ∈ Θ ∀t ≥ 0

select trajectory

freeze parameter

θ = θ0

θ(t) = θ∗ (t)

LTV

LTI freeze the time

t = t0

– p. 99/129

LPV / LTV / LTI - off-line vs. on-line

100



LTI and LTV systems are off-line systems, the state-space data A, B,... and A(t), B(t),... must be known in advance.



LPV systems are on-line systems since the dynamics depend on the trajectory θ(t) experienced by the plant in Θ. Θ

θ(t) – p. 100/129

LPV systems interpretations x˙ = A(θ)x + B(θ)u, y = C(θ)x + D(θ)u.

101

θ(t) ∈ Θ

• θ may be subject to various assumptions: ➠

θ(t) is uncertain → robust control problem,

θ(t) is known in real-time → Gain-scheduling problem,   θ1 (t) ➠ θ(t) := , where θ1 is known and θ2 is θ2 (t) uncertain → mixed problem ➠

– p. 101/129

LPV pathologies - LPV/LTI stabilities

102

• stability over a domain ➠

LTI Stability : Reλi (A(θ)) < 0, ∀θ ∈ Θ,



LPV Stability : Φθ (t) → 0, for t → ∞, for all trajectory θ(t) in Θ.

• intuitive conjectures like ➠

LTI stability ⇒ LPV stability,



LPV stability ⇒ LTI stability,

are FALSE !

– p. 102/129

LPV vs. LTI Stability • Conjecture #1    −1 + aθ12 x˙ 1 = x˙ 2 −1 + aθ1 θ2

1 + aθ1 θ2 −1 + aθ22

103



x1 x2



,

with trajectories θ1 := cos(t) and θ2 (t) := sin(t) is LTI stable (for a < 2) but LPV unstable. • Conjecture #2      2 −1 − 5θ1 θ2 1 − 5θ1 x˙ 1 x1 = , 2 x˙ 2 x2 −1 + 5θ2 −1 + 5θ1 θ2 with trajectories θ1 := cos(t) and θ2 (t) := sin(t) is LTI unstable (poles +1 and −3) but LPV stable. – p. 103/129

creating an LPV stability

104

consider the autonomous LPV system: x¨ + ω 2 (t)x = 0 , where we are allowed to switch between two values ω1 and ω2 . x2

system trajectories

switch

ω1 ω2

x1

unstable behavior

– p. 104/129

Slowly Varying LPV Systems x˙ = A(θ)x

105

• Sufficient stability cond. • Sufficient instability cond. (1) Reλi (A(θ)) < 0, (1) Reλi (A(θ)) < 0, ˙ < α, with α sufficiently i = 1, . . . , k (2) kθk small, (2) Reλi (A(θ)) > 0, ⇒ LPV stability (Rosen. 63) i = k + 1, . . . , n (3) stable and unstable eigenvalues do not mix ˙ < α, with α sufficiently (4) kθk small, ⇒ LPV instability (Skoog 72)

– p. 105/129

slowly varying parameters

106

LPV stability can be inferred from LTI stability for slowly varying parameters (but not constructive conditions).

– p. 106/129

LPV systems in the LFT class

107

diag(θi Iri )



 A Bθ B  Cθ Dθθ Dθ•  C D•θ D

y

»

A(θ) C(θ)

B(θ) D(θ)



:=

»

A C

B D



»

Bθ + D•θ



u

Θ(I − Dθθ Θ)−1 [ Cθ

Dθ• ] , – p. 107/129

LPV systems in the polytopic class θ ∈ Θ , Θ Polytope

»

A(θ) C(θ)

B(θ) D(θ)



∈ Cov

»

Ai Ci

Bi Di



108

, i = 1, 2, . . . , r



θ(t)

»

y

A1 B1 C1 D1

»



A2 B2 C2 D»2

– Ai Bi Ci Di

u –

– p. 108/129

general LPV systems

109

A(θ), B(θ), C(θ), D(θ) are arbitrary but continuous matrix-valued function of θ. • far more difficult to handle but of great practical interest since they capture arbitrary nonlinearities

– p. 109/129

formulation of synthesis problem: LFT

110

Θ

w

z

P (s) y

u

K(s)

Θ

gain−scheduled controller – p. 110/129

formulation of synthesis problem: LFT

111

find LPV controller Fl (K(s), Θ(t)) s.t. ➠

closed-loop stability,



the L2 -induced norm of the operator Tw→z satisfies kTw→z (Θ)k < γ

for all admissible trajectory θ(t).

– p. 111/129

LPV-LFT systems: notations

112

• 

Dθθ P (s) =  D1θ D2θ

Dθ1 D11 D21

 



Dθ2 Cθ D12 + C1  (sI−A)−1 [ Bθ D22 C2

B1

B2 ] ,

Assumptions: (A, B2 , C2 ) stabilizable  and  detectable,  D22 = 0. Cθ b Dθθ Dθ1 b b Notations: B1 = [ Bθ B1 ], C1 = , D11 = , C1 D1θ D11 T T 0 ], D12 NY := Ker [ B2T Dθ2 NX := Ker [ C2 D2θ D21 0 ].

– p. 112/129

LPV-LFT systems - proof scheme synthesis structure

113

synthesis structure with parameter augmentation

Θ(t) zθ



z

w

P(s)

y

w ˜θ

Θ(t)

0

0

Θ(t)



u z

z˜θ

wθ w

P(s)

K(s)

Pa (s) z˜θ

w ˜θ Θ(t)

y



u K(s)

u ˜

– p. 113/129

LPV-LFT systems - proof scheme

114



redraw the control configuration into a robust control problem with repeated uncertainty,



formulate the Bounded Real Lemma with scalings for the closed-loop system,



apply the Projection Lemma to derive the LMI characterization.

– p. 114/129

LMI characterization

NYT

2 AY + Y AT 6 Cθ Y + Γ3 BθT 6 6 C1 Y 6 4 Σ BT B1T

⋆ T −D Γ −Σ3 + Γ3 Dθθ θθ 3 −D1θ Γ3 T Σ3 Dθθ T Dθ1

⋆ ⋆ −γI T Σ3 D1θ T D11

⋆ ⋆ ⋆ −Σ3 0

⋆ 3 ⋆ 7 7 ⋆ 7 7 NY ≺ 0, ⋆ 5 −γI

2 AT X + XA 6 BθT X + T3 Cθ 6 6 B1T X 6 4 S3 Cθ C1

⋆ T T −S3 + T3 Dθθ − Dθθ 3 T T −Dθ1 3 S3 Dθθ D1θ

⋆ ⋆ −γI T S3 Dθ1 D11

⋆ ⋆ ⋆ −S3 0

⋆ 3 ⋆ 7 7 ⋆ 7 7 NX ≺ 0, ⋆ 5 −γI

3

T NX

115

θ

»

Y I

I X



0

S3 ≻ 0, Σ3 > 0; T3 , Γ3 skew − symmetric .

– p. 115/129

scaling sets asso. with structure Θ ⊕ Θ

116

• symmetric SΘ := {S : S > 0, SΘ = ΘS} • symmetric augmented   S1 S2 SΘ⊕Θ = { T : S1 , S2 ∈ SΘ and S2 Θ = ΘS2 , ∀Θ ∈ Θ}. S2 S3 • skew-symmetric   T1 T2 TΘ⊕Θ = { : T1 , T2 ∈ TΘ and T2 Θ = ΘT2 , ∀Θ ∈ Θ}. T −T2 T3

– p. 116/129

robust synthesis condition 

ATcℓ Xcℓ + Xcℓ Acℓ  B T Xcℓ + T Ccℓ cℓ Ccℓ

T T Xcℓ Bcℓ + Ccℓ T T T −S + T Dcℓ + Dcℓ T Dcℓ

where

117



T Ccℓ T ≺ 0 Dcℓ −S −1



Acℓ, Bcℓ , . . . closed-loop data



S, T scalings for Θ ⊗ Θ ⊗ ∆, and ∆ fictitious performance block.

– p. 117/129

cast in Projection Lemma form

118

Can be rewritten Ψ + QTX ΩP + P T ΩT QX ≺ 0, where 

T

A Xcℓ + Xcℓ A Ψ =  B1T Xcℓ + T C1 C1

P = [ C2

D21

0],

C1T T T



C1T T  D11 −1

Xcℓ B1 + , −S + T D11 + D11 T T D11 −S   T T T T QX = B2 Xcℓ D12 T D12 .

– p. 118/129

cast in Projection Lemma form continued 2

2

A 4 C1 C2

B1 D11 D21

6 6 6 6 3 6 6 B2 6 6 5 D12 = 6 6 6 ΩT 6 6 6 6 4

A

0

0



B1

0

B2

0

0

0

0

0

0

I

0

0

0

0

0

0

0

0

0

I



0

0

Dθθ

Dθ1

0

Dθ2

0

C1

0

0

D1θ

D11

0

D12

0

0

I

0

0

0

AT K

T CK1

T CKθ

C2

0

0

D2θ

D21

T BK1

T DK11

T DKθ1

0

0

I

0

0

T BKθ

T DK1θ

T DKθθ

and b1 = [ Bθ B

B1 ] ,

b1 = C

»

– Cθ , C1

b 11 = D

»

Dθθ D1θ

Dθ1 D11



119

3 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 5

.

• LMI characterization follows from explicit computation of projections and using matrix completion Lemmas.

– p. 119/129

LPV-LFT systems - controller Construction120 ➠

Testing solvability falls within the scope of convex semi-definite programming



A gain-scheduled controller is easily constructed from the quadruple (Y, X, L3 , J3 ) by solving a scaled Bounded Real Lemma LMI condition. u

y 3 AK BK1 BKθ 4 CK1 DK11 DK1θ 5 CKθ DKθ1 DKθθ 2

Θ(t) – p. 120/129

other variants of this technique ➠

polytopic LPV systems



general LPV systems (capture slow variations of parameters)



LFT systems ang generalized scalings



multi-objective/channel LPV synthesis

121

see webpage: http://www.cert.fr/dcsd/cdin/apkarian/

– p. 121/129

hard non-LMI problems

122

• most analysis problems reduce to LMIs • some synthesis problems reduce to LMIs but • many practical problems do not reduce to LMI/SDP (synthesis) ➠

reduced- and fixed-order synthesis (PID H∞ , etc.)



structured and decentralized synthesis problems



general robust control with uncertain and/or nonlinear components



simultaneous model/controller design, multimodel control



unrelaxed LTI and LPV multi-objective



combinations of the above – p. 122/129

new algorithms for hard problems

123

new algorithms needed ! good research direction

– p. 123/129

example: synthesis of static controller stabilize x˙ = Ax + Bu y = Cx with u = Ky (K static )

124

has characterization NC′ (A′ X + XA)NC < 0 ′ ′ ′ (Y A + AY )N < 0 N ′ B B   X I > 0 I Y XY − I = 0

constraints XY − I = 0 leads to hard problems LMI + nonlinear equality constraints

– p. 124/129

augmented Lagrangian method

125

with g(x) = 0 equ. constraints and A(x) ≺ 0 LMI, replace the difficult program by the more convenient (Pλ,µ )

minimize c′ x + λ′ g(x) + µ1 kg(x)k2 subject to A(x)  0



µ is penalty, xµ → x∗ when µ → 0



for good estimates λ (Lagrange multiplier), solution of (Pλ,µ ) is close to solution of original problem



use first-order update rule to improve estimate λ



solve (Pλ,µ ) by a succession of SDPs – p. 125/129

augmented Lagrangian method



B. Fares and P. Apkarian and D. Noll, IJC, 2001



B. Fares and D. Noll and P. Apkarian , SIAM Cont. Optim. 2002



P. Apkarian and D. Noll and H. D. Tuan, 2002, IJRNC to appear.



D. Noll and M. Torki and P. Apkarian, working paper, 2002

126

– p. 126/129

conclusions, perspectives

127



A single framework for a great variety of methods



LMI techniques extend the scope of classical techniques



LPV control is a very successful example (industrial)



Analysis meth. immediately applicable for validation



Have educational merits see http://www.cert.fr/dcsd/cdin/apkarian/ for course plan



not discussed: robust filtering and estimation, combinatorial optimization, graphs, etc. – p. 127/129

recent concrete control applications ➠

Analysis robustness evaluation of controllers for: ➟ ARIANE Launcher ➟ satellites ➟ long flexible civil aircraft (structural modes)



Synthesis Preliminary tests show that LPV controllers are competitive for launcher control in atmospheric flight



Synthesis control of the landing phase for civil aircraft under study with multiobjective LMI methods



Synthesis Missiles ? still on paper

128

– p. 128/129

The End

129

GRAZIE MILLE !

– p. 129/129