1
L INEAR M ATRIX I NEQUALITIES AND S EMI D EFINITE P ROGRAMMING I MPACTS ON C ONTROL S YSTEM D ESIGN P. Apkarian
Université Paul Sabatier & ONERA-CERT Mathématiques pour l’Industrie et la Physique CNRS UMR 5640
invitation by Professor Giuseppe Franze - Calabria University
– p. 1/129
Seminar Outline ♦ ♦ ♦ ♦ ♦ ♦ ♦ ♦
2
Linear Matrix Inequalities and SDP Tricks to reformulate into LMIs System concepts via LMIs Multi-channel/objective with LMIs Uncertain systems analysis Gain-scheduling and LPV synthesis Hard non-LMI problems Conclusions, perspectives.
– p. 2/129
Linear Matrix Inequalities and SDP
• • • • • •
3
Definitions, manipulations Schur’s complements Classes of convex optimization problems Semi- Definite Programming Algorithms to solve SDP, duality, complexity Software, links.
– p. 3/129
linear matrix inequalities
4
an LMI is a constraint on a vector x ∈ Rn : F (x) := F0 + x1 F1 + . . . + xn Fn 0, where F0 , F1 , . . . , Fn are symmetric matrices ➠
is inequality on symmetric matrix cone
➠
LMI equivalent to λmin (F (x)) ≥ 0
➠
F (x) 0 iff η ′ F (x)η ≥ 0, ∀η
➠
F (x) 0 iff det {ppal mat.} ≥ 0
➠
F (x) ≻ 0 iff η ′ F (x)η > 0, ∀η 6= 0
– p. 4/129
geometry of LMIs ➠
5
an LMI de fine a convex set F (λx+(1−λ)y) = λF (x)+(1−λ)F (y) 0 whenever F (x) 0, F (y) 0
➠
➠
set with non necessarily smooth boundary (corners)
LMI plane and curved faces
describe wide variety of constraints
– p. 5/129
LMI - diagonal augmentation
6
LMI constraints F1 (x) 0, . . . , Fq (x) 0 are equivalent to single LMI constraint
F1 (x) 0 .. .
0 ... 0
... 0 0 Fq (x)
LMI 2
LMI 1
LMI 3
– p. 6/129
linear constraints
7
finite set of scalar linear (affine) constraints a′i x ≤ bi , i = 1, . . . , m can be represented as LMI F (x) 0, with F (x) = diag(a′1 x − b1 , . . . , a′m x − bm )
polyhedral LMI
– p. 7/129
Schur complements
8
partitioned symmetric matrix P1 P2 P := P2′ P3 S = P3 − P2′ P1−1 P2 is the Schur complement of P1 in P (provided P1 invertible) Schur complement lemmas ➠
P ≻ 0 if and only if P1 ≻ 0 and S ≻ 0
➠
if P1 ≻ 0, then P 0 if and only if S 0
– p. 8/129
Schur complement consequence
9
complicate constraint in variable x P3 (x) − P2 (x)′ P1 (x)−1 P2 (x) ≻ 0 is turned into simpler one P1 (x) P2 (x) ≻ 0. ′ P2 (x) P3 (x) provided that P1 (x) ≻ 0.
– p. 9/129
ellipsoidal constraints
10
an ellipsoid can be described in different ways • as kAx + bk ≤ 1, iff I Ax + b 0 ′ (Ax + b) 1 • as (x−x0 )′ W (x−x0 ) ≤ 1, with W > 0 iff 1 (x − x0 )′ 0 −1 (x − x0 ) W
LMI
– p. 10/129
fractional constraints
11
consider fractional constraints (c′ x)2 d′ x
≤ t
Ax + b ≥ 0 (assume d′ x > 0, whenever Ax + b ≥ 0) can be represented as ′ t cx 0 ′ ′ cx dx Ax + b ≥ 0 – p. 11/129
convex quadratic constraints
12
Convex quadratic constraints can be rewritten (Ax + b)′ (Ax + b) − c′ x − d ≤ 0 has the LMI representation I Ax + b 0 ′ ′ (Ax + b) c x + d • can be used to show that convex quadratic programming can be solved via SDP
– p. 12/129
classes of convex optimization problems
13
• linear prog. (LP) minimize c′ x, Ax b (componentwise) • convex quadratic prog. (CQP) Qj 0 minimize x′ Q0 x + b′0 x + c0 s.t. x′ Qi x + b′i x + ci ≤ 0 All (and others) are generalized by SDP !:
– p. 13/129
LMIs in control with P variable • Lyapunov inequality A′ P + P A ≺ 0 can be represented in canonical form F0 +
n X i=1
xi Fi ≺ 0
14
pick a basis (Pi )i of the symmetric matrices, X P = xi Pi i
hence recover the canonical form with F0 = 0,
Fi = A′ Pi +Pi A
– p. 14/129
symmetric matrix expressions are LMIs
15
• Any (symmetric) linear constraints in the variables X, Y AY B + (AY B)′ + X + . . . 0 can be represented in the canonical form F (x) = F0 + x1 F1 + . . . + xn Fn 0 by appropriate selection of the Fi ’s.
– p. 15/129
Riccati and quadratic matrix inequality
16
quadratic matrix inequality in P A′ P + P A + P BR−1 B ′ P + Q 0 where R > 0, is equivalent to LMI ′ A P + PA + Q PB 0 ′ BP −R (proof by Schur complements) Riccati-based control method can be solved via LMIs
– p. 16/129
classes of semidefinite programs
17
• I feasibility problem: find x : F0 + x1 F1 + . . . + xn Fn 0 • II linear objective minimization subject to LMIs minimize c′ x, s.t. F0 + x1 F1 + . . . + xn Fn 0 • III generalized eigenvalue minimization minimize subject to
λ A(x) − λB(x) 0, B(x) 0, C(x) 0
(A, B, C affine symmetric expressions in x) – p. 17/129
solving LMI - a rich set of algorithms
18
much work and progress since 1990 ! ➠
primal interior-point method (method of centers)
➠
primal-dual interior-point method
➠
non-differentiable methods (bundle, ...)
Primal-dual methods very efficient. other fast algorithms under development (aug. Lagrangian)
– p. 18/129
central property
19
because of structure and convexity algorithms are guaranteed to find global solutions !
– p. 19/129
primal-dual IPMs
20
ideas: ➠
instead of working in primal space, formulate problem in “primal-dual” space
➠
target objective is duality gap, and is zero at optimum
➠
try to solve (Lagrange) optimality conditions
– p. 20/129
SDP duality • primal
21
• dual
min c′ x s.t. F (x) 0
max − Tr (F0 Z) s. t.Z ≻ 0, Tr Fi Z = ci • optimality cond. if (x, Z) is primal-dual feasible c′ x =
n X i=1
≥0
z }| { xi Tr ZFi =Tr ZF (x) −Tr ZF0 ≥ −Tr ZF0
hence global optimality pairs (x, Z) such that Tr ZF (x) = 0 since primal and dual obj. coincide at solution
– p. 21/129
mechanisms of primal-dual algorithms solve subject to
22
Tr ZF (x) = 0 Pn F0 + i=1 xi Fi 0
Z ≻ 0, Tr Fi Z = ci , i = 1, . . . , n • Actually, one tries to solve Tr ZF (x) = µI for decreasing value of µ (µ −→ 0) • Newton steps for the linearization of Tr ZF (x). • superlinear convergence can be guaranteed kxk+1 − xopt k ≤ kxk − xopt kq , q > 1 very efficient in practice ! – p. 22/129
SDP software ➠
MATLAB LMI toolbox by Gahinet, Chilali, Laub, Nemirovski
➠
DSDP by Benson, Ye
➠
SDPpack by Alizadeh, Haeberly, Nayakkankuppam, Overton
➠
SeDuMi by Sturm
➠
Imitool-2.0 by Boyd et al.
➠
Cutting plane methods by Helmberg, Oustry, Kiwiel, etc.
➠
Many others ...
23
– p. 23/129
ftp addresses for SDP
24
• ftp addresses, codes, papers, courses on SDP
http://orion.math.uwaterloo.ca:80/ hwolkowi/henry/software/readme.html# http://www.zib.de/helmberg/semidef.html http://rutcor.rutgers.edu/ alizadeh/sdp.html
– p. 24/129
tricks to turn hard problems into LMIs • • • • • • •
25
Schur’s complements (see previous) LMIs and quadratic forms multi-convexity, monotonicity, etc. Finsler’s lemmas Projection lemmas changes of variables augmentation by slack
– p. 25/129
S-Procedure and quadratic inequalities ➠
26
S-Procedure transforms quadratic problems into LMIs(possibly conservative)
given Qi ’s symmetric or hermitian matrices, define F0 (x) = x′ Q0 x, F1 (x) = x′ Q1 x, . . . , FL (x) = x′ QL x, F0 (x) < 0 over the set F1 (x) ≤ 0, . . . , FL (x) ≤ 0 whenever ∃s1 ≥ 0, . . . , sL ≥ 0 (slacks), such that F0 (x) −
L X i=1
si Fi (x) < 0 or LMI Q0 −
L X
si Qi 0
i=1
– p. 26/129
Finsler’s Lemma
27
• converts checking the sign of a quadratic form over a subspace into solving an LMI problem x′ Qx < 0, ∀x 6= 0, M x = 0 if and only there exists a scalar σ such that Q − σM ′ M ≺ 0 M x = 0 can also be formulated as x′ M ′ M x = 0 • proof via convexity of numerical ranges
– p. 27/129
generalized Finsler’s Lemma
28
• convert family of constrained quadratic inequalities into an LMI feasibility problem Q = Q′ and M given, and a compact subset of real matrices U we have the equivalence • for all U ∈ U, x′ Qx < 0, ∀x 6= 0 with U M x = 0, iff there exists Θ s.t. Q + M ′ ΘM ≺ 0 ∀U ∈ U NU′ ΘNU 0, where NU is basis of nullspace of U – p. 28/129
multi-convexity
29
given a function f (δ1 , . . . , δK ) • it is multi-convex function if separately convex along each direction δi • multi-convexity is weaker than convexity • convexity iff • multi-convexity iff ∂2 f (δ)]1≤i,j≤K 0 [ ∂δi δj
∂2 f (δ) ≥ 0, 2 ∂δi
i = 1, . . . , K
Turn parameter-dependent LMIs into finite set of LMIs.
– p. 29/129
projection Lemma - two-Sided
30
given Ψ = Ψ′ ∈ Rm×m , P, Q of column dim. m find X such that Ψ + P ′ X ′ Q + Q′ XP ≺ 0 let columns of NP , NQ form bases of the null spaces of P and Q inequality is solvable for X if and only if NP′ Ψ NP ≺ 0 NQ′ Ψ NQ ≺ 0 (Gahinet & Apkarian 1993)
– p. 30/129
System concepts via LMIs
• • • • •
31
Stability L2 gain or H∞ norm H2 norm Pole clustering ...
– p. 31/129
stability & equilibria
32
• Equilibrium points x˙ = f (x) are defined as the solutions x∗ of 0 = f (x∗ ). system has trajectory x(t) = x∗ , ∀t ≥ 0 if initialized at x∗ From now on, we assume x∗ = 0.
– p. 32/129
types of stability
33
• stability (simple) ∀R > 0, ∃r > 0, kx(0)k < r ⇒ ∀t ≥ 0, kx(t)k < R • asymptotic stability if it is stable and ∃r > 0, kx(0)k < r ⇒ x(t) → 0, as t → ∞ • exponentially stable if ∃ α > 0 and λ > 0 s. t. ∀t > 0, kx(t)k ≤ αkx(0)ke−λt in some ball. λ rate of conv. – p. 33/129
positive-definite functions of the state
34
Assume D is open region containing x∗ = 0. • A function V (x) from Rn into R is positive semi-definite on a domain D if (1) V (0) = 0 (2) V (x) ≥ 0, ∀x ∈ D • A function V (x) from Rn into R is positive definite on a domain D if (1) V (0) = 0 (2) V (x) > 0, ∀x ∈ D, x 6= 0
– p. 34/129
postive-definite functions: level curves
35
x2 V (x) = c2 V (x) = c1 O V (x) = c3
x1
c3 < c2 < c1
Typical level curves of positive-definite functions
– p. 35/129
Lyapunov derivatives
36
• if x is state of system x˙ = f (x), then V (x) is implicitly a function of time. Its time derivative is ′
′
∂V dV (x) ∂V ˙ = x˙ = f (x) V (x) = dt ∂x ∂x since x is constrained to satisfy x˙ = f (x). • it is referred to as derivative of V along the system trajectories (also Lyapunov’s derivative).
– p. 36/129
Lyapunov function: definition
37
• V (x) is a Lyapunov function of the system x˙ = f (x) if ➠
it is C 1 with respect to x on D
➠
it is positive definite (see earlier) on D
➠
its derivative on the system trajectories is negative semi-definite, that is, V˙ (x) ≤ 0, on D
as a function of x.
– p. 37/129
Lyapunov theorem for local stability
38
• if in a ball around the origin (= x∗ ), there exists V (x) in C 1 such that V (x) is positive definite ➠ V˙ (x) is negative semi-definite ➠
then the equilibrium point x∗ = 0 is (loc.) stable. It is asymptotically stable if V (x) is negative definite, i. e., V˙ (x) < 0, ∀x 6= 0, x ∈ ball as a function of x. • global stability if ball = Rn .
– p. 38/129
Lyapunov geometry
39
∂V (x) ∂x
x(t) ˙ x(t) 0
V˙ (x) < 0
angle between derivatives is greater than 90 deg.
– p. 39/129
stability for linear systems
40
the system d x = Ax dt is exponentially stable if and only if there exists X with X ≻ 0, A′ X + XA ≺ 0 why ? ′
V (x) = x Xx is a quadratic Lyapunov function
– p. 40/129
proof of exponential stability
41
perturb Lyapunov LMI to A′ X + XA + εX ≺ 0
for any state trajectory x(t), we infer x(t)′ (A′ X + XA)x(t) + εx(t)′ Xx(t) ≤ 0 and thus d ′ ′ x(t) Xx(t) + εx(t) Xx(t) ≤ 0 dt – p. 41/129
trick !
42
• note that solution of d x(t)′ Xx(t) + εx(t)′ Xx(t) = z(t) with z(t) ≤ 0 dt is V (x(t)) = x(0)′ Xx(0)e−εt +
Z
t
e−ε(t−τ ) z(τ )dτ
0
hence x(t)′ Xx(t) ≤ x(0)′ Xx(0)e−εt , ∀t ≥ 0 .
– p. 42/129
proof of exponential stability - continued
43
we have, with initial condition x(0) yields ′
′
x(t) Xx(t) ≤ x(0) Xx(0)e
−εt
finally, using ′
2
λmin (X)kxk ≤ x Xx ≤ λmax (X)kxk
2
gives kx(t)k ≤ kx(0)k
s
λmax (X) −εt/2 e for t ≥ 0 λmin (X)
system is exponentially stable ! – p. 43/129
necessity of Lyapunov inequalities
44
Assume A is stable (Re λi (A) < 0) and consider for Q ≻ 0, the (well-defined) integral Z ∞ d A′ t −Q = (e QeAt )dt dt 0 =
Z
∞ ′ A′ t
(A e
Qe
At
+e
A′ t
QeAt A)dt
0
′
= A P + P A with P :=
Z
∞ A′ t
e QeAt dt ≻ 0 0
– p. 44/129
necessity continued
45
finally, we have A′ P + P A = −Q ≺ 0,
P ≻ 0.
LMI problem has a solution whenever A is stable. • condition is iff • for linear systems quadratic Lyapunov functions are rich enough
– p. 45/129
energy gain or L2 gain
46
Energy gain not larger than γ: with w ∈ L2 and x(0) = 0, every trajectory of d x = Ax + Bw dt z = Cx + Dw should satisfy kzk2 ≤ γkwk2 , or
Z
∞ 0
z(t)′ z(t) dt ≤ γ 2
∀w ∈ L2 Z
∞
w(t)′ w(t) dt
0
– p. 46/129
H∞ norm
47
• stable and the L2 gain w −→ z is smaller than γ if and only if there exists X ≻ 0 " ′ ′ # A X + XA XB C B ′X −γI D′ ≺ 0 C D −γI • freq. domain kC(sI − A)−1 + Dk∞ < γ via KYP. • similarly, H2 norm, LQ, LQG, many others ...
– p. 47/129
H∞ norm
48
• necessity call for general LQ theory. • we shall only prove sufficiency.
– p. 48/129
proof of sufficiency
49
• Note first that the (1, 1) block of the LMI implies that A is stable • By Schur complement, LMI is rewritten ′ ′ A X + XA XB −1 C +γ D] ≺ 0 ′ ′ [C BX −γI D x(t) Left- and right-multiply with yields ... w(t)
– p. 49/129
proof of sufficiency cont.
50
d V dt}| { z x′ (A′ X + XA)x + x′ XBw + w′ B ′ Xx −γw′ w + γ −1 z ′ z ≤ 0
– p. 50/129
proof of sufficiency cont.
51
integrate over [0, T ] and exploit x(0) = 0: Z T x(T )′ Xx(T ) + γ −1 kz(t)k2 − γkw(t)k2 dt ≤ 0 0
Recall X ≻ 0 and take T → ∞ (w ∈ L2 ): Z ∞ Z ∞ kz(t)k2 dt ≤ γ 2 kw(t)k2 dt ≤ 0 0
0
Can perturb γ to γ − ε to get strict inequality
– p. 51/129
relation to frequency domain
52
For ω ∈ R, left- and right multiply with (jω − A)−1 B I to get γ −1 T (jω)∗ T (jω) − γI ≺ 0 hence kT (jω)k < γ,
∀ω ∈ R
From the right-lower block, we also get ′ −γI D ≺ 0 or kDk < γ D −γI – p. 52/129
relation to frequency domain
53
finally, kT (jω)k < γ for ω ∈ R ∪ {∞} hence, kT k∞ :=
sup
kT (jω)k < γ.
ω∈R∪{∞}
– p. 53/129
H2 performance
54
• H2 norm of T defined as s Z ∞ 1 Tr kT k2 := T (jω)∗ T (jω) dω 2π −∞ • in the time domain (via Parseval) sZ ∞
Tr (CeAt B)′ (CeAt B) dt
kT k2 :=
0
– p. 54/129
H2 performance computation
55
Easily computed by solving linear equation AP0 + P0 A′ + BB ′ = 0 ⇒ kT k22 = Tr (CP0 C ′ ) A′ Q0 + Q0 A + C ′ C = 0 ⇒ kT k22 = Tr (B ′ Q0 B) Why ? see stability notes. • note that D = 0 for H2 norm to be well defined.
– p. 55/129
stochastic interpretation of H2 norm
56
w white noise, x˙ = Ax + Bw, x(0) = 0, z = Cx. Recall: with solution of P˙ (t) = AP (t) + P (t)A′ + BB ′ ,
P (0) = 0
we have E(x(t)x(t)′ ) = P (t). Hence lim E(z(t)′ z(t)) = lim E(x(t)′ C ′ Cx(t))
t→∞
t→∞
= lim Tr E(Cx(t)x(t)′ C ′ ) t→∞
= Tr (CP0 C ′ ) = kT k22 • asymptotic variance of output of system. – p. 56/129
deterministic interpretation of H2 norm
57
let zj be impulse response to Bej δ(t) with standard unit vector ej of x˙ = Ax, x(0) = x0 , z = Cx Z
∞
′
zj (t) zj (t) dt = 0
P
Z
∞ 0
′ A′ t ′ Bj e C CeAt Bj
dt
v v = Tr (vv ) and j Bj Bj′ = BB ′ implies XZ ∞ kzj (t)k2 dt = kT k22 . ′
′
j
0
– p. 57/129
How to get LMI characterization ?
58
With A stable, it is easy to see that Tr (CP0 C ′ ) < γ 2 for AP0 + P0 A′ + BB ′ = 0 if and only if there exists X with Tr (CXC ′ ) < γ 2 and AX + XA′ + BB ′ ≺ 0 . • for ⇐ take difference of Lyapunov conditions
– p. 58/129
How to get LMI characterization ?
59
• for ⇒ since trace inequality is strict and by continuity there exists ε > 0 and X such that AX + XA′ + BB ′ + εI = 0,
Tr (CXC ′ ) < γ 2 .
Note that AX + XA′ + BB ′ ≺ 0 and Z ∞ At ′ A′ t X= e (BB + εI)e dt ≻ P0 0
Hence, kC(sI−A)−1 Bk2H2 := Tr (CP0 C ′ ) < Tr (CXC ′ ) < γ 2 .
– p. 59/129
LMI characterization for H2 norm
60
♦ A is stable and kT k22 < γ if and only if Y ≻ 0 with Tr (CY C ′ ) < γ,
AY + Y A′ + BB ′ ≺ 0
or if and only if X ≻ 0 with Tr (B ′ XB) < γ,
A′ X + XA + C ′ C ≺ 0
– p. 60/129
regional pole constraints
61
• to shape transient responses of closed-loop system • damping, settling time, rise time related to location of poles • useful regions: vertical strips, disks, conic sectors, etc • An LMI region R is defined as R = {z ∈ C : U + zV + z¯V ′ ≺ 0} .
– p. 61/129
LMI region intersections
62
• a large variety of regions can be represented this way • intersections of LMI regions are LMI regions
– p. 62/129
a short catalog of useful LMI regions LMI Regions
α
β
63
Characterization
fR (z) =
"
1 (z + z −α + 2 ¯)
0
0
(z + z ¯) β− 1 2
fR (z) =
"
i (z − z −α − 2 ¯)
0
0
¯) β + 2i (z − z
fR (z) =
»
−r q + z q+z ¯ −r
fR (z) =
»
sin θ(z + z ¯) cos θ(z − z ¯) cos θ(¯ z − z) sin θ(z + z ¯)
α
#
#
β
r −q
θ
–
–
– p. 63/129
Lyapunov theorem for LMI regions
64
• System d x = Ax has all its poles in LMI region R dt iff there exists X ≻ 0 s. t. U ⊗ X + V ⊗ (A′ X) + V ′ ⊗ (XA) ≺ 0 . is an LMI with respect to X. (⊗ is Kronecker product A ⊗ B := ((Aij B))) ➠
classical Lyapunov theorem with U = 0, V = 1
➠
intersection by diagonal augmentation of U , V .
other specs. can be combined by just merging LMI constraints – p. 64/129
LMI regions- proof of sufficiency
65
condition is X ≻ 0,
U ⊗ X + V ⊗ (A′ X) + V ′ ⊗ (XA) ≺ 0 .
pick an eigenpair of A, (λ, v), Av = λv, and pre- and post-multiply inequality by I ⊗ v ∗ , I ⊗ v, gives >0
Hence,
z }| { (v ∗ Xv) (U + λ∗ V + λV ′ ) < 0 U + λ∗ V + λV ′ < 0.
Implies λ∗ , λ are in R. – p. 65/129
multi-objective/channel controller synthesis
• • • • •
66
formulation linearizing change of variables state-feedback synthesis output-feedback synthesis projected form.
– p. 66/129
controller synthesis
67
• synthesis structure z1 z2
.. .
.. .
P
y
w1 w2
u K
• given P (s), find K(s) to achieve a set of specifications for channels w1 → z1 , w2 → z2 , ...
– p. 67/129
example of multi-channel/objective problem 68 min kTw21 ←z21 k2 1 ←z 1 k∞ < γ1 , kTw∞ ∞
kTw22 ←z22 k2 < γ2
poles in LMI region R .
– p. 68/129
controller synthesis - data
69
• synthesis interconnection d x = Ax + B w + B u, A ∈ Rn×n 1 2 dt P (s) z = C1 x + D11 w + D12 u y = C2 x + D21 w • controller ( d x = A x + B y, K K K K d t K(s) u = C K xK + D K y
AK ∈ Rn×n
• Stability, Perfo.: H∞ , H2 , pole plac. on various channels – p. 69/129
derivation ➠
compute closed-loop data
➠
write stability/performance (ineq.) conditions in closed loop
➠
apply congruence transformations
➠
use suitable linearizing transformations
70
– p. 70/129
controller synthesis - state-feedback
71
• turns out to be very simple problem n×n x ˙ = Ax + B w + B u, A ∈ R 1 2 P (s) z = C1 x + D11 w + D12 u y = x ←− measurable state vector
and
u = Kx ←− state-feedback
closed-loop data are x˙ = (A + B2 K)x + B1 w z = (C1 + D12 K)x + D11 w – p. 71/129
state-feedback H∞ synthesis • characterization is X ≻ 0 and ∗ (A + B2 K)′ X + ∗ −γI B1′ X C1 + D12 K D11
72
∗ ∗ ≺0 −γI
perform congruence transformation diag(Y = X −1 , I, I) to get Y ≻ 0 and (A + B2 K)Y + Y (A + B2 K)′ ∗ (C1 + D12 K)Y −γI ′ D11 B1′
∗ ∗ ≺ 0, −γI – p. 72/129
state-feedback H∞ synthesis continued
73
note Y is invertible perform change of variable W = KY to get LMI !: Y ≻ 0 and AY + Y A′ + B2 W + (B2 W )′ ∗ ∗ C1 Y + D12 W −γI ∗ ≺ 0. ′ −γI D11 B1′
• note change of variable is without loss (NSC) • when solved, deduce (state-feedback) controller using K = W Y −1 .
– p. 73/129
trick
74
• ⇐ (Y, KY ) solution → (Y, W ) easy • ⇒ (Y, W ) solution → (Y, KY ) note that term B2 W is B2 W Y −1 Y hence (Y, K = W Y
−1
) is a solution.
– p. 74/129
state-feedback H2 synthesis
75
• similar derivation • characterization (A + B2 K)′ X + ∗ + (C1 + D12 K)′ (C1 + D12 K) ≺ 0, Tr (B1′ XB1 ) < η 2 become via Schur complements ′ (A + B2 K) X + ∗ ∗ ≺0 (C1 + D12 K) −I ′ Z B1 2 0, Tr Z < η B1 X −1 – p. 75/129
state-feedback H2 synthesis
76
• perform congruence transformations diag(Y = X −1 , I) and diag(I, Y ) to get AY + B2 KY + ∗ ∗ ≺0 C1 Y + D12 KY −I Z B1′ Y 0, Tr Z < η 2 Y B1 Y
– p. 76/129
state-feedback H2 synthesis
77
• change of variable W = KY yields LMIs ! AY + B2 W + ∗ ∗ ≺ 0, C1 Y + D12 W −I with
Z Y B1
B1′ Y Y
0, Tr Z < η 2
– p. 77/129
state-feedback pole clustering
78
• similarly Y ≻ 0 and U ⊗ Y + V ⊗ (A + B2 K)Y + V ′ ⊗ Y (A + B2 K)′ ≺ 0 . change of variable W = KY leads to LMI!: Y ≻0 U ⊗ Y + V ⊗ (AY + B2 W ) + V ′ ⊗ (AY + B2 W )′ ≺ 0 .
– p. 78/129
multiple constraints
79
• the Y ’s are not the same for all perfs. • hard problem is relaxed by taking a single Y for all perfs. • technique is constantly refined to exploit different Y ’s by spec. (active area).
– p. 79/129
output feedback case - closed-loop data
80
• 2 4
A
B1
C1
D11
3
2
6 5 := 6 4
A
0
B1
0
0
0
C1
0
D11
3 2 7 6 7 +6 5 4
0
B2
I
0
0
D12
3
»
7 AK 7 5 CK
BK DK
–
2 4
0
I
0
C2
0
D21
3
5,
• Above analysis condition must be satisfied in closed-loop. Synthesis conditions in 3 steps 1- introduce a single variable P common specification/channel (conservative step), 2- perform adequate congruence transformations, 3- use linearizing changes of variables to end up with LMI synthesis conditions. – p. 80/129
linearizing change of variable
81
Introduce notation 2
P=4
X N′
N ⋆
3
5,
P
−1
2
=4
Y
M
M′
⋆
3 5
From PP −1 = I infer 2
PΠY = ΠX with ΠY := 4
Y M′
I 0
3
2
5 , ΠX := 4
I
X
0
N′
3
5.
Define change of variable (wlog N , M are invertible) 8 bK < A
:=
N AK M ′ + N BK C2 Y + XB2 CK M ′ + X(A + B2 DK C2 )Y,
:=
b K := CK M ′ + DK C2 Y, D b K := DK . N BK + XB2 DK , C
(1)
: b BK
and, perform congruence transformations to get
bK , B bK, C bK, D bK ! linear terms in the new variables X, Y, A – p. 81/129
LMI for H∞ specification 2
L11 6 6 A b K C2 )′ 6 bK + (A + B2 D 6 6 b K D21 )′ (B1 + B2 D 4 bK C1 Y + D12 C
where
b′ + (A + B2 D b K C2 ) A K L22 b K D21 )′ (XB1 + B b K C2 C1 + D12 D
82
∗
∗
∗
∗
−γI
∗
b K D21 D11 + D12 D
−γI
3
7 7 7 7≺0 7 5
b K + (B2 C b K )′ , L22 := A′ X+ XA + B b K C2 + (B b K C2 )′ . L11 := AY + YA′ + B2 C
• similarly for H2 and LMI region specs. • for multi- channel/objective just stack together various LMI specs.
– p. 82/129
LMI for H2 specification 2 6 6 4
b K + (B2 C b K )′ + B2 C ∗ b K C2 )′ b K C2 + (B b K C2 )′ + (A + B2 D A′ X+ XA + B bK b K C2 C1 Y + D12 C C1 + D12 D
83
∗
YA′
AY + bK A 2 6 6 4
I
X b K D21 )′ (XB1 + B b K D21 )′ (B1 + B2 D b K D21 = 0. Tr (Q) < ν, D11 + D12 D
"
Y I I X
3
−I
b K D21 B1 + B2 D 7 b XB1 + BK D21 7 5 ≻ 0, Q
I
Y
∗
#
3
7 7 ≺ 0, 5
≻0
– p. 83/129
LMI region constraint specification
84
• congruence diag(ΠY , . . . , ΠY ) yields h i i h b b AY +B2 CK A+B2 DK C2 Y I + ∗ ≺ 0. λjk I X + µjk bK b K C2 A XA+ B "
Y I I X
#
≻0
– p. 84/129
multiple constraints
85
• again for multiple constraints take the same X, Y bK B b K , . . . for all LMIs. and A
• controller construction: just reverse the change of variables
– p. 85/129
pure H∞ synthesis: projected characterization86 For a single objective, LMI can be simplified, Projection Lemma yields 2 4 2 4
NY
0
0
I
NX
0
0
I
3′
2
6 5 6 4 3′
2
6 5 6 4
YC1′
B1
C1 Y
−γI
D11
B1′
′ D11
−γI
XB1
C1′ ′ D11
AY +
A′ X+
YA′
XA
B1′ X
−γI
C1
D11
−γI
3
2
7 N 74 Y 5 0 3
2
0
Y
I
I
X
4 NY and NX null spaces of
B2′
′ D12
i
and
h
C2
I
7 N 74 X 5 0 2
h
0
D21
I
3
≺
0
3 5
≺
0
3 5
≻
0.
5
i ,
– p. 86/129
avantages of LMI formulations ➠
very general wrt DGKF, no assumptions required
➠
singular problems
➠
admits similar discrete-time counterpart
➠
has educational value for students (shorter proofs)
➠
See http://www.cert.fr/dcsd/cdin/apkarian/ for details
➠
See M ATLAB LMI Control Toolbox for codes.
87
– p. 87/129
uncertain systems analysis
88
• Lyapunov technique • Time-invariant and time-varying parameters • Parameter-dependent Lyapunov functions.
– p. 88/129
analysis of uncertain systems - example
89
Consider the uncertain system d x(t) = A(δ) x(t); dt
x(0) = x0
➠
δ = [δ1 , . . . , δL ]′ ∈ RL uncertain and possibly time-varying real parameters
➠
A(δ) = A0 + δ1 A1 + . . . + δL AL
δ(t)
˙ δ(t)
is the system stable for all admissible δ(t) ? – p. 89/129
Affine Quadratic Stability (AQS)
90
The system is Affinely Quadratically Stable, if ∃ V (x, δ) := x′ P (δ)x,
P (δ) = P0 +δ1 P1 +. . .+δL PL
s. t. V (x, δ) > 0, dV /dt < 0 along all admissible parameter trajectories. • Lyapunov theory ⇒ (exponential) stability. P (δ) := P0 + δ1 P1 + . . . + δL PL L(δ, d δ) := A(δ)′ P (δ) + P (δ)A(δ) + dt
> 0 dP (δ) dt
< 0
• turned into LMIs ⇒ multi-convexity, S-procedure ,... ! – p. 90/129
analysis - continued • cases Time-Invariant Parameters Arbitrary rate of variation (quad. stab.)
• extensions H∞ , H2 , LMI regions,...
91
• components LFT uncertainties nonlinear components (IQC theory, (Rantzer & Megretsky) µ analysis
– p. 91/129
gain-scheduling and LPV control
92
• motivations and concepts • classes of LPV system • synthesis conditions for LFT systems
– p. 92/129
motivations #1
93
➠
handle full operating range
➠
gain-scheduled controllers exploit knowledge on the plant’s dynamics in real time knowledge on plant
measurement signal
controller dynamics
control signal
controller mechanism is changed during operation
– p. 93/129
motivations #2
94
Gain-Scheduling techniques are applicable to • Linear Parameter-Varying Systems (LPV): d x = A(θ)x + B(θ)u , dt y = C(θ)x + D(θ)u. where θ := θ(t) is an exogenous variable. • “ Quasi-Linear” Systems: d x = A(y )x + B(y )u , sche sche dt y = C(ysche )x + D(ysche )u. where ysche is a sub-vector of the plant’s output y. – p. 94/129
motivations #3 ➠
to get higher performance
➠
some LPV system are not stabilizable via a fixed LTI controller
➠
bypass critical phases of pointwise interpolation and switching
➠
engineering insight is preserved (freeze scheduled variable for analysis).
➠
nonlinear models can be handled by immersion into an LPV plant.
95
– p. 95/129
LPV systems in practice ➠ Aeronautics (longitudinal motion of aircraft) 0 α˙ −Zα 0 α az −Zα V + = δ, = q˙ −mα 0 q mδ q 0
96
α 0 , q 1
where Zα , mα and mδ are functions of speed, altitude and angle of attack. ➠ Robotics (flexible two-link manipulator) M (θ2 )¨ q (t) + Dq(t) ˙ + Kq(t) = F u(t), where θ2 is the scheduled variable (conf. of 2nd beam). ➠ and many others
– p. 96/129
example: different control principles -d 6
-d 6
-d 6
- K
- P (θ)
97
y
-
Robust control
?
θ
- K(θ)
- P (θ)
y
-
LPV control ?
- K(y)
y - P (y)
y
-
Output gain-scheduling
– p. 97/129
description of LPV systems
98
• LPV systems x˙ = A(θ)x + B(θ)u , y = C(θ)x + D(θ)u. are characterized by
A() B() ➠ the functional dependence of on θ, C() D() ➠
the operating domain Θ of the system trajectories, θ(t) ∈ Θ,
➠
the rate of variations of θ(t) (if available) in the form of bounds θ˙i (t) ∈ [θi ; θ¯i ]. – p. 98/129
LPV / LTV / LTI systems LPV
99
θ(t) ∈ Θ ∀t ≥ 0
select trajectory
freeze parameter
θ = θ0
θ(t) = θ∗ (t)
LTV
LTI freeze the time
t = t0
– p. 99/129
LPV / LTV / LTI - off-line vs. on-line
100
➠
LTI and LTV systems are off-line systems, the state-space data A, B,... and A(t), B(t),... must be known in advance.
➠
LPV systems are on-line systems since the dynamics depend on the trajectory θ(t) experienced by the plant in Θ. Θ
θ(t) – p. 100/129
LPV systems interpretations x˙ = A(θ)x + B(θ)u, y = C(θ)x + D(θ)u.
101
θ(t) ∈ Θ
• θ may be subject to various assumptions: ➠
θ(t) is uncertain → robust control problem,
θ(t) is known in real-time → Gain-scheduling problem, θ1 (t) ➠ θ(t) := , where θ1 is known and θ2 is θ2 (t) uncertain → mixed problem ➠
– p. 101/129
LPV pathologies - LPV/LTI stabilities
102
• stability over a domain ➠
LTI Stability : Reλi (A(θ)) < 0, ∀θ ∈ Θ,
➠
LPV Stability : Φθ (t) → 0, for t → ∞, for all trajectory θ(t) in Θ.
• intuitive conjectures like ➠
LTI stability ⇒ LPV stability,
➠
LPV stability ⇒ LTI stability,
are FALSE !
– p. 102/129
LPV vs. LTI Stability • Conjecture #1 −1 + aθ12 x˙ 1 = x˙ 2 −1 + aθ1 θ2
1 + aθ1 θ2 −1 + aθ22
103
x1 x2
,
with trajectories θ1 := cos(t) and θ2 (t) := sin(t) is LTI stable (for a < 2) but LPV unstable. • Conjecture #2 2 −1 − 5θ1 θ2 1 − 5θ1 x˙ 1 x1 = , 2 x˙ 2 x2 −1 + 5θ2 −1 + 5θ1 θ2 with trajectories θ1 := cos(t) and θ2 (t) := sin(t) is LTI unstable (poles +1 and −3) but LPV stable. – p. 103/129
creating an LPV stability
104
consider the autonomous LPV system: x¨ + ω 2 (t)x = 0 , where we are allowed to switch between two values ω1 and ω2 . x2
system trajectories
switch
ω1 ω2
x1
unstable behavior
– p. 104/129
Slowly Varying LPV Systems x˙ = A(θ)x
105
• Sufficient stability cond. • Sufficient instability cond. (1) Reλi (A(θ)) < 0, (1) Reλi (A(θ)) < 0, ˙ < α, with α sufficiently i = 1, . . . , k (2) kθk small, (2) Reλi (A(θ)) > 0, ⇒ LPV stability (Rosen. 63) i = k + 1, . . . , n (3) stable and unstable eigenvalues do not mix ˙ < α, with α sufficiently (4) kθk small, ⇒ LPV instability (Skoog 72)
– p. 105/129
slowly varying parameters
106
LPV stability can be inferred from LTI stability for slowly varying parameters (but not constructive conditions).
– p. 106/129
LPV systems in the LFT class
107
diag(θi Iri )
A Bθ B Cθ Dθθ Dθ• C D•θ D
y
»
A(θ) C(θ)
B(θ) D(θ)
–
:=
»
A C
B D
–
»
Bθ + D•θ
–
u
Θ(I − Dθθ Θ)−1 [ Cθ
Dθ• ] , – p. 107/129
LPV systems in the polytopic class θ ∈ Θ , Θ Polytope
»
A(θ) C(θ)
B(θ) D(θ)
–
∈ Cov
»
Ai Ci
Bi Di
–
108
, i = 1, 2, . . . , r
ff
θ(t)
»
y
A1 B1 C1 D1
»
–
A2 B2 C2 D»2
– Ai Bi Ci Di
u –
– p. 108/129
general LPV systems
109
A(θ), B(θ), C(θ), D(θ) are arbitrary but continuous matrix-valued function of θ. • far more difficult to handle but of great practical interest since they capture arbitrary nonlinearities
– p. 109/129
formulation of synthesis problem: LFT
110
Θ
w
z
P (s) y
u
K(s)
Θ
gain−scheduled controller – p. 110/129
formulation of synthesis problem: LFT
111
find LPV controller Fl (K(s), Θ(t)) s.t. ➠
closed-loop stability,
➠
the L2 -induced norm of the operator Tw→z satisfies kTw→z (Θ)k < γ
for all admissible trajectory θ(t).
– p. 111/129
LPV-LFT systems: notations
112
•
Dθθ P (s) = D1θ D2θ
Dθ1 D11 D21
Dθ2 Cθ D12 + C1 (sI−A)−1 [ Bθ D22 C2
B1
B2 ] ,
Assumptions: (A, B2 , C2 ) stabilizable and detectable, D22 = 0. Cθ b Dθθ Dθ1 b b Notations: B1 = [ Bθ B1 ], C1 = , D11 = , C1 D1θ D11 T T 0 ], D12 NY := Ker [ B2T Dθ2 NX := Ker [ C2 D2θ D21 0 ].
– p. 112/129
LPV-LFT systems - proof scheme synthesis structure
113
synthesis structure with parameter augmentation
Θ(t) zθ
wθ
z
w
P(s)
y
w ˜θ
Θ(t)
0
0
Θ(t)
zθ
u z
z˜θ
wθ w
P(s)
K(s)
Pa (s) z˜θ
w ˜θ Θ(t)
y
y˜
u K(s)
u ˜
– p. 113/129
LPV-LFT systems - proof scheme
114
➠
redraw the control configuration into a robust control problem with repeated uncertainty,
➠
formulate the Bounded Real Lemma with scalings for the closed-loop system,
➠
apply the Projection Lemma to derive the LMI characterization.
– p. 114/129
LMI characterization
NYT
2 AY + Y AT 6 Cθ Y + Γ3 BθT 6 6 C1 Y 6 4 Σ BT B1T
⋆ T −D Γ −Σ3 + Γ3 Dθθ θθ 3 −D1θ Γ3 T Σ3 Dθθ T Dθ1
⋆ ⋆ −γI T Σ3 D1θ T D11
⋆ ⋆ ⋆ −Σ3 0
⋆ 3 ⋆ 7 7 ⋆ 7 7 NY ≺ 0, ⋆ 5 −γI
2 AT X + XA 6 BθT X + T3 Cθ 6 6 B1T X 6 4 S3 Cθ C1
⋆ T T −S3 + T3 Dθθ − Dθθ 3 T T −Dθ1 3 S3 Dθθ D1θ
⋆ ⋆ −γI T S3 Dθ1 D11
⋆ ⋆ ⋆ −S3 0
⋆ 3 ⋆ 7 7 ⋆ 7 7 NX ≺ 0, ⋆ 5 −γI
3
T NX
115
θ
»
Y I
I X
–
0
S3 ≻ 0, Σ3 > 0; T3 , Γ3 skew − symmetric .
– p. 115/129
scaling sets asso. with structure Θ ⊕ Θ
116
• symmetric SΘ := {S : S > 0, SΘ = ΘS} • symmetric augmented S1 S2 SΘ⊕Θ = { T : S1 , S2 ∈ SΘ and S2 Θ = ΘS2 , ∀Θ ∈ Θ}. S2 S3 • skew-symmetric T1 T2 TΘ⊕Θ = { : T1 , T2 ∈ TΘ and T2 Θ = ΘT2 , ∀Θ ∈ Θ}. T −T2 T3
– p. 116/129
robust synthesis condition
ATcℓ Xcℓ + Xcℓ Acℓ B T Xcℓ + T Ccℓ cℓ Ccℓ
T T Xcℓ Bcℓ + Ccℓ T T T −S + T Dcℓ + Dcℓ T Dcℓ
where
117
T Ccℓ T ≺ 0 Dcℓ −S −1
➠
Acℓ, Bcℓ , . . . closed-loop data
➠
S, T scalings for Θ ⊗ Θ ⊗ ∆, and ∆ fictitious performance block.
– p. 117/129
cast in Projection Lemma form
118
Can be rewritten Ψ + QTX ΩP + P T ΩT QX ≺ 0, where
T
A Xcℓ + Xcℓ A Ψ = B1T Xcℓ + T C1 C1
P = [ C2
D21
0],
C1T T T
C1T T D11 −1
Xcℓ B1 + , −S + T D11 + D11 T T D11 −S T T T T QX = B2 Xcℓ D12 T D12 .
– p. 118/129
cast in Projection Lemma form continued 2
2
A 4 C1 C2
B1 D11 D21
6 6 6 6 3 6 6 B2 6 6 5 D12 = 6 6 6 ΩT 6 6 6 6 4
A
0
0
Bθ
B1
0
B2
0
0
0
0
0
0
I
0
0
0
0
0
0
0
0
0
I
Cθ
0
0
Dθθ
Dθ1
0
Dθ2
0
C1
0
0
D1θ
D11
0
D12
0
0
I
0
0
0
AT K
T CK1
T CKθ
C2
0
0
D2θ
D21
T BK1
T DK11
T DKθ1
0
0
I
0
0
T BKθ
T DK1θ
T DKθθ
and b1 = [ Bθ B
B1 ] ,
b1 = C
»
– Cθ , C1
b 11 = D
»
Dθθ D1θ
Dθ1 D11
–
119
3 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 5
.
• LMI characterization follows from explicit computation of projections and using matrix completion Lemmas.
– p. 119/129
LPV-LFT systems - controller Construction120 ➠
Testing solvability falls within the scope of convex semi-definite programming
➠
A gain-scheduled controller is easily constructed from the quadruple (Y, X, L3 , J3 ) by solving a scaled Bounded Real Lemma LMI condition. u
y 3 AK BK1 BKθ 4 CK1 DK11 DK1θ 5 CKθ DKθ1 DKθθ 2
Θ(t) – p. 120/129
other variants of this technique ➠
polytopic LPV systems
➠
general LPV systems (capture slow variations of parameters)
➠
LFT systems ang generalized scalings
➠
multi-objective/channel LPV synthesis
121
see webpage: http://www.cert.fr/dcsd/cdin/apkarian/
– p. 121/129
hard non-LMI problems
122
• most analysis problems reduce to LMIs • some synthesis problems reduce to LMIs but • many practical problems do not reduce to LMI/SDP (synthesis) ➠
reduced- and fixed-order synthesis (PID H∞ , etc.)
➠
structured and decentralized synthesis problems
➠
general robust control with uncertain and/or nonlinear components
➠
simultaneous model/controller design, multimodel control
➠
unrelaxed LTI and LPV multi-objective
➠
combinations of the above – p. 122/129
new algorithms for hard problems
123
new algorithms needed ! good research direction
– p. 123/129
example: synthesis of static controller stabilize x˙ = Ax + Bu y = Cx with u = Ky (K static )
124
has characterization NC′ (A′ X + XA)NC < 0 ′ ′ ′ (Y A + AY )N < 0 N ′ B B X I > 0 I Y XY − I = 0
constraints XY − I = 0 leads to hard problems LMI + nonlinear equality constraints
– p. 124/129
augmented Lagrangian method
125
with g(x) = 0 equ. constraints and A(x) ≺ 0 LMI, replace the difficult program by the more convenient (Pλ,µ )
minimize c′ x + λ′ g(x) + µ1 kg(x)k2 subject to A(x) 0
➠
µ is penalty, xµ → x∗ when µ → 0
➠
for good estimates λ (Lagrange multiplier), solution of (Pλ,µ ) is close to solution of original problem
➠
use first-order update rule to improve estimate λ
➠
solve (Pλ,µ ) by a succession of SDPs – p. 125/129
augmented Lagrangian method
➠
B. Fares and P. Apkarian and D. Noll, IJC, 2001
➠
B. Fares and D. Noll and P. Apkarian , SIAM Cont. Optim. 2002
➠
P. Apkarian and D. Noll and H. D. Tuan, 2002, IJRNC to appear.
➠
D. Noll and M. Torki and P. Apkarian, working paper, 2002
126
– p. 126/129
conclusions, perspectives
127
➠
A single framework for a great variety of methods
➠
LMI techniques extend the scope of classical techniques
➠
LPV control is a very successful example (industrial)
➠
Analysis meth. immediately applicable for validation
➠
Have educational merits see http://www.cert.fr/dcsd/cdin/apkarian/ for course plan
➠
not discussed: robust filtering and estimation, combinatorial optimization, graphs, etc. – p. 127/129
recent concrete control applications ➠
Analysis robustness evaluation of controllers for: ➟ ARIANE Launcher ➟ satellites ➟ long flexible civil aircraft (structural modes)
➠
Synthesis Preliminary tests show that LPV controllers are competitive for launcher control in atmospheric flight
➠
Synthesis control of the landing phase for civil aircraft under study with multiobjective LMI methods
➠
Synthesis Missiles ? still on paper
128
– p. 128/129
The End
129
GRAZIE MILLE !
– p. 129/129