Quantiles and Convexity

always claimed he was a Mechanician (in the academic sense) and he devel- ... Let P be a probability law on R. If necessary X will denote a random variable ...
308KB taille 46 téléchargements 292 vues
Quantiles and Convexity Michel Valadier Universit´e Montpellier 2, Case courrier 051 place Eug`ene Bataillon, 34095 Montpellier cedex 5, FRANCE 6 f´evrier 2015 Abstract A real random variable admits median(s) and quantiles. These values minimize convex functions on R. This is known. We discuss these results and their relationship with some notions about functions of bounded variation developed by J.J. Moreau in his mathematical works in view of the mechanical phenomena which are shocks and friction. Specially filled-in graphs and Stieltjes measures of products of BV functions. Keywords. Median, quantile, convexity, subdifferential, monotone operator, minimizer, BV function, Jean Jacques Moreau. Classification. 62E99, 52A99

1

Remembrance

I began research in Mathematics under the supervision of Robert Pallu de `re (and with the companionship of Charles Castaing). In the La Barrie middle of the sixties Pallu de La Barri`ere foreboded the future importance of convexity. Around 1967 I received from Castaing a copy of Fonctionnelles convexes (Coll`ege de France) by Jean Jacques Moreau [Mor2]. An impressive basic fundamental text! Some time later (in October 1970) we all three (Moreau, Castaing and me) were together in Montpellier for more than a decade, at the big time of the S´eminaire d’Analyse Convexe. J.J. Moreau was not only mathematician (and a very good one): he always claimed he was a Mechanician (in the academic sense) and he developed Mathematics in view of Mechanics. But in Mathematics he had a taste for generality and avoided useless hypotheses: for examples in [Mor2] the framework was (not necessarily locally convex) infinite dimensional topological vector spaces, and in [Mor4] he proved basic results about BV functions 1

with values in a metric space whereas he needed only results in a Hilbert space (BV abbreviates bounded variation)! This paper provides “Convex Analysis” proofs of an elementary result of Statistics. We discuss the hypotheses and involved structures and invoke some favorite notions of J.J. Moreau, specially concerning BV functions. We follow a logical way which throws light on the naturalness of the result. I began this paper (for a first version see [V4]) being unaware of Koltchinskii [Kol]. This 1997 work is more devoted to the multivariate case, as well as the 1996 paper by Chaudhuri [C]. See some other comments in Section 4.

2

Definitions and notations

Let P be a probability law on R. If necessary X will denote a random variable obeying the law P. The law P is of order p (p ∈ {1, 2}) if X is of order p, which writes X ∈ Lp . If no integrability condition on X is assumed, specialists write X ∈ L0 (order 0). When P is of order 1 the mean E(X) does exist. When P is of order 0 there exists a non-empty compact interval of medians (the definition of a median is included in that of a quantile below). Let F be the right-continuous distribution function: F (x) := P(]−∞, x]) = P(X ≤ x) . For τ ∈ ]0, 1[ (usually in Statistics 0.95 or 0.99 etc.) a real number q¯ is a τ -quantile if P(X ≤ q¯) ≥ τ and P(X ≥ q¯) ≥ 1 − τ or maybe more plainly P(X < q¯) ≤ τ ≤ P(X ≤ q¯) (note that x 7→ P(X < x) is the left-continuous version F − of F ). When τ = 1/2 one recovers median(s). A geometrical definition of τ -quantiles is the following: their set is the projection on R of the intersection of R × {τ } with the filled-in graph 1 G of F . That is, the set of τ -quantiles is {¯ q ∈ R ; (¯ q , τ ) ∈ G} . 1

This graph is obtained from the graph of F by adding vertical segments when there are gaps, thus obtaining an arcwise connected curve: G = {(x, y) ; x ∈ R and y ∈ [F − (x), F (x)]} . This comes from Monteiro Marques and Moreau’s works on the sweeping process: [MM1, p.147], [MM2, p.15], some ideas coming back to [Mor3]. In Section 4 the filled-in graph will be seen as a maximal monotone operator.

2

This geometrical definition operates when τ = 0 or 1. For example the set of 1-quantiles is the closed, possibly empty, interval {q ∈ R ; F (q) = 1} and the set of 0-quantiles is the closed, possibly empty, interval {q ∈ R ; F − (q) = 0}. A real number candidate to be the mean or a median will be denoted by m. A candidate to be a quantile will be denoted by q.

3

Medians and quantiles as minimizers

Well known2 is the result: If P is of order 1, m is a median if and only if it minimizes on R the function Z m 7→ |x − m| dP(x) . R

Surprising are: the assumption about order 1, and the fact that medians depend only on the structure of ordered space of R and not on its metric (nor on its group structure, nor on Haar measure). An “answer”, at least relatively to the “order 1” assumption, is the following. Well known too3 is the result: If P is any law, x 7→ |x − m| − |x| is P-integrable (obviously |x − m| − |x| ≤ |m|) and m is a median if and only if it minimizes on R the function Z  m 7→ |x − m| − |x| dP(x) . R

The notion of median extends to Rd and to Banach spaces: see [Kem, Kol, MD] and there exist conditional medians [V3]; Kemperman [Kem] and Milasevic & Ducharme [MD] gave in the multivariate case a sufficient condition implying uniqueness of the median (see already in 1948 Haldane [Ha]). These questions will not be considered here. The interest of medians comes from robustness, i.e., stability with respect to outliers values. Maybe the notions of means in metric spaces going back to Fr´echet (see [Fr] and many other papers by the same author) should be revisited. See the papers by Armatte [A1, A2], the first one containing more than six pages of Fr´echet’s references. Numerous authors studied random variables with values in a metric space: for instance [BH, RF]. 2 In French textbooks: [DCD1, Exercice E.11 p.71] (solution page 28 of [DCD2]), and Th´eor`eme 10.1 page 93 in [FF]. See also [Fe1, ex.1.8.2 p.51] and its solution in [Fe2]. 3 Exercises in French Universities and surely in many countries...

3

Now we turn to quantiles. T.S. Ferguson [Fe1, Exercise 1.8.3 p.51, solution in [Fe2]] says that4 : If τ ∈ ]0, 1[ and P is of order 1, q is a τ -quantile if and only if it minimizes on R the function Z q 7→ ρτ (x − q) dP(x) where (1)

1 1 ρτ (x) = |x| + τ − x= 2 2

( (τ − 1) x τx

if x ≤ 0, if x ≥ 0.

My purpose here is to provide a Convex Analysis proof of the extension to any probability law following a logical way which makes clear the naturalness of the result. See also Koltchinskii [Kol, Example 2.2 p.440].

4

A Convex Analysis point of view

The filled-in graph G defined in Section 2 is a subset of R2 which is a maximal monotone operator [RW, Chapter 12] (and therefore a maximal cyclically monotone one because of dimension 1, cf. [RW, 12.6 pp.547–548]). It is known ([RW, Th.12.25 p.547]) that a maximal cyclically monotone graph is the graph of the subdifferential (definition recalled below) of a l.s.c. convex function (unique up to an additive constant). This is due to Rockafellar; for comments see the alinea pages 575–576 of the first edition of [RW] (a small shift occurs in further editions) where Moreau is quoted and credited for his famous 1965 paper [Mor1]. Here we will get such a function in a direct way. Let F be the primitive (antiderivative) of F null at 0: Z x   F (u) du if x ≥ 0,  0Z F(x) = 0   F (u) du if x < 0 . − x

Since F is nondecreasing F is convex. The subdifferential of F at x is ∂F(x) = {` ∈ R ; ∀h ∈ R, ` h ≤ F(x + h) − F(x)} . Lemma 1 The graph of the multifunction x 7→ ∂F(x) is nothing else but the filled-in graph G of the graph of F : G = {(x, `) ∈ R2 ; ` ∈ ∂F(x)} . 4

This result is appreciated by specialists: [Koe, first chapter] and already [KB, p.38].

4

Proof. Let F0 (x; w) denote the directional derivative of F at x in the direction w. There holds ∂F(x) = [−F0 (x; −1), F0 (x; 1)] . Then, since F is right continuous, F(x + h) − F(x) h&0 h Z x+h 1 F (u) du = lim h&0 h x = F (x)

F0 (x; 1) = lim

and, since F is as well the primitive of F − , F(x − h) − F(x) h&0 h Z 1 x = − lim F − (u) du h&0 h x−h

F0 (x; −1) = lim

= −F − (x) . Therefore ∂F(x) = [F − (x), F (x)] . 

In such a situation a natural question is: what does the minimisation of F give? Obviously the infimum may be −∞: this is why we cannot consider Z x F (u) du. But with a slope τ belonging to ]0, 1[ the function −∞

q 7→ F(q) − τ q does achieve minimum(s) and q¯ is a minimum if and only if 0 ∈ −τ + ∂F(¯ q ). This is equivalent to (¯ q , τ ) ∈ G, that is, q¯ is a τ -quantile. Scholium Let F denote a primitive of F . For τ ∈ ]0, 1[, (2)

q¯ is a τ -quantile ⇐⇒ q¯ minimizes q 7→ F(q) − τ q .

Next Lemma is an integration by parts result. We will indicate its connexion with Moreau [Mor4, Section 11] and give other references in the Appendix. 5

Lemma 2 Let a < b in R. Then (note the half-open interval ]a, b]) Z Z b (3) x dP(x) = b F (b) − a F (a) − F (x) dx . ]a,b]

a

Proof (from Schilling [S1, Exercise 13.13 p.133] and its solution on the Net [S2, pp.10–12]). For the product of the Lebesgue measure (we denote it by dx) and of P (dx ⊗ dP)(]a, b]2 ) = (b − a) [F (b) − F (a)] .

(4) And moreover

(dx ⊗ dP)(]a, b]2 ) = ZZ ZZ = 1]a,b] (x) 1]x,b] (y) dx dP(y) + 1]a,b] (x) 1]a,x] (y) dx dP(y) ZZ ZZ Tonelli = 1]a,b] (x) 1]x,b] (y) dx dP(y) + 1]a,b] (y) 1[y,b] (x) dx dP(y) Z Z [F (b) − F (x)] dx + (b − y) dP(y) = ]a,b]

]a,b]

Z (5)

= (b − a) F (b) −

b

Z F (x) dx + b [F (b) − F (a)] −

a

y dP(y) . ]a,b]

Comparing (4) and (5) one gets (3).  Now let5 (ρτ is defined in (1) above) Z  (6) Φ(q) := ρτ (x − q) − ρτ (x) dP(x) R

(note that the function x 7→ ρτ (x − q) − ρτ (x) is bounded on R). Theorem 1 Let τ ∈ ]0, 1[ and P be a law on R. There holds ( R −τ q + ]0,q] F (x) dx if q ≥ 0, R Φ(q) = −τ q − ]q,0] F (x) dx if q < 0, that is, Φ(q) = F(q) − τ q. The function Φ is convex and inf-compact. The value q¯ is a τ -quantile if and only if it minimizes the function Φ. 5

Koltchinskii [Kol] considers the same functional. See specially two lines after his formula (1.1) page 436 where he sets fP,t (s) := fP (s) − s t: there s is my variable q and t my threshold τ . Comparisons are not easy. He also uses (on pp.439–440) a multivalued integration result [IT, Th.8.3.4]. I got similar results earlier in [V1, V2] but Ioffe and Tihomirov published also some papers on these questions before their book.

6

Comments. Part 3) below is a bit technical. Surely careless calculus would be quicker: the formal calculus Z  d ρτ (x − q) − ρτ (x) dP(x) Φ0 (q) = R dq Z q Z +∞ = (1 − τ ) dP + (−τ ) dP −∞ q  = (1 − τ ) F (q) − τ 1 − F (q) = F (q) − τ leads to Φ0 (q) = 0 ⇐⇒ F (q) = τ ! Proof. 1) Firstly x 7→ ρτ (x − q) − ρτ (x) is less than τ |q| if τ ≥ 1/2 or than (1 − τ ) |q| if τ ≤ 1/2. Or more roughly ∀x, |ρτ (x − q) − ρτ (x)| ≤ |q|. So the integral in (6) is well defined. 2) The function ρτ defined by (1) being convex, the function q 7→ ρτ (x − q) − ρτ (x) is also convex, hence the convexity of Φ. 3) We now prove Φ(q) → +∞ as |q| → +∞. For q ≥ 0,   if x ∈ ]−∞, 0] , (1 − τ ) q ρτ (x − q) − ρτ (x) = (1 − τ ) q − x if x ∈ ]0, q] ,   −τ q if x ∈ ]q, +∞[ , and, for q < 0,   (1 − τ ) q ρτ (x − q) − ρτ (x) = x − τ q   −τ q

7

if x ∈ ]−∞, q] , if x ∈ ]q, 0] , if x ∈ ]0, +∞[ .

Hence, if q ≥ 0 (we will apply formula (3) between lines 2 and 3), Z  Φ(q) = (1 − τ ) q P(]−∞, 0]) + (1 − τ ) q − x dP(x) − τ q P(]q, +∞[) ]0,q] Z x dP(x) − τ q P(]q, +∞[) = (1 − τ ) q P(]−∞, q]) − ]0,q] Z = (1 − τ ) q P(]−∞, q]) − [x F (x)]q0 + x F (x) dx − τ q P(]q, +∞[) ]0,q] Z x F (x) dx − τ q (1 − F (q)) = (1 − τ ) q F (q) − q F (q) + ]0,q] Z (7) = −τ q + F (x) dx . ]0,q]

When q → +∞ the mean value of F over ]0, q] ultimately exceeds R any value in ]τ, 1[, so Φ(q) → +∞. (More plainly for q > τ , Φ(q) = ]0,τ ] F (x) −   R τ dx + ]τ,q] F (x) − τ dx and F (x) − τ → 1 − τ > 0 as x → ∞.) On the other hand, if q < 0, (we again apply (3) between lines 2 and 3), Z Φ(q) = (1 − τ ) q P(]−∞, q]) + (x − τ q) dP(x) − τ q P(]0, +∞[) ]q,0] Z = (1 − τ ) q P(]−∞, q]) + x dP(x) − τ q P(]q, +∞[) ]q,0] Z 0 = (1 − τ ) q P(]−∞, q]) + [x F (x]q − x F (x) dx − τ q P(]q, +∞[) ]q,0] Z = (1 − τ ) q F (q) − q F (q) − x F (x) dx − τ q (1 − F (q)) ]q,0] Z (8) = −τ q − F (x) dx . ]q,0]

When q → −∞ the mean value of F over ]q, 0] ultimately passes under any value in ]0, τ [, so Φ(q) → +∞. Thus the finite valued convex function Φ is inf-compact, so it achieves its infimum over a non-empty compact interval. By (7) and (8) Φ(q) = F(q) − τ q . From (2) the minimizers are the τ -quantiles. 

8

Corollary 1 Let τ ∈ ]0, 1[ and P be a first order law on R then Z ρτ (x − q) dP(x) . q¯ is a τ -quantile ⇐⇒ q¯ minimizes q 7→ R

Proof. The term

Z ρτ (x) dP(x) R

is finite and does not depend on q. Hence one can substract it from the right-hand side of (6). 

5

A direct proof

Quantiles are basic knowledge for Statisticians, and their existence is not a problem. Then their characterization as minimizers is an exercise. Classically the convex function Φ on R achieves a minimum at q¯ if and only if the left and right derivatives of Φ at q¯ are respectively ≤ 0 and ≥ 0. Theorem 2 Let τ ∈ ]0, 1[ and P be a law on R. Let Φ defined by (6). Its right derivative at q¯ is ≥ 0 if and only if P(]−∞, q¯ ]) is ≥ τ . The derivative of Φ at q¯ in direction −1 is ≥ 0 if and only if P(]−∞, q¯[) is ≤ τ . Proof. Let ϕx (q) := ρτ (x − q) − ρτ (x) Z so that Φ(q) =

ϕx (q) dP(x). R

a) We are going to check (right derivation under the integral sign) Z 0 Φ (¯ q ; 1) = (ϕx )0 (¯ q ; 1) dP(x) . R

Let (hn )n a sequence in ]0, +∞[ decreasing to 0. There holds Z Φ(¯ q + hn ) − Φ(¯ q) ϕx (¯ q + hn ) − ϕx (¯ q) Φ0 (¯ q ; 1) = lim = lim dP(x) . n n hn hn R The last integrand decreases to (ϕx )0 (¯ q ; 1) and Lebesgue’s theorem applies6 thanks to the inequalities −[ϕx (¯ q − 1) − ϕx (¯ q )] ≤

ϕx (¯ q + h0 ) − ϕx (¯ q) ϕx (¯ q + hn ) − ϕx (¯ q) ≤ . hn h0

6 The monotone convergence theorem applies, but due to the non-increasing property of the sequence, one shoud observe that the whole sequence is bounded from above by the first term, say the one with n = 0. See [Kol, Section 2 page 439] for quoting monotone convergence.

9

Now it is easy to calculate ( −τ (ϕx ) (¯ q ; 1) = −(τ − 1) 0

if x ∈ ]¯ q , +∞[ , if x ∈ ]−∞, q¯] .

Hence Φ0 (¯ q ; 1) = (1 − τ ) P(]−∞, q¯]) + (−τ ) P(]¯ q , +∞[) = P(]−∞, q¯]) − τ . Thus the right derivative is ≥ 0 if and only if P(]−∞, q¯]) is ≥ τ . b) Similarly7 the directional derivative in the direction −1 is ≥ 0 if and only if P(]−∞, q¯[) is ≤ τ . 

6

Mean or center of mass

This Section continues our discussion about hypotheses. These are obvious facts. If P is of order 2 the mean E(X) is the unique m in R minimizing Z 2 E(|X − m| ) = |x − m|2 dP(dx) . R

This comes from the Huygens formula E(|X − m|2 ) = |m − E(X)|2 + v(X) (in Mechanics this is a formula about moments) where v(X) is the variance: in L2 (Ω) the random variables X − E(X) and the constant (E(X) − m) 1Ω are orthogonal. As in Section 3 assuming “P is of order 2” may seem too much since the mean exists as soon as “P is of order 1”. But the very elementary following holds. If P is only of order 1 the function x 7→ |x − m|2 − x2 is P-integrable and it remains that m is the mean if and only if it minimizes Z  |x − m|2 − x2 dP(dx) . R

Indeed |x − m|2 − x2 = m2 − 2 m x and so Z  2 |x − m|2 − x2 dP(dx) = m2 − 2 m E(X) = m − E(X) − E(X)2 . R 7

Minus signs are always more perilous. In the multivariate case one works with linear vector space and this technicality disappears.

10

For other means in metric spaces, see Fr´echet [Fr]. Instead of substracting a function independent of m, Fr´echet uses limits.

Appendix In [Mor4, Section 11] Moreau studied carefully the action of a bilinear form on BV functions. In particular, he provided a systematic development of more ancient results. Here is one statement by J.J. Moreau. Essential for understanding is that x and y are BV (or even locally BV) functions defined on the real interval I, Φ is a bilinear mapping and d is for the differential (or Stieltjes) measure obtained from a BV function. (In [Mor4] x, y and Φ are Banach-valued.) Corollary 11.4: For every [a, b] ⊂ I, under Convention 3.5 if a or b happen to equal an end of I, one has Z Z − Φ(dx, y ) + Φ(x+ , dy) = Φ(x+ (b), y + (b)) − Φ(x− (a), y − (a)) (11.9) [a,b]

[a,b]

and analogous formulas corresponding to (11.2) and (11.3). We are going to deduce Lemma 2 from (11.9): I = R, the BV functions are real valued, Φ is the usual product of real numbers. More precisely x = F (the cumulative function of P) and y = idR . So (11.9) rewrites Z Z y dF + F dy = b F (b) − a F − (a) . [a,b]

[a,b]

R

Substracting the integral {a} y dF from both sides one recovers (3). For the curious connoisseurs the corollary above follows from Proposition 11.1 p.39 and specially from formula (11.1) which asserts dΦ(x, y) = Φ(dx, y − ) + Φ(x+ , dy) the other formulas (11.2) and (11.3) being dΦ(x, y) = Φ(dx, y + ) + Φ(x− , dy) dΦ(x, y) = Φ(dx,

y+ + y− x+ + x− ) + Φ( , dy) . 2 2

These formulas are related to Integration by parts results. We could refer to the 1976 paper by Rockafellar [Ro, Prop.1 pp.161–162] which relies on the 1966 book [AB, Prop.8.5.5 pp.374–375] (these two references 11

are in [Mor4]). Hildebrandt in 1963 gave a lot of Integration by parts results [Hi, pages 53, 86, 93, 127 and 342]. And the first who proved an integration by parts result for general functions was K¨onig [K¨on] in 1897! About K¨ onig see the remark on page 76 of R.M. Dudley & R. Norvaiˇsa http://www.maphysto.dk/publications/MPS-LN/1998/1.pdf (title An introduction to p-variation and Young integrals, With emphasis on sample functions of stochastic processes). Acknowledgments. Thanks to Manuel Monteiro Marques, Paul Raynaud de Fitte and Lionel Thibault for their comments. They are not responsible of possible weaknesses.

References [A1]

Armatte, M., Maurice Fr´echet statisticien, enquˆeteur et agitateur public, Revue d’histoire des math´ematiques, 7 (2001) 7–65.

[A2]

Armatte, M., Fr´echet et la m´ediane: un moment dans une histoire de la robustesse, J. Soc. Fr. Stat., 147 (2006) 23–37.

[AB]

Asplund, E. & Bungart, L., A first course in integration, Holt, Rinehart and Winston, New York-Toronto, 1966.

[BH]

Belili, N. & Heinich, H., Median for metric spaces, Appl. Math. (Warsaw), 28 (2001) 191–209.

[C]

Chaudhuri, P., On a geometric notion of quantiles for multivariate data, J. Amer. Statist. Assoc., 91 (1996) 862–872.

[DCD1] Dacunha-Castelle, D. & Duflo, M., Probabilit´es et statistiques, 1. Probl`emes ` a temps fixe, Masson, Paris, 1982. [DCD2] Dacunha-Castelle, D. & Duflo, M., Exercices de probabilit´es et statistiques, 1. Probl`emes ` a temps fixe, Masson, Paris, 1982. [Fe1]

Ferguson, T.S., Mathematical statistics: a decision theoretic approach, Probability and Mathematical Statistics, Vol. 1 Academic Press, New York-London 1967.

[Fe2]

Ferguson, T.S., see at http://www.math.ucla.edu/∼tom/ “Solutions to Selected Exercises”: http://www.math.ucla.edu/∼tom/MathematicalStatistics/ Sec18.pdf 12

[FF]

Foata, D. & Fuchs, A., Calcul des probabilit´es, deuxi`eme ´edition, Dunod, Paris, 1998.

[Fr]

Fr´echet M., Les ´el´ements al´eatoires de nature quelconque dans un espace distanci´e, Ann. Inst. H. Poincar´e, Sect. B, Prob. et Stat., 10 (1948) 215–310. This text, and other papers by this author, is on numdam.org.

[Ha]

Haldane, J.B.S., Note on the median of a multivariate distribution, Biometrika, 35 (1948) 414–415.

[Hi]

Hildebrandt, T.H., Introduction to the Theory of Integration, Academic Press, New York, 1963.

[IT]

Ioffe, A.D. & Tihomirov, V.M., Theory of extremal problems, Studies in Mathematics and its Applications, 6, North-Holland Publishing Co., Amsterdam-New York, 1979.

[Kem]

Kemperman J.H.B., The median of a finite measure on a Banach space, Statistical data analysis based on the L1 -norm and related methods (Neuchˆ atel, 1987), 217–230, North-Holland, Amsterdam, 1987.

[Koe]

Koenker, R., Quantile Regression, Econometric Society Monographs, 38, Cambridge University Press, Cambridge, 2005.

[KB]

Koenker, R. & Bassett G., Regression Quantiles, Econometrica, 46 (1978) 33–50.

[Kol]

Koltchinskii, V.I. M -estimation, convexity and quantiles, Ann. Statist., 25 (1997) 435–477.

[K¨ on]

K¨ onig, G., A hat´ arozott integr´ alok elm´elet´ehez [On the theory of ´ definite integrals], Mathematikai ´es Term´eszettudom´anyi Ertez´ ıt¨o, 15 (1897) 380–384.

[MD]

Milasevic, P. & Ducharme, G.R., Uniqueness of the spatial median, Ann. Statist., 15 (1987) 1332–1333.

[MM1] Monteiro Marques, M.D.P., Regularization and graph approximation of a discontinuous evolution problem, J. Differential Equations, 67 (1987) 145–164.

13

[MM2] Monteiro Marques, M.D.P., Differential inclusions in nonsmooth mechanical problems. Shocks and dry friction, Progress in Nonlinear Differential Equations and their Applications, 9. Birkh¨auser Verlag, Basel, 1993. [Mor1] Moreau, J.J., Proximit´e et dualit´e dans un espace hilbertien, Bull. Soc. Math. France, 93 (1965) 273–299. ´ [Mor2] Moreau, J.J., Fonctionnelles convexes, S´eminaire sur les Equations aux D´eriv´ees Partielles (1966–1967) II, Coll`ege de France, Paris (1967), 108 pages (reedited in 2003 in 950 copies by Facolt`a di Ingegneria, Universit`a di Roma “Tor Vergata”). This hard to find text is on numdam.org (including an errata page). [Mor3] Moreau, J.J., Approximation en graphe d’une ´evolution discontinue, RAIRO Anal. Num´er., 12 (1978) 75–84. [Mor4] Moreau, J.J., Bounded variation in time, in Topics in nonsmooth mechanics (J.J. Moreau, P.D. Panagiotopoulos, G. Strang eds), Birkh¨ auser, Basel, 1988. pp.1–74. [RF]

Raynaud de Fitte, P., Th´eor`eme ergodique ponctuel et lois fortes des grands nombres pour des points al´eatoires d’un espace m´etrique ` a courbure n´egative, Ann. Probab., 25 (1997) 738–766.

[Ro]

Rockafellar, R.T., Dual problems of Lagrange for arcs of bounded variation, in Calculus of Variations and Control Theory (ed. by D.L. Russel), Academic Press, New York 1976, pp.155–192.

[RW]

Rockafellar, R.T. & Wets, R.J.-B., Variational analysis, Grundlehren der Mathematischen Wissenschaften, 317, Springer-Verlag, Berlin, 1998.

[S1]

Schilling, R.L., Measures, integrals and martingales, Cambridge University Press, New York, 2005.

[S2]

Schilling, R.L., http://www.motapa.de/measures integrals and martingales/solutions13+14.pdf

[V1]

Valadier, M., Sur l’int´egration d’ensembles convexes compacts en dimension infinie, C. R. Acad. Sci. Paris S´er. A-B, 266 (1968) A14–A16.

14

[V2]

Valadier, M., Sous-diff´erentiels d’une borne sup´erieure et d’une somme continue de fonctions convexes, C. R. Acad. Sci. Paris S´er. A-B, 268 (1969) 39–42.

[V3]

Valadier, M., La muti-application m´edianes conditionnelles, Z. Wahrsch. Verw. Gebiete, 67 (1984) 279–282.

[V4]

Valadier, M., Quantiles as minimizers, (2014) arXiv:1411.2732.

15