An Application of Reversible Entropic Dynamics on Curved

mental) structure. It arises only after assigning a .... whose solutions have been obtained by use of mathematics software (Maple) and analytical manipulation:.
141KB taille 5 téléchargements 341 vues
An Application of Reversible Entropic Dynamics on Curved Statistical Manifolds C. Cafaro, S. A. Ali and A. Gif n Department of Physics, University at Albany–SUNY, Albany, NY 12222,USA Abstract. Entropic Dynamics (ED) [1] is a theoretical framework developed to investigate the possibility that laws of physics re ect laws of inference rather than laws of nature. In this work, a RED (Reversible Entropic Dynamics) model is considered. The geometric structure of the underlying curved statistical manifold Ms is studied. Reversible aspects of such RED are emphasized. The trajectories of this particular model are hyperbolic curves (geodesics) on Ms . Moreover, some analysis concerning the stability of these geodesics on Ms is carried out.

Keywords: Inductive Inference, Information Geometry, Statistical Manifolds, Relative Entropy PACS: 02.50.Tt

INTRODUCTION We use Maximum Entropy (ME) methods [2; 3] to construct a RED model. ME methods are inductive inference tools. They are used for updating from a prior to a posterior distribution when new information in the form of constraints becomes available. We use known techniques [1] to show that they lead to equations that are analogous to equations of motion. Information is processed using ME methods in the framework of Information Geometry (IG) [4]. The ED model follows from an assumption about what information is relevant to predict the evolution of the system. We focus only on reversible aspects of the ED model. In this case, given a known initial state and that the system evolves to another nal known state, we investigate the possible trajectories of the system. Reversible and irreversible aspects in addition to further developments on the ED model will appear in a forthcoming paper [8]. Given two probability distributions, how can one de ne a notion of "distance" between them? The answer to this question is provided by IG [4]. IG is Riemannian geometry applied to probability theory. As it is shown in [5], the notion of distance between dissimilar probability distributions is quanti ed by the Fisher-Rao information metric tensor.

THE UNDERLYING MANIFOLD OF THE RED MODEL We consider a RED model whose microstates span a 2D space labelled by the variables x1 2 RC and x2 2 R. We assume that the only testable information pertaining to the quantities x1 and x2 consists of the expectation values hx1 i, hx2 i and the variance 1x2 . Therefore these three expected values de ne the 3D space of macrostates of the system. A measure of distinguishability among the states of the ED model is achieved by assigning a probability distribution p .tot/ xEj E to each macrostate E . The process of

assigning a probability distribution to each state provides M S with a metric structure. Speci cally, the Fisher-Rao information metric de ned in .6/ is a measure of distinguishability among macrostates. It assigns an IG to the space of states.

The Statistical Manifold M S Consider a hypothetical physical system evolving over a two-dimensional space. The variables x1 and x2 label the 2D space of microstates of the system. We assume that all information relevant to the dynamical evolution of the system is contained in the probability distributions. For this reason, no other information is required. Each macrostate may be thought as a point of a three-dimensional statistical manifold with .2/ coordinates given by the numerical values of the expectations .1/ 1 D hx 1 i, 1 D hx 2 i, .2/ 2 D 1x 2 . The available information can be written in the form of the following constraint equations, R C1 R C1 .2/ hx1 i D 0 d x1 x1 p1 x1 j .1/ , hx2 i D 1 d x2 x2 p2 x2 j .2/ , 1 1 ; 2 1x2 D

q

.x2

hx2 i/2 D

hR

C1 1 d x 2 .x 2

.2/ .2/ 1 ; 2

hx2 i/2 p2 x2 j

i1

2

(1)

.

The probability distributions p1 and p2 are constrained by the conditions of normalization, Z C1 Z C1 .2/ .1/ d x1 p1 x1 j 1 D 1, d x2 p2 x2 j .2/ D 1. (2) 1 ; 2 0

1

Information theory identi es the exponential distribution as the maximum entropy distribution if only the mean is known. The Gaussian distribution is identi ed as the maximum entropy distribution if only the mean and the variance are known. ME methods allow us to associate a probability distribution p .tot/ xEj E to each point in the space .1/ .2/ .2/ of states E . The distribution that best re ects the prior information 1 , 1 , 2 contained in the prior distribution m .E x / updated by the information .hx1 i ; hx2 i ; 1x2 / is obtained by maximizing the relative entropy 1 0 Z C1 Z C1 p .tot/ xEj E A, S E D d x1 d x2 p .tot/ xEj E log @ (3) m .E x/ 0 1

where m.E x / m is the uniform prior probability distribution. The prior m .E x / is set to be uniform since we assume the lack of prior available information about the system (postulate of equal a priori probabilities). Upon maximizing .3/, given the constraints .1/ and .2/, we obtain p

.tot/

xEj E D p1 x1 j

.1/ 1

p2 x 2 j

.2/ .2/ 1 ; 2

D

1 1

e

x1 1

q 2

2 2/ 2 2 2

.x2

1 2 2

e

;

(4)

.2/ .2/ where .1/ 1 D 1 , 1 D 2 and 2 D available information about the system.

2.

The probability distribution .4/ encodes the

The Metric Structure of Ms We cannot determine the evolution of microstates of the system since the available information is insuf cient. Not only is the information available insuf cient but we do not know what the equation of motion is. In fact there is no standard "equation of motion". Instead we can ask: how close are the two total distributions with parameters . 1 ; 2 ; 2 / and . 1 C d 1 ; 2 C d 2 ; 2 C d 2 /? Once the states of the system have been de ned, the next step concerns the problem of quantifying the notion of changing from the state E to the state E C d E . A convenient measure of change is distance. The measure we seek is given by the dimensionless "distance" ds between p .tot/ xEj E and p .tot/ xEj E C d E [4] :

where gi j D

Z

ds 2 D gi j d i d d xE p

.tot/

xEj E

j

,

(5)

@ log p .tot/ xEj E @ log p .tot/ xEj E

(6) @ i @ j is the Fisher-Rao metric [5]. Substituting .4/ into .6/, the metric gi j on Ms becomes, 0 1 1 0 0 2 B 1 1 C 0 0 C. gi j D B (7) 2 @ A 2 2 0 0 2 2

From .7/, the "length" element .5/ reads, ds 2 D

1 2 1

d

2 1C

1 2 2

d

2 2C

2 2 2

d

2 2.

(8)

We bring attention to the fact that the metric structure of Ms is an emergent (not fundamental) structure. It arises only after assigning a probability distribution p .tot/ xEj E to each state E .

The Statistical Curvature of Ms We study the curvature of Ms . This is achieved via application of differential geometry methods to the space of probability distributions. As we are interested speci cally in the curvature properties of Ms , recall the de nition of the Ricci scalar R, R D g i j Ri j ,

(9)

where g ik gk j D

i j

so that g i j D gi j Ri j D @k 0ikj

1

2

2 , 2 , 2 /. 1 2 2

D diag.

k n @ j 0ik C 0ikj 0kn

The Ricci tensor Ri j is,

m k 0ik 0 jm .

(10)

The Christoffel symbols 0ikj appearing in the Ricci tensor are de ned in the standard way, 1 0ikj D g km @i gm j C @ j gim @m gi j : (11) 2 Using .7/ and the de nitions given above, the non-vanishing Christoffel symbols are 1 1 3 3 1 D 2 D 02 D 011 , 022 D 2 1 2 , 033 D 12 and 023 . The Ricci scalar becomes 32 2 1

RD

1 < 0.

(12)

From .12/ we conclude that Ms is a 3D curved manifold of constant negative .R D 1/ curvature. It is interesting to note that the space of Gaussians is a 2D curved manifold while we have a 3D manifold (3D D 2D C 1D, Gaussians + exponentials). It seems that the additional exponential probability distributions do not spoil the constant curvature. This is trivial because the additional exponentials are independent of the Gaussians: in this model we consider two independent systems. The study of the Gaussian and the exponential independent of each other, followed by the use of one system as a clock for the other appears in [8].

CANONICAL FORMALISM FOR THE RED MODEL Given the initial macrostate and that the system evolves to a nal macrostate, we investigate the expected trajectory of the system on Ms . It is known [6] that the classical dynamics of a particle can be derived from the principle of least action in the form, Z sf dq j J J acobi q D dsF q j ; ; s; H D 0, (13) ds si where the function F does not encode any information about the time dependence. The quantity s is an arbitrary (unphysical) parameter along the trajectory. Indeed dq j F qj; ; s; H ds

[2 .H

U /]

1 2

X dq j dqk a jk ds ds j;k

!1 2

,

(14)

where the energy of the particle is given by H D T C U .q/ D

1X a jk .q/ qP j qPk C U .q/ . 2 j;k

(15)

The coef cients a jk .q/ are the reduced mass matrix coef cients and qP D dq ds . We now seek the expected trajectory of the system assuming it evolves from the given initial state old D Cd . 1 .si / ; 2 .si / ; 2 .si // to a new state new D

1 s f ; 2 s f ; 2 s f . A convenient choice of the parameter s is through a supplementary condition so the F is constant (=1). Then, it can be shown that the system moves along a geodesic in the space of states [1]. Since the trajectory of the system is a geodesic, the RED-action is itself the length:

Z

JR E D [ ] D where P D

d ds

sf

ds gi j

si

d

Z

1 2

.s/ d j .s/ ds ds i

sf

dsL

si

;P

(16)

and L. ; P / is the Lagrangian of the system, i j L. ; P / D .gi j P P / 2 . 1

(17)

The evolution of the system can be deduced from a variational principle of the Jacobi type. The natural choice for the supplementary condition for the RED model that de nes the parameter determining the evolution along the trajectory is s , where satis es d id j the condition gi j d d D 1. Performing a standard calculus of variations, and identifying s with , we obtain, Z 1 @gi j i j d P k k P P JE D [ ] D d D 0; 8 k . (18) k 2@ d Note that from .18/,

d Pk d

1 @gi j P i P j . 2@ k

This "equation of motion" is interesting because it shows that if D 0 for a particular k then the corresponding P k is conserved. This suggests to interpret P k as momenta. Equations .18/ and .11/ lead to the geodesic equations, i j d2 k. / k d . /d . / C 0 D 0. (19) ij d 2 d d Observe that .19/ are second order equations. These equations describe a dynamics that is reversible and they give the trajectory between an initial and nal position. The trajectory can be equally well traversed in both directions. @gi j @ k

D

Geodesics on Ms We seek the explicit form of .19/ for then statistical coordinates . 1 ; 2 ; o 2/ parametrizing the submanifold ms of Ms , ms D p .tot/ xEj E 2 Ms : E satis es .19/ . Substituting the explicit expression of the connection coef cients found in subsection .2:3/ into .19/, the geodesic equations become, d2 d

1

1 2

1

d2 d

2 2

d d 1 2

1

2

d d

D 0, 2

2

d2 d

2 2

C 21 2

2 d 2 d d d

2

2

2

d d

2

D 0.

D 0,

(20)

This is a set of coupled ordinary differential equations, whose solutions have been obtained by use of mathematics software (Maple) and analytical manipulation: 1.

/ D A1 .cosh .

A22 2. / D 2 2 cosh .2 2.

/ D A2

The quantities A1 , A2 , B2 , PjPj

1 2

1

1

// ,

1 2

sinh .2

/

cosh . cosh .2

and

sinh .

/

1

2

2

2

/

2

A22

/C 8

sinh .

/

sinh .2

2

2 2

C B2 ,

/

.

A22

/C 8

2

(21)

2 2

are the ve integration constants (5 D 6

1,

D 1). The coupling between the parameters 2 and 2 is re ected by the fact that their respective evolution equations in .21/ are de ned in terms of the same integration constants A2 and 2 . Equations .21/ parametrize the evolution surface of the statistical submanifold ms Ms . By eliminating the parameter , 2 can be expressed explicitly as a function of 1 and 2 , 2.

1,

2/ D

2

2 2 1

1

2 1

.

2

B2 / .

(22)

A1 A2

This equation describes the submanifold evolution surface.

ABOUT THE STABILITY OF GEODESICS ON Ms We brie y investigate the stability of the trajectories of the RED model considered on Ms . It is known [6] that the Riemannian curvature of a manifold is closely connected with the behavior of the geodesics on it. If the Riemannian curvature of a manifold is negative, geodesics (initially parallel) rapidly diverge from one another. A more detailed study of the stability of geodesics on different curved statistical manifolds underlying RED models will be the subject of a forthcoming paper [7]. For the sake of simplicity, 1 A2 we assume very special initial conditions: D 1 D 2 1, A1 and B2 are 4, 8 2 2

arbitrary. However, the conclusion we reach can be generalized to more general initial conditions. Recall that Ms is the space of probability distributions p .tot/ xEj E labeled

by parameters 1 ; 2 ; 2 . These parameters are the coordinates for the point p .tot/ , and in these coordinates a volume element d VMs reads, 1

d VMs D g 2 E d n E

p

gd

1d

2d 2

(23)

where g D j det gi j j D

Ms is,

2

2 4 1 2

. Hence, using .23/, the volume of an extended region of

VMs D

Z1 .

/

1 .0/

Z2 .

/

2 .0/

Z2 .

/

p

gd

1d

2d 2.

(24)

2 .0/

Finally, using .21/ in .24/, the temporal evolution of the volume VMs becomes, A2 VMs . I / D p e : 2

(25)

1 d 1 Note that > 0. Equation .25/ shows that volumes VMs . I / 1D 1 d D0 increase exponentially with . Consider the one-parameter . / family of statistical volume elements FVMs . / VMs . I / . The stability of the geodesics of the RED model may be studied from the behavior of the ratio rVMs of neighboring volumes VMs . I C / and VMs . I /,

VMs . I C / (26) VMs . I / The quantity rVMs describes the relative volume changes in for volume elements with parameters and C where the changes are considered to be positive. Substituting .25/ in .26/, we obtain: def

rVMs D

rVMs D e

.

(27)

Equation .26/ shows that different initial conditions . / affect the rate of exponential growth. We conclude that the statistical volume elements associated to the geodesics .21/ of the RED model are strongly divergent in size, not in position. The divergence in position requires the consideration of equations of motion. The correct tool to study this might be the equation of geodesic deviation in general relativity. A detailed analysis will appear in [7].

FINAL REMARKS A RED model was considered. The space of microstates is 2D while all information necessary to study the dynamical evolution of such a system is contained in a 3D space of macrostates. It was shown that the RED model is related to a 3D statistical space of probability distributions Ms which possess the geometry of a curved manifold of constant negative curvature .R D 1/. The geodesics of the RED model are hyperbolic curves on the submanifold ms of Ms . Furthermore, considerations of statistical volume elements suggest that these entropic dynamical models might be useful to mimic exponentially unstable systems. Based on the results obtained from the RED model and provided the correct variables describing the true degrees of freedom of a system be identi ed, perhaps more insights into the foundations of models of physics may be uncovered. Acknowledgements: The authors are grateful to Prof. A. Caticha for useful guidance.

REFERENCES 1. 2. 3. 4. 5. 6. 7. 8.

A. Caticha, "Entropic Dynamics", Bayesian Inference and Maximum Entropy Methods in Science and Engineering, ed. by R.L. Fry, AIP Conf. Proc. 617, 302 (2002). A. Caticha, "Relative Entropy and Inductive Inference", Bayesian Inference and Maximum Entropy Methods in Science and Engineering,ed. by G. Erickson and Y. Zhai, AIP conf. Proc. 707, 75 (2004). A. Caticha, "Maximum entropy and Bayesian data analysis: Entropic prior distributions", Physical Review E 70, 046127 (2004). S. Amari and H. Nagaoka, Methods of Information Geometry, American Mathematical Society, Oxford University Press, 2000. R.A. Fisher, Proc. Cambridge Philos. Soc. 122, 700 (1925); C.R. Rao, Bull. Calcutta Math. Soc. 37, 81 (1945). V.I. Arnold, Mathematical Methods of Classical Physics, Springer-Verlag, 1989. C. Cafaro, S. A. Ali, "Stochasticity in Reversible Entropic Dynamics on Curved Statistical Manifolds", In Preparation. C. Cafaro, S. A. Ali, A. Gif n, "Reversible and Irreversible Aspects of Entropic Dynamics", In Preparation.