A re-examination of the “zero is enough” hypothesis ... - Olivier Brandouy

Aug 5, 2012 - Indeed, as Cont (2000) points out, the result of more than half a century of ... act comparably well in terms of convergence to equilibrium price and efficiency as ... offsetting by a random amount the most recent market price.
867KB taille 0 téléchargements 37 vues
J Econ Interact Coord (2012) 7:223–248 DOI 10.1007/s11403-012-0099-0 REGULAR ARTICLE

A re-examination of the “zero is enough” hypothesis in the emergence of financial stylized facts Olivier Brandouy · Angelo Corelli · Iryna Veryzhenko · Roger Waldeck

Received: 8 September 2011 / Accepted: 16 July 2012 / Published online: 5 August 2012 © Springer-Verlag 2012

Abstract In recent years, a growing literature has claimed that the market microstructure is sufficient to generate the so-called stylized facts without any reference to the behaviour of market players. Indeed, qualitative stylized-facts can be generated with zero-intelligence traders (ZITs) but we stress that they are without any quantitative predictive power. In this paper we show that in most of the cases, such qualitative stylized facts hide unrealistic price motions at the intraday level and ill-calibrated return processes as well. To generate realistic price motions and return series with adequate quantitative values is out-of-reach using pure ZIT populations. To do so, one must increasingly constrain agents’ choices to a point where it is hard to claim that their behaviour is completely random. In addition we show that even with highly constrained ZIT agents, one cannot reproduce real time series from these. Except in a few cases, first order moments of ZITs never equal real data ones. We therefore claim that stylized facts produced by means of ZIT agents are useless for financial engineering.

The authors would like to thank funding from the Institut Mines-Telecom, grant “Future and rupture”. O. Brandouy (B) Sorbonne Graduate Business School, GREGOR (EA 2474, IAE de Paris), Paris, France e-mail: [email protected] A. Corelli IEI Department, Linköping University, Linköping, Sweden e-mail: [email protected] I. Veryzhenko ENSAM Paristech, GREGOR (EA 2474, IAE de Paris), Paris, France e-mail: [email protected] R. Waldeck Department LUSSI, Telecom Bretagne, Institut Mines-Telecom, Brest, France e-mail: [email protected]

123

224

O. Brandouy et al.

Keywords Agent based modelling · Time series analysis · Financial markets · Heterogeneity of agents · Market microstructure 1 Introduction Agent-based modelling is a powerful research tool for understanding how macro phenomena emerge. As such, it is a widely used methodology in the social sciences and economics in particular (Duffy 2006). As with the approach of the representative individual agent, it aims at explaining the macroscopic level from the microscopic point of view of agents, but with the additional step that interaction structures and rationality play a major role. Financial markets are particularly appealing applications for agent-based methods (Le Baron 2006; Hommes 2006) and the debate is on whether institutions or rationality is the most important factor in explaining stylized facts that are observed in real markets, but also replicated in artificial ones. Indeed, as Cont (2000) points out, the result of more than half a century of empirical studies on financial time series indicates that the seemingly random variations of asset prices do share some quite non trivial statistical properties. Such properties, common across a wide range of instruments, markets and time periods are called empirical stylized facts. Such stylized facts are usually formulated in terms of qualitative properties of asset returns, and may not be precise enough to distinguish among different parametric models. The major stylized facts are the following : i) for the return time series : absence of autocorrelation, except for very small intraday time scales, volatility clustering,1 autocorrelation of volumes and cross correlation volume volatility (Levy et al. 2000); ii) for the empirical distribution of asset returns, one observes that they are non-Gaussian but leptokurtic i.e. fat tailed distributions tending to Gaussianity when the time scale increases. The fact that many different assets share such common qualitative properties and that many artificial systems may reproduce the latter does not imply that they can also generate adequate quantitative values for these properties. This paper shows that in fact, this is almost out of reach within the framework of zero intelligence agents (Gode and Sunder 1993). Stylized facts are difficult to explain by the mainstream theory and the effort in empirical research to describe data lacks of a convenient theoretical foundation of these facts. However, an alternative approach has emerged from econophysics which describes the same financial facts as scaling laws. Indeed, this approach considers that physical systems, consisting of a large number of interacting particles, obey universal scaling laws that are independent of the microscopic details : economies (and social systems in general) can be considered in the same way Amaral et al. (1999). This approach may suggest that the rational individual choice is not important in explaining macroscopic facts. But as Lux (2009) points out, it might be the heterogeneity of market participants together with a few basic principles of interactions that exerts a dominating influence on the macroscopic market behaviour, whatever the institutional 1 Volatility clustering expresses the fact that large (small) changes in returns tend to be followed by large

(small) changes of either sign, for some considerable time.

123

A re-examination of the “zero is enough” hypothesis

225

settings. Indeed a large literature has emerged which aims at discerning among two major explanations: market micro-structure and agents’ behaviour and heterogeneity. Papers in favour of the explanation by the microstructure initiate with the research on zero intelligent traders (here-after ZIT) of Gode and Sunder (1993) who show, in the spirit of Becker (1962), that traders acting randomly but within a budget constraint act comparably well in terms of convergence to equilibrium price and efficiency as compared to human subjects in experimental economics (see Smith 1962). Cliff et al. (1997) show that for special demand or supply function this may not be the case and a slightly more complex behavioural assumption is required to achieve equilibrium in continuous double auction markets. Ladley and Schenk-Hoppe (2009) in a similar framework as Gode and Sunder, but with a constant flow of traders entering the market, reproduce price movements and show that aspects of the order book such as the size of spreads and conditional probabilities of order submissions can be obtained by the interplay of ZITs and the book. However, the observed frequency of the submitted order types seems related to strategic behaviours based on the observed book state. Maslov (2000) introduces a model where traders randomly choose to trade either at the market price or by placing a limit order; limit orders are simply determined by offsetting by a random amount the most recent market price. Maslov shows that fat tails, long range correlation in the volatility and non-trivial Hurst exponents arise in such framework. One paper by Farmer et al. (2005) shows that a simple ZIT model working as a continuous double auction with both market and limit orders predicts well bid-ask spreads, price diffusion rates and market impact function related to the supply and demand of 11 stocks in the London Stock Exchange. Their conclusion is that the price formation mechanism strongly constrains the market, playing a more important role than strategic behaviour. They adjust their model to real data by making simple assumptions about order placement, cancelling process of limit orders and ticks (price increments). These quantities are estimated on a daily basis from the real stocks and serve for making predictions. Nevertheless, their framework exhibits odd assumptions such as an order issuing based on log prices rather than raw prices and various Poisson laws. Other studies show that stylized facts can be reproduced using heterogeneous behaviours in the agents population. Most of these models2 suggest that the aggregation of simple interactions at the micro level leads to complex non linear behaviour at the macro level. Typically, the heterogeneity of behaviours is due to different types of rationality (informational and computational) and heterogeneity in preferences as well. The experimental economics literature also tackles some issues concerning stylized facts. Early experiments show that bubbles can easily be reproduced in double auction markets (Smith et al. 1988). Bubbles are resilient to market conditions such as short selling, margin buying opportunities, limit price-change rules, informed insider trading (King et al. 1993). The only way to generate prices that approximately reflect the intrinsic value of a dividend share is by increasing the level of subject

2 Since there is a large literature the reader is refereed to the surveys of Le Baron et al. (1999), Hommes

(2006), Lux (2009) or to the book of Levy et al. (2000).

123

226

O. Brandouy et al.

experience. Plott and Sunder (1982) are the first to report some stylized facts within the lab without however providing any explanation about it. They show excess Kurtosis and the lack of autocorrelation of returns in prices. Kirchler and Huber (2007) present an explanation based on asymmetric information between traders in a double auction markets. They introduce a market characterized by traders receiving heterogeneous information. They show that the latter is the source of fat tails and absolute returns whereas higher noise trading (trading not based on fundamentals) does not explain absolute returns. Bottazzi et al. (2011) design an experimental market essentially adapted from Hommes (2006). In this research, each subject makes a prediction about future expected return and volatility which determines the investment strategy of an artificial speculator through the maximization of a CARA utility function; they reproduce some of the stylized facts regarding returns, i.e. excess volatility, skewness and fat tails. They conjecture that unstable markets are more destabilizing for traders since the latter rely more on the crowd which generates unstable dynamics. To the best of our knowledge, only one paper attempts to reconcile the two streams of literature presented above, by suggesting that different elements can be at the heart of the emergence of stylized facts for different time horizons. Liu et al. (2008) show that a market with a clearing house microstructure and zero intelligent agents is responsible for reproducing leptokurtic and fat-tailed distributions, autocorrelation and excess volatility for intraday data but it is only responsible for excess volatility in daily returns. Therefore, behavioural assumptions are required to explain other facts which they show by the use of an extended minority game. Our paper lies in the strand of literature of ZIT agents and contributes to the debate concerning the level of intelligence that is necessary for generating stylized facts and realistic price motions as well. Not only are we interested in coarse grain qualitative empirical regularities, but also in the actual ability of ZITs to generate quantitatively acceptable stylized facts. For that purpose, we produce several families of ZIT agents similar to those found in the literature, but calibrated using real market data. This paper is organized as follows: Sect. 2 presents the artificial market simulator used for replicating real data, defines the ZIT agents and the simulation methodology. Section 3 shows an introductory case study, presents the core results of the paper and proposes a sensitivity analysis of the latter. The punchline that can be drawn from our results is simple: ZITs are unable to generate realistic price motions, except when highly constrained, and fail most often to generate quantitatively stylized facts. It implies that aspects linked to rationality, especially heterogeneity in behaviour, should play a central role for this line of research to become useful in financial engineering. This is the essence of the concluding Sect. 4.

2 Simulation methodology In this section, we start with a general presentation of the simulation tool (Sect. 2.1). We then move to a presentation of the different behavioural assumptions used in the experiments (Sect. 2.2).

123

A re-examination of the “zero is enough” hypothesis

227

2.1 A brief description of ATOM In this research, we use an artificial market simulator called “ArTificial Open Market” (here-after ATOM). In this simulator, price formation directly emerges from the interaction of agents at the micro-level. The market microstructure in ATOM is duplicated from the Paris NYSE-Euronext stock exchange with the following features:3 1. It can emulate several order-books in parallel. Each order-book runs a continuous, double auction mechanism. 2. All types of NYSE-Euronext orders are allowed: limit, market, cancel or update orders, as well as sophisticated combinations such as stop-limit orders or limit orders with ‘‘iceberg” execution.4 3. The philosophy ruling the platform is such that an agent is an abstract entity that can be instantiated both by an autonomous process (developing its own strategy), or through an interface allowing humans to interact with the system. This latter possibility is not used in this research. 4. Concerning virtual agents, their possible behaviour is flexible and can be designed to fit the researchers’ requirements. At each time step, the scheduler system offers agents the possibly to decide if they will send or not a new order for each of their stocks to the corresponding order book. Particular attention has been paid to the accuracy of calculations made by the platform so as to ensure the trustworthiness of prices it delivers. Notably, no real types are used or arithmetic divisions are allowed in the code. A logging system systematically historicizes all agents’ decisions, as well as their impacts on the market. This is an essential feature to exploit the “replay-engine” ability of the simulator: if a population of agents simply re-processes real orders submitted to the market on a given day, the resulting price series is exactly the same as the one produced by the market.5 So to speak, the simulator is unbiased with regard to the microstructure it claims to be grounded on. For example, Fig. 1 illustrates such an experiment with a set of 83616 real-world orders concerning the French blue-chip France-Telecom (FTE), and submitted to the NYSE-Euronext market on June 26th 2008 between H9.02 .14 .813 and H17.24 .59 .917 . Figure 1a presents the price produced with ATOM and the real-order flow while Fig. 1b presents the corresponding exchanged volumes. In each of these figures, two sets of data are plotted. The upper set corresponds to the series generated by ATOM in processing the real-world order flow. The bottom part displays those actually observed on the NYSE-Euronext market. One clearly sees that 3 For technical details describing the simulation platform, see Mathieu and Brandouy (2010). 4 See NYSE-Euronext rule-book, at http://www.euronext.com In this paper we only use “Limit”, “Market”

and “Cancel” orders. 5 Technically speaking, the replay-engine configuration of the platform operates as follows: it first reads a log file in which orders, with their characteristics -and notably the different time stamps-, are stored. It then creates as many assets,order-books and agents as necessary to reprocess exactly this file. Once this has been done, each order is parsed from the log file and attributed to the agent issuing the order, taking into account the time stamp of the order. The platform can read most of the logs in text or FIX formats through wrappers.

123

O. Brandouy et al.

18.0 17.8

Price

18.2

228

800

1600

2400

3200

4000

4800

5600

6400

7200

8000

8800

9600

10400

9600

10400

18.0 17.8

Price

18.2

Simulated Time

10:00

12:00

14:00

16:00

Physical Time

15000 0 5000

Volume

25000

(a)

800

1600

2400

3200

4000

4800

5600

6400

7200

8000

8800

15000 0 5000

Volume

25000

Simulated Time

10:00

12:00

14:00

16:00

Physical Time

(b) Fig. 1 Simulated prices from real-order flows versus Real data. a Prices: ATOM versus NYSE-Euronext. b Volumes: ATOM versus NYSE-Euronext

the replay-engine included in our artificial platform can process the real-world order flow in the same way it is treated by the NYSE-Euronext engine. Beyond this simple case, ATOM can use various kind of agents with their own behaviours and intelligence. Thousands of these agents can evolve simultaneously,

123

A re-examination of the “zero is enough” hypothesis

229

creating a truly heterogeneous population. Once designed, agents evolve by themselves, learning and adapting to their (financial) environment in the most sophisticated cases. In this research, we do not use such agents but rather artificial entities using random generators to make their decisions and unable to learn or evolve: Zero Intelligence Traders. As we show below, ZITs can come in various forms, from the simplest to the most constrained ones. 2.2 Agent’s behaviour : from unconstrained to highly-constrained ZITs In this section we introduce four different types of ZITs.6 For the sake of possible replication of the results presented in this research, the pseudo-code describing each agent is available in “Appendix”. The most basic ZITs we use (called hereafter “Unconstrained ZIT”) are directly inspired by the work of Maslov (2000). From this starting point, we add, step by step, constraints to their behaviour (in accordance with real market data) with the aim of reproducing quantitatively some stylized facts. Notice that we denote by capital letters all real market data (P, V, R, for example resp prices, volumes and returns) and by small letters all simulated data. Except when a cancel order is issued, the trading activity consists in sending to the market an order made of a direction, a price limit (except when this order is a market order) and a quantity. The common characteristics for all of our ZITs, whatever their level of constraints, are as follows: – As said previously, there are three possible order types: limit, market and cancel. The proportions between these different order types are 80 % of limit orders, 15 % of market orders and 5 % of cancel orders which reflects realistic characteristics of real markets. These figures are estimated from a data set of real order flows gathering around 36,000 observations of intraday trading (courtesy of Calyon SA, here-after “CDS”, for Calyon Data Set). However, they may vary from a given stock to another and across categories of investors (Foucault 1999; Handa and Schwartz 2006). Accordingly, these proportions are modified in the sensitivity analysis (see page 241). – Each agent can submit both orders, Buy and Sell. – Buy and Sell orders arise with equal probability ( p = 0.5). – A single asset is traded. – Within each ZIT family, we define two subcategories of agents with respect to the real market average volume observed for a given stock. ‘‘Big fishes” draw volumes between the mean and the maximum real volume , while ‘‘Small fishes”, draw this value between the minimum and the mean real volume. The reason for this choice is that it may generate a realistic picture of contemporary markets, where, in a 6 What “zero intelligence” means is questionable: in our simulations, we denominate ZIT artificial agents that mostly use random number generators to determine prices and volumes in their orders. These agents are more or less sophisticated but none of them use artificial intelligence methods such as classifiers or learning mechanisms to adapt their behaviour and/or to evolve. We thus do not claim to investigate all possible declinations of ZITs but a series of ZIT instantiations that are common in the literature.

123

230

O. Brandouy et al.

typical experiment, the ratio of big to small fishes is 1 over 5. This proportion is also obtained from the “CDS”. – Budget constraints are implemented: agents cannot make a trade that yields a negative profit, i.e., buyers cannot buy at a price higher than their reservation value and sellers cannot sell for a price below their marginal cost. Note that all the parameters chosen for the initial settings of the simulations are calculated based on real market data. These parameters are inferred for each stock on each day. As mentioned previously, four types of ZITs are defined from these basic elements: 1. Unconstrained ZIT (U Z I T ) with two types of price-generating processes: – Uniform price distribution (U Z I TU , see Algorithm 1, page 246): pt is drawn from a Uniform distribution in [Pmin , Pmax ] where, for a reminder of the convention, Pmin and Pmax are the minimum, respectively maximum price observed on the real market. – Normal price distribution (U Z I TN , see Algorithm 2, page 246): pt follows a Normal N (Pmean , Psd ). 2. Statistically calibrated ZIT (S Z I T , see Algorithm 3, page 246) are kind of bounded U Z I TU , meaning that i) they still perform a random draw from a Uniform distribution ii) the price range is limited by [Pmin , Pmax ] and iii) the range of admissible price is different between Sellers and Buyers, since we took for Sellers a simulated range for ask prices amin and amax and for bid prices bmin and bmax . These simulated boundaries are obtained from the “CDS” as mentioned previously. To go into the details, we first separate Bids and Asks and then find for each of these subsets the minimum and the maximum values observed on a given day. Similarly, the same procedure was applied to volume data. 3. Trend calibrated agents (T Z I T , see Algorithm 4, page 247) are S Z I T with the following additional feature: when they issue a new order, they pick a price that is formed using two additional parameters γt and δt . γt is geared at reproducing the tendency of a given series. δt generates some additional randomness. More precisely, T Z I T agents respect the following procedure : (a) Divide the day into n sub-intervals (b) Find min and max prices within each sub-intervals (c) Estimate the price series according to Eqs. (1) to (4): for this, choose the number of time period t for which you want an estimation of price (by fixing the parameter θ ) and apply Eqs. (1) to (4) to estimate prices. Let’s consider the following example : on a given period, one observes for a given stock a slow decay from a maximum to a minimum price. This slow decay can be described using Eqs. (1) to (4). γ0 = 0 1 γt = γt−1 + with t ∈ [0, θ ] θ δt ∼ log N (0, 1) Pt = Pmax + (Pmin − Pmax ) × γt × δt

123

(1) (2) (3) (4)

A re-examination of the “zero is enough” hypothesis

231

Fig. 2 Example of calibration procedure, Renault SA. S Pn : Subperiod “n”

Note that in Eq. 2, γt is an increasing step function from zero to one with step size equal to 1/θ . θ allows to track the underlying trend with a certain level of accuracy. The bigger θ , the more accurate this fitting. Thus, the behaviour of these agents necessitates to specify within a given day n sub-periods and,7 for each of these sub periods, the maximum and the minimum price. Note that Eq. 4 implies that the first price set by this category of agents is close to Pmax , for this example of a slow decay in prices (Fig. 2). Based on these information, agents can track the global price tendency so as to algorithmically grasp the underlying motions. So to speak, we endow TZITs with limited foreseeing capabilities in the very short run. 4. Derived from a procedure introduced by Farmer et al. (2005), the fourth category of ZIT agents is characterized by their relative aggressiveness (AZ I T , see Algorithm 5, 26):8 – Patient agents (AZ I TP ): send limit buy orders with prices drawn from a uniform distribution between zero and amin and limit sell orders from bmax to ∞.9 These agents are parametrized in such a way that they cannot trade one with another. Trades cannot occur within this population. The quantity is determined randomly using a uniform distribution between two integers min(V ) and max(V ). – Impatient ZIT ( AZ I TI ): send market orders using the same distribution for quantities as Patient ZIT. There are 85 % of AZ I TP and 15 % of AZ I TI in a typical simulation10 and the quantity posted by AZ I TP is twice the quantity posted by AZ I TI . 7 The precision in this procedure can be tuned so as to track more or less closely the underlying dynamics. n = 50 is the value arbitrarily chosen for the experiments. 8 Note that in Farmer et al. (2005) log prices and not prices where drawn from a Uniform distribution. 9 a min and bmax are inferred as described for S Z I T . 10 These figures corresponding roughly to the proportion of market versus limit orders in real markets.

123

5

5

10

15 0

0

Empirical Quantiles −5

−3

10

−2

−1

0

1

2

Normalized Returns

Theorical Quantiles

(a)

(b)

3

1e−04 1e−03 1e−02 1e−01 1e+00

Autocorrelation

−0.2 0.0 0.2 0.4 0.6 0.8 1.0

−10

−5

Probability

O. Brandouy et al. 1e−07 1e−05 1e−03 1e−01 1e+01

232

0

10

20

30

40

Time Lag

50

1

2

5

10

20

50

100

200

(d)

(c) Fig. 3 Renault SA intraday returns, departure from Normality. a Fat tails (normalized returns). b QQ-plot. c Auto-correlations, raw returns. d Auto-correlations, absolute returns (loglog scale)

3 Empirical design and results We introduce a case study to illustrate the main concepts and statistics used in the empirical part of this research (Sect. 3.1) and then generalize our results (Sect. 3.2). We also run a sensitivity analysis to explore how our results depend on various experimental settings (Sect. 3.3). 3.1 An introductory case study In this section, we focus on a single stock (Renault SA) for a single day (August 1, 2002) and compare simulated versus real data in terms of price motions and stylized facts. It is clear that the return series coming from Renault S.A. prices is not normal. Moreover, it exhibits fat tails and the autocorrelation in absolute returns decays slowly in log-log scale (see Fig. 3a–d). To assess the Volume-Volatility relationship we use the following, simplified framework: 1. We first slice the series in non overlapping time windows containing 300 observations. Depending upon the length of the series under investigation, the number of time windows may vary slightly. 2. We calculate for each of these slices, for each asset and/or each artificial series produced by the agents, two indicators: the mean volume over this period (mvt ) and the standard deviation of these returns (sdt ).

123

0

5

0.02 0.01 0.00

Empirical Quantiles −5

10

−3

−2

−1

0

1

2

Normalized Returns

Theoretical Quantiles

(a)

(b)

3

−0.2 0.0 0.2 0.4 0.6 0.8 1.0

1e−04 1e−03 1e−02 1e−01 1e+00

−10

Autocorrelation

233

−0.02 −0.01

Probability

1e−08 1e−06 1e−04 1e−02 1e+00

A re-examination of the “zero is enough” hypothesis

0

10

20

30

40

50

1

2

5

10

20

50

100

200

(d)

Time Lag

(c) Fig. 4 Qualitative stylized facts, unconstrained ATOM ZIT. a Fat tails (normalized returns). b QQ-plot. c Auto-correlations, raw returns. d Auto-correlations, absolute returns (loglog scale)

3. We then estimate a linear model: ˆ t sdt ∼ α + φmv

(5)

We use the value of φˆ for comparing the volume volatility relationship between simulated and real data. In doing so, and for the sake of simplicity, we explicitly exclude from the analysis the value of the constant α (said in different terms, we consider this constant to be similar for both models; see Fig. 4). This process is used in this article whenever the volume-volatility relationship is investigated. For example, this relationship for Renault is presented in Fig. 5a. The value of the regression slope is 1.3086E−6. The same Figure, (see Fig. 5b) is also produced using Unconstrained ZIT agents.11 In Table 1 we report a series of univariate statistics illustrating some of the stylized facts analysed in this research. Even if several stylized facts are reproduced by U Z I TU , T Z I T or even AZ I T , one should also consider the underlying price motions from which these statistics are computed. In Fig. 6, we report the prices coming from the real market (Fig. 6a) and for the five families of ZIT (Fig. 6b–f). It is clear that most of the ZIT families, beyond their apparent ability to generate congruent stylized facts, have a hard time at producing “realistic” price motions. The only “possible” exception, even if this can 11 Specifically, Z I T , see Sect. 2.2. U

123

Volatility

Volatility

4e−04 0e+00 0

200

400

600

800

1000

2e−04 3e−04 4e−04 5e−04 6e−04

O. Brandouy et al.

8e−04

234

1200

0

50

100

150

Volume

Volume

(a)

(b)

200

250

Fig. 5 Intraday volume-volatility relationship. a Renault S.A. b U Z I TU Table 1 Descriptive statistics 1 Real

2 U Z I TU

3 U Z I TN

4 SZ IT

5 T ZIT

6 AZ I T

Nobsa

2,214

2,302

2,273

2,222

2,219

2,005

Minimumb

−0.0085

−0.0211

−0.0156

−0.0232

−0.0151

−0.0900

Maximumc

0.0170

0.0233

0.0153

0.0279

0.0157

0.0900

Meand

−1.30E−5

6.00E−6

−2.00E−6

−8.00E−6

−1.40E−5

−2.64E−4

Stdeve

0.0012

0.0035

0.0036

0.0050

0.0019

0.0163

Skewnessf

1.5192

0.1071

−0.0216

0.1130

0.1471

0.2351

Kurtosisg

33.1606

10.0905

3.6706

5.2325

12.1035

9.8945

a Number of observations; 2,214 means for example 2,214 returns for the given trading day b,c,d,e,f ,g Minimum, maximum, and moments for observed returns

be discussed, is the T Z I T example that produces a price dynamics that looks like, at coarse grain, the real one. In summary, we point out two shortcomings in the use of ZIT for financial markets simulations: 1. The qualitative adequacy of stylized facts generated by ZIT can be accepted, at least at coarse grain. However, it is clearly more questionable if one considers their quantitative values. 2. The underlying price motion remains most of the time totally unrealistic from a qualitative point of view. This remark will be emphasized later-on (see Sect. 4) but will not be specifically investigated further in the following developments. In the following section, we present a procedure enabling us to make a comparison between a set of 37 real data and different simulations from the five ZIT families described Sect. 2.2. 3.2 Beyond the case study: ‘‘zero is not enough” Our data consists in intraday prices collected from the Paris NYSE-Euronext Stock Exchange covering 37 stocks in August 2002 (22 trading days). These stocks are

123

1500

2000

0

500

1500

(a)

(b)

1000

1500

2000

0

500

1000

Time

(c)

(d)

1000

1500

2000

2000

1500

Time

SS.1 0

1000

Time

SS.1 500

500

Time

45.0 45.5 46.0 46.5 47.0 47.5 48.0

0

45.0 45.5 46.0 46.5 47.0 47.5 48.0

SS.1 1000

45.0 45.5 46.0 46.5 47.0 47.5 48.0

SS.1

500

45.0 45.5 46.0 46.5 47.0 47.5 48.0

0

SS.1

235

2000

45.0 45.5 46.0 46.5 47.0 47.5 48.0

SS.1

45.0 45.5 46.0 46.5 47.0 47.5 48.0

A re-examination of the “zero is enough” hypothesis

0

500

1000

Time

Time

(e)

(f)

1500

2000

Fig. 6 Price motions, real versus simulated. a Renault S.A., real. b U Z I TU . c U Z I TN . d S Z I T . e T Z I T . f AZ I T

components of the CAC 40 index and are therefore amongst the most traded within the French market. For each day and each of the i = 37 stocks in the sample we produce a set of 5 simulations with the 4 different ZIT families described in Sect. 2.2.12 Notice again that ZIT agents are calibrated using real values calculated from the sample. For each simulation, we produce one concatenated return series based on the 22 simulated days. To avoid closing-to-opening jumps due to overnight information accumulation, we exclude returns that can be computed using closing prices at date t and opening prices at date t + 1. In other terms, the 22 days for each stock are summarized in j = 1 long time series. Note that the same concatenation procedure is run over the 12 2 unconstrained ZITS, statistically calibrated, Trend ZITs, Aggressive ZITs.

123

236

O. Brandouy et al.

real dataset. We thus have k = 6 subsets, 1 from real data, 5 from simulated ones. The dataset is therefore equivalent to a i = 37 × j = 1 × k = 6 tensor. For each of these simulations, we estimate, over the 37 observations, the distribution for the following statistics calculated on returns: 1. 2. 3. 4. 5.

Mean Standard deviation Skewness Kurtosis ρ1 and ρ2 , the first two values for autocorrelation coefficients computed using raw returns 6. The slope of the decay function for autocorrelation coefficients calculated over absolute returns. 7. The value of φˆ (see Eq. 5) indicating the direction and the strength of the volume volatility relationship. We then run two series of non-parametric tests using one simulation and real data as a benchmark to: – test equality in population distribution (two-sample KS test). For two distributions D1 and D2 , the null is that ‘‘D1 and D2 come from the same distribution.” – test equality in means (Fligner–Policello test13 and paired Wilcoxon test). For two distributions D1 and D2 , the null is that ‘‘D1 and D2 have the same mean value.” These tests are geared at appreciating whether the quantitative stylized facts are reproduced or not by means of ZITs. For illustration purposes, a limited example of the distribution over the 37 observations of each characteristic value for SF calculated on TZIT is plotted against the corresponding values calculated with real data in Fig. 7. We first report a series of non-parametric tests geared at examining whether the whole distribution of each stylized fact (summed-up with a single parameter) is similar to the distribution of the corresponding stylized facts calculated from the real world distributions. Results of the Kolmogorov–Smirnov test of equality in distribution is presented in Table 2. The results are rather unambiguous: except for the single case of the distribution of mean returns generated by Trend Calibrated ZIT (T Z I T ),14 and for the volume/volatility relationship for U Z I TU and U Z I TN , the two-sample Kolmogorov–Smirnoff test can be rejected with high levels of confidence. The interpretation is straightforward: neither higher moments nor autocorrelation-based stylized facts can be matched by any of our ZIT families. However, unconstrained ZITs, probably due to their high level of freedom, generate a realistic relationship between the average traded volume and the resulting average volatility observed within the same time window. One can also notice that when ZITs are more and more constrained, this stylized fact, if still noticeable, does no longer fit the values of our benchmark. 13 Fligner–Policello is the equivalent to the Mann–Whitney test but without assuming equality in variance. 14 For example, an illustration confirming this result for T Z I T can be found in Fig. 7a.

123

237

0

0

500

1000

1500

2000

50000 100000 150000 200000

A re-examination of the “zero is enough” hypothesis

−2.0e−05

−1.5e−05

−1.0e−05

−5.0e−06

0.0e+00

5.0e−06

0.0005

0.0010

0.0015

0.0020

0.0025

(b)

0.0

0.00

0.01

0.5

0.02

1.0

0.03

1.5

0.04

(a)

−1

0

1

2

0

50

100

(d)

0

0

5

5

10

15

10

20

25

15

(c)

−0.30

−0.25

−0.20

−0.15

−0.10

−0.30

−0.25

−0.20

−0.15

−0.10

−0.05

(f)

0

0

200

600

1000

100000 200000 300000

1400

(e)

−0.005

−0.004

−0.003

(g)

−0.002

−0.001

0.000

0e+00

1e−05

2e−05

3e−05

(h)

Fig. 7 Stylized facts distributions for T Z I T , real (plain lines) versus simulated (dash lines). a Mean. b SD. c Skewness. d Kurtosis. e ρ1 . f ρ2 . g Slope. h Vol/Vol

If we restrict our attention on first order moments, and check equality of means for these moments between simulated and real data using a test proposed by Fligner and Policello (1981), we obtain the following results (see Table 3).

123

123

0.9459

0.0000***

p value

0.5279

D

0.1892

p value

0.0017**

D

0.4324

p value

0.0000***

D

0.5676

p value

0.0002***

p value

D

0.4865

D

*** Sign. at 0.1 %, ** 1 %, * 5 % and “.” 10 %

AZ I T

T ZIT

SZ IT

U Z I TN

U Z I TU

Mean

Table 2 Two-sample Kolmogorov Smirnov tests

0.0201*

0.3514

0.0000***

0.6486

0.0000***

0.9459

0.0000***

0.6757

0.0000***

0.9189

SD

0.0000***

0.5676

0.0094**

0.3784

0.0402*

0.3243

0.0094**

0.3784

0.0017**

0.4324

Skewness

0.0000***

0.9189

0.0017**

0.4324

0.0001***

0.5135

0.0007***

0.4595

0.0002***

0.4865

Kurtosis

0.0000***

0.7027

0.0000***

0.8919

0.0000***

0.9189

0.0000***

0.9459

0.0000***

0.9459

ρ1

0.0000***

0.7027

0.0000***

0.9189

0.0000***

1.0000

0.0000***

1.0000

0.0000***

1.0000

ρ2

0.0402*

0.3243

0.0000***

0.7838

0.0000***

0.9730

0.0000***

1.0000

0.0000***

1.0000

Slope

0.0000***

0.6757

0.0201*

0.3514

0.0757.

0.2973

0.5279

0.1892

0.3565

0.2162

φˆ

238 O. Brandouy et al.

Mean

1.276E−7

Mean

0.0130∗

0.0012

−2.656E−6

2.4828

0.0000∗∗∗

U*

p value

−4.035E−5

155.6127

Mean

0.0010

0.1441

0.0000∗∗∗

4.3272

−16.9185

0.0007∗∗∗

0.3194

0.0000∗∗∗

p value

3.3761

7.3446

−0.9957

U*

−0.1787

0.0008

0.3648

0.9062

−7.5817

0.1034

1.6285

−0.0119

0.1221

1.5459

−0.0001

Skewness

−1.747E−6

0.0000∗∗∗

−39.8479

0.0040

0.0000∗∗∗

−8.9075

0.0022

0.0000∗∗∗

−40.3570

0.0035

SD

Mean

0.0000∗∗∗

−4.2283

U*

p value

−1.395E−6

0.0000∗∗∗

−5.9756

1.754E−7

0.0000∗∗∗

−5.6787

Mean

p value

U*

Mean

p value

U*

*** Sign. at 0.1 %, ** 1 % and * 5 %

Real

AZ I T

T ZIT

SZ IT

U Z I TN

U Z I TU

Mean

Table 3 Fligner Policello test

25.8839

−0.2137

0.0078∗∗∗

−2.6619

0.0000∗∗∗

−0.1996

−20.2753

0.0000∗∗∗

−16.1067

−0.1526

0.0000∗∗∗

12.4847

−0.2809

0.0000∗∗∗

25.4974

−0.3001

0.0000∗∗∗

25.6899

−0.3015

ρ1

2739.8962

0.0001∗∗∗

−3.8624

40.6920

0.0002∗∗∗

3.6821

2221.9776

0.0026∗∗∗

3.0100

19.6185

0.0003∗∗∗

3.6420

17.7071

Kurtosis

−0.2137

0.0078∗∗∗

−2.6619

−0.1997

0.0000∗∗∗

−32.4546

−0.1349

0.0000∗∗∗

−∞

−0.0800

0.0000∗∗∗

−∞

−0.0746

0.0000∗∗∗

−∞

−0.0781

ρ2

−0.0018

0.0150∗

−2.4313

−0.0017

0.0000∗∗∗

8.1609

−0.0030

0.0000∗∗∗

17.5064

−0.0033

0.0000∗∗∗



−0.0035

0.0000∗∗∗



−0.0035

Slope

2.816E−6

0.0000∗∗∗

−8.8679

6.623E−7

0.0184∗

2.3564

2.781E−6

0.9960

0.0049

5.725E−6

0.3385

−0.9570

1.912E−6

0.7127

−0.3682

4.288E−6

φˆ

A re-examination of the “zero is enough” hypothesis 239

123

123

Mean

p value

4.0000

0.0121∗

0.0012

0.0000∗∗∗

516.0000

0.0010

0.0000∗∗∗

677.0000

0.0008

0.0000∗∗∗

−2.656E−6

−4.035E−5

703.0000

Mean

W

0.0023∗∗

154.0000

W

p value

−1.747E−6

Mean

0.0002∗∗∗

115.0000

p value

W

0.0040

0.0000∗∗∗

0.0000∗∗∗

−1.395E−6

p value

Mean

44.0000

0.0022

76.0000

1.754E−7

W

Mean

0.0000∗∗∗

0.0000∗∗∗

p value

0.0035

5.0000

1.276E−7

69.0000

W

Mean

SD

*** Sign. at 0.1 %, ** 1 % * 5 % and “.” 10 %

Real

AZ I T

T ZIT

SZ IT

U Z I TN

U Z I TU

Mean

Table 4 Paired Wilcoxon tests

0.1441

0.0000∗∗∗

616.0000

−16.9185

0.0056∗∗

532.0000

−0.1787

0.1448

449.0000

−7.5817

0.0800.

468.0000

−0.0119

0.1105

458.0000

−0.0001

Skewness

25.8839

0.0000∗∗∗

1.0000

2739.8962

0.0001∗∗∗

98.0000

40.6920

0.0003∗∗∗

584.0000

2221.9776

0.0046∗∗

536.0000

19.6185

0.0004∗∗∗

576.0000

17.7071

Kurtosis

−0.2137

0.0101∗

183.0000

−0.1996

0.0000∗∗∗

12.0000

−0.1526

0.0000∗∗∗

665.0000

−0.2809

0.0000∗∗∗

702.0000

−0.3001

0.0000∗∗∗

702.0000

−0.3015

ρ1

−0.2137

0.0106∗

184.0000

−0.1997

0.0000∗∗∗

0.0000

−0.1349

0.0000∗∗∗

0.0000

−0.0800

0.0000∗∗∗

0.0000

−0.0746

0.0000∗∗∗

0.0000

−0.0781

ρ2

−0.0018

0.0913.

239.0000

−0.0017

0.0000∗∗∗

679.0000

−0.0030

0.0000∗∗∗

702.0000

−0.0033

0.0000∗∗∗

703.0000

−0.0035

0.0000∗∗∗

703.0000

−0.0035

Slope

4.288E−6

2.816E−6

0.0000∗∗∗

1191.0000

6.623E−7

0.0219∗

473.0000

2.781E−6

1

684.0000

5.726E−6

0.338

774.0000

1.913E−6

0.7068

720.000

φˆ

240 O. Brandouy et al.

241

0.0

0.5

1.0

1.5

2.0

2.5

A re-examination of the “zero is enough” hypothesis

−1

0

1

2

Fig. 8 Distribution of Skewness for U Z I TU —dash line versus real world series—solid line

Here again, the tests reject the ability of our ZIT families at reproducing quantitative stylized facts. The only cases where these tests cannot be rejected are those of U Z I TU , U Z I TN , S Z I T for the skewness distribution and for the value of the volume/volatility relationship, and T Z I T for the mean. Said differently, the first three categories of agents might do a relative good job at generating realistic third-order moments for the return distribution and, as mentioned previously, in delivering a realistic volume/volatility interplay. However, in these tables only the mean of each parameter distribution is tested against its real-world counterpart. If the test cannot be rejected the only thing we can conclude is that the central values of the distribution are not so different. In our opinion, this is a necessary but not sufficient condition to accept a family of Agents. For example, if one considers the distribution of the Skewness for U Z I TU against the real series, the equality of the means cannot be rejected (see Tables 3, 4) although it is clear that the whole distribution is different (see Fig. 8). Rejection for T Z I T may be more surprising since they were constrained to generate more realistic price motions and even with that, they were not able to reproduce first orders moments beyond the mean. Even if skewness is an important feature of financial distributions (notably important for asset managers), given the overall negative conclusions drawn on other moments and correlations of the distributions, this is a rather weak result. If we go further in the analysis with a paired Wilcoxon rank test, T Z I T is now rejected while none of the skewness-related tests are rejected at the 5 % threshold.

3.3 Sensitivity analysis: the importance of model parameters A variety of factors may have an effect on the results presented above. This section is dedicated to a discussion on two factors that may affect notably our results: i) the impact of the proportion of “Big fishes” versus “Small fishes” and ii) the role of the ratio “Limit” to “Market” orders. To address these points, we first vary only one of

123

242

O. Brandouy et al.

these parameters while the other stay constant and report the impact of this variation on the market. Note that T Z I T are not significantly influenced by the variation in the proportions of the two factors, but rather by the price trend itself. Amongst the two remaining families, S Z I T is the more driven by the dynamics of order flows, and thus is the best potential candidate for an efficient calibration through both factors. We have chosen to restrict the presentation of the sensitivity analysis to the latter and to the AZ I T inspired by the Farmer model (Farmer et al. 2005). For the latter, we vary the proportion of “patient” versus “impatient” agents in the population. In Table 5, we report the correlations between the model parameters and the statistical properties tracked throughout this paper. i) Big fishes/Small fishes The results in Table 5 clearly show that except for the mean, which is insensitive to the modification of the model parameters, all other stylized facts do react in some way to the latter. Even if the correlation coefficients may be relatively small (ranging from −0.5781 for the Kurtosis to 0.5876 for the “slope”), all of them are significant at the 1 % level. However, if volatility tends to increase with

Table 5 Linear correlation coefficient of model parameters and stylized facts Statistics

Mean SD Skewness

Big/small fishes proportiona

ρ1 ρ2 Slope φˆ

Impatient agent

Patient agent

Corr.

0.3216

0.1408

−0.1551

0.1848

p value

0.1250

0.2414

0.2493

0.1942

Corr.

0.5744

−0.9759

0.2259

−0.6677

p value

0.000∗∗∗

Corr. p value

Kurtosis

Limit/market proportionb

Corr.

−0.05648 0.000∗∗∗ −0.5781

0.000∗∗∗

0.0911∗

0.1688

0.0990

0.1593

0.4637

−0.1872

−0.5669

p value

0.000∗∗∗

Corr.

0.4663

p value

0.000∗∗∗

0.000∗∗∗

Corr.

0.1470

0.0580

pvalue

0.000∗∗∗

0.000∗∗∗

Corr.

0.5876

p value

0.000∗∗∗

Corr. p value

−0.3429 0.000∗∗∗

0.1179 −0.7495

−0.5572 0.000∗∗∗

0.000∗∗∗ 0.6469 0.000∗∗∗ −0.7164 0.000∗∗∗ 0.0339 0.8021

0.0571

0.0084

0.6358

0.9502

0.000∗∗∗ −0.0597 0.000∗∗∗ 0.2283 0.1070 −0.7771 0.000∗∗∗ 0.6859 0.000∗∗∗ −0.2688 0.0564∗ −0.3503 0.0117∗∗

100 Replications for each parameter set have been conducted. Each value is an averaged result *** Sign. at 1 %, ** 5 %, * 10 % a Experiment settings: basic design of S Z I T . The number of Small Fishes is fixed as 200, while the number of Big Fishes varies from 0 to 200 b Experiment settings: basic design of S Z I T . The proportion of Cancel orders is fixed,  = 5. The C proportion of Limit orders  L varies from 60 to 95 %, and proportion of Market orders is defined according to formula  M = 100 −  L − C

123

A re-examination of the “zero is enough” hypothesis

243

this ratio (the correlation coefficient being equal to 0.5744), kurtosis tends to decrease (coefficient = −0.5781 ). Big fishes, due to the large trading volume they can generate, provide a large amount of liquidity that directly benefits to Small fishes. In other words, Big fishes “feed” Small fishes. Thus, Small fishes can easily buy or sell stocks with a price close to the current market price, until a big order is completely executed. When the number of Big fishes increases significantly, these big players loose their role of liquidity providers, because they trade more frequently within their own group. Increasing the proportion of Big fishes has also a positive effect on the slope of the decay function for autocorrelation coefficients calculated over absolute returns (correlation coefficient equals 0.5876). This result may suggest that the more “Big fishes” in the market, the more likely the emergence of volatility clusters. In summary the fact that we observe significant correlations in that case indicates that finding the appropriate proportion of Big fishes versus Small fishes might be a route to quantitatively fitting stylized facts observed on real markets by using ZITs. However, we were unable in our simulations to find the “ideal mix” which would have led to the perfect emergence of the whole set of studied stylized facts. In fact it seems a rather impossible task to fit most of the stylized fact by varying only one parameter. Moreover, the point is not to find the proportion which would fit these stylized facts but rather to show that within a reasonable range corresponding to what is commonly observed on the market, this is not the case. ii) Limit / Market orders From the same Table 5, one can notice that the proportion of “limit” to “market” orders has a significant effect on most of the studied stylized facts except the Skewness, the Kurtosis and the volume/volatility relationship. Concerning the standard deviation (SD), the correlation coefficient is close to −1. This result was expected: on the one hand, traders supply liquidity by posting limit orders and, on the other hand, demand liquidity by submitting market orders that yield immediate partial or full execution. Thus, a large proportion of limit orders provides an important liquidity on both sides of the order book (Bids and Asks). On the contrary, a market order is immediately executed against order(s) standing in the limit order book: it moves the market by walking up or down the limit order book. Clearly, the proportion of Limit/Market orders has a significant impact on market volatility. A higher proportion of limit orders stabilizes the market by decreasing the standard deviation (correlation coefficient = −0.9759). The negative coefficient for ρ1 (−0.7495) suggests that the more Limit orders, the lower the auto-correlation of raw returns: this is a well-known fact related to the BidAsk bounce. To summarize, some important stylized facts (Skewness, Kurtosis and volume/volatility relationship) seem to be insensitive to the modifications of the ratio Limit to Market orders. This indicates that fitting with accuracy real-world stylized facts is probably out of reach using this ratio alone. iii) Patient/Impatient agents One can observe that increasing the number of these two categories of agents has a similar impact on various stylized facts as the variation of the proportion between Limit and Market orders does (see Table 5). This result was expected since “patient” agents provide liquidity while “impatient” agents demand liquidity.

123

244

O. Brandouy et al.

Table 6 Regression results for effects of model parameters on stylized facts Model

Statistics

1

Mean

2 3 4 5 6

SD Skewness Kurtosis ρ1 ρ2

7

Slope

8

φˆ

α

β1

β2

R2 0.301

1.859E−4

1.573E−8

−2.137E−6