Comparing computational approaches to the analysis of high-frequency trading data using Bayesian methods

Financial prices are usually modelled as continuous, often involving geometric Brownian motion with drift, leverage, and possibly jump components. An alternative modelling approach allows financial observations to take discrete values when they are interpreted as integer multiples of a fixed quantity, the ticksize, the monetary value associated with a single change in the asset evolution. These samples are usually collected at very high frequency, exhibiting diverse trading operations per seconds. In this context, the observables are modelled in two different ways: on one hand, via the Skellam process, defined as the difference between two independent Poisson processes; on the other, using a stochastic process whose conditional law is that of a mixture of Geometric distributions. The parameters of the two stochastic processes modelled as functions of a stochastic volatility process, which is in turn described by a discretised Gaussian Ornstein-Uhlenbeck AR(1) process. The work will present, at first, a parametric model for independent and identically distributed data, in order to motivate the algorithmic choices used as a basis for the next Chapters. These include adaptive Metropolis-Hastings algorithms, and Interweaving Strategy. The central Chapters of the work are devoted to the illustration of Particle Filtering methods for MCMC posterior computations (or PMCMC methods). The discussion starts by presenting the existing Particle Gibbs and the Particle Marginal Metropolis-Hastings samplers. Additionally, we propose two extensions to the existing methods. Posterior inference and out-of-sample prediction obtained with the different methodologies is discussed, and compared to the methodologies existing in the literature. To allow for more flexibility in the modelling choices, the work continues with a presentation of a semi-parametric version of the original model. Comparative inference obtained via the previously discussed methodologies is presented. The work concludes with a summary and an account of topics for further research.

[1]  N. Shephard,et al.  Basics of Levy processes , 2012 .

[2]  Mohamed Alosh,et al.  An integer-valued pth-order autoregressive structure (INAR(p)) process , 1990, Journal of Applied Probability.

[3]  J. Griffin,et al.  Sampling Returns for Realized Variance Calculations: Tick Time or Transaction Time? , 2006 .

[4]  S. Geisser,et al.  A Predictive Approach to Model Selection , 1979 .

[5]  Jun S. Liu,et al.  Monte Carlo strategies in scientific computing , 2001 .

[6]  Michael,et al.  On a Class of Bayesian Nonparametric Estimates : I . Density Estimates , 2008 .

[7]  Jim E. Griffin,et al.  An adaptive truncation method for inference in Bayesian nonparametric models , 2013, Statistics and Computing.

[8]  Fredrik Lindsten,et al.  Particle gibbs with ancestor sampling , 2014, J. Mach. Learn. Res..

[9]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[10]  Heikki Haario,et al.  Componentwise adaptation for high dimensional MCMC , 2005, Comput. Stat..

[11]  Fredrik Lindsten,et al.  Ancestor Sampling for Particle Gibbs , 2012, NIPS.

[12]  Heikki Haario,et al.  Adaptive proposal distribution for random walk Metropolis algorithm , 1999, Comput. Stat..

[13]  C. Andrieu,et al.  The pseudo-marginal approach for efficient Monte Carlo computations , 2009, 0903.5480.

[14]  Fredrik Lindsten,et al.  On the use of backward simulation in the particle Gibbs sampler , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Radford M. Neal Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[16]  S. Heston A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options , 1993 .

[17]  N. Shephard,et al.  Power and bipower variation with stochastic volatility and jumps , 2003 .

[18]  Yaming Yu,et al.  To Center or Not to Center: That Is Not the Question—An Ancillarity–Sufficiency Interweaving Strategy (ASIS) for Boosting MCMC Efficiency , 2011 .

[19]  J. Rosenthal,et al.  Coupling and Ergodicity of Adaptive Markov Chain Monte Carlo Algorithms , 2007, Journal of Applied Probability.

[20]  J. Griffin The Ornstein–Uhlenbeck Dirichlet process and other time-varying processes for Bayesian nonparametric inference , 2011 .

[21]  N. Shephard Realized power variation and stochastic volatility models , 2003 .

[22]  N. Shephard,et al.  Econometric analysis of realised volatility and its use in estimating stochastic volatility models , 2000 .

[23]  M. Pitt,et al.  Filtering via Simulation: Auxiliary Particle Filters , 1999 .

[24]  M. Abramowitz,et al.  Handbook of Mathematical Functions With Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series No. 55) , 1965 .

[25]  P. Müller,et al.  Bayesian Nonparametrics: An invitation to Bayesian nonparametrics , 2010 .

[26]  Lancelot F. James,et al.  Gibbs Sampling Methods for Stick-Breaking Priors , 2001 .

[27]  Jim E. Griffin,et al.  Stick-breaking autoregressive processes , 2011 .

[28]  F. Quintana,et al.  Bayesian clustering and product partition models , 2003 .

[29]  Makoto Takahashi,et al.  Estimating stochastic volatility models using daily returns and realized volatility simultaneously , 2007, Comput. Stat. Data Anal..

[30]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[31]  Neil Shephard,et al.  Designing Realised Kernels to Measure the Ex-Post Variation of Equity Prices in the Presence of Noise , 2008 .

[32]  J. Strackee,et al.  The frequency distribution of the difference between two Poisson variates , 1962 .

[33]  Mohamed Alosh,et al.  FIRST‐ORDER INTEGER‐VALUED AUTOREGRESSIVE (INAR(1)) PROCESS , 1987 .

[34]  Christophe Andrieu,et al.  A tutorial on adaptive MCMC , 2008, Stat. Comput..

[35]  Gregor Kastner,et al.  Ancillarity-sufficiency interweaving strategy (ASIS) for boosting MCMC estimation of stochastic volatility models , 2014, Comput. Stat. Data Anal..

[36]  P. Green,et al.  Bayesian Model-Based Clustering Procedures , 2007 .

[37]  N. Shephard,et al.  Integer-valued Lévy processes and low latency financial econometrics , 2012 .

[38]  E. McKenzie,et al.  Autoregressive moving-average processes with negative-binomial and geometric marginal distributions , 1986, Advances in Applied Probability.

[39]  A. Doucet,et al.  Particle Markov chain Monte Carlo methods , 2010 .

[40]  J. Hull Options, Futures, and Other Derivatives , 1989 .

[41]  L. Holden,et al.  Adaptive independent Metropolis–Hastings , 2009, 0903.0483.

[42]  S. Taylor Financial Returns Modelled by the Product of Two Stochastic Processes , 1961 .

[43]  Michael McAleer,et al.  Realized Volatility: A Review , 2008 .

[44]  E GriffinJim,et al.  Advances in Markov chain Monte Carlo , 2013 .

[45]  C. Liu,et al.  Forecasting realized volatility: a Bayesian model‐averaging approach , 2009 .

[46]  Jim E. Griffin,et al.  Inference in Infinite Superpositions of Non-Gaussian Ornstein--Uhlenbeck Processes Using Bayesian Nonparametic Methods , 2011 .