Applying Dynamic Training‐Subset Selection Methods Using Genetic Programming for Forecasting Implied Volatility

Volatility is a key variable in option pricing, trading, and hedging strategies. The purpose of this article is to improve the accuracy of forecasting implied volatility using an extension of genetic programming (GP) by means of dynamic training‐subset selection methods. These methods manipulate the training data in order to improve the out‐of‐sample patterns fitting. When applied with the static subset selection method using a single training data sample, GP could generate forecasting models, which are not adapted to some out‐of‐sample fitness cases. In order to improve the predictive accuracy of generated GP patterns, dynamic subset selection methods are introduced to the GP algorithm allowing a regular change of the training sample during evolution. Four dynamic training‐subset selection methods are proposed based on random, sequential, or adaptive subset selection. The latest approach uses an adaptive subset weight measuring the sample difficulty according to the fitness cases' errors. Using real data from S&P500 index options, these techniques are compared with the static subset selection method. Based on mean squared error total and percentage of non‐fitted observations, results show that the dynamic approach improves the forecasting performance of the generated GP models, especially those obtained from the adaptive‐random training‐subset selection method applied to the whole set of training samples.

[1]  R. C. Merton,et al.  Theory of Rational Option Pricing , 2015, World Scientific Reference on Contingent Claims Analysis in Corporate Finance.

[2]  Tony Wong,et al.  An engineering approach to forecast volatility of financial indices , 2007 .

[3]  F. Hemez,et al.  Identification of response surface models using genetic programming , 2006 .

[4]  Edmund K. Burke,et al.  On improving genetic programming for symbolic regression , 2005, 2005 IEEE Congress on Evolutionary Computation.

[5]  Oliver Masutti,et al.  Genetic Programming with Syntactic Restrictions Applied to Financial Volatility Forecasting , 2001 .

[6]  Jeff Fleming The quality of market volatility forecasts implied by S&P 100 index option prices , 1998 .

[7]  F. Black,et al.  The Pricing of Options and Corporate Liabilities , 1973, Journal of Political Economy.

[8]  Christopher J. Neely,et al.  USING A GENETIC PROGRAM TO PREDICT EXCHANGE RATE VOLATILITY , 2002 .

[9]  Richard J. Povinelli,et al.  Time series data mining: identifying temporal patterns for characterization and prediction of time series events , 1999 .

[10]  Sophia Blau,et al.  Numerical Optimization Of Computer Models , 2016 .

[11]  Thomas W. Miller,et al.  The Forecast Quality of Cboe Implied Volatility Indexes , 2003 .

[12]  Richard J. Rendleman,et al.  STANDARD DEVIATIONS OF STOCK PRICE RATIOS IMPLIED IN OPTION PRICES , 1976 .

[13]  Campbell R. Harvey,et al.  Market volatility prediction and the efficiency of the S & P 100 index option market , 1992 .

[14]  Mihir Sen,et al.  Heat transfer correlations by symbolic regression , 2006 .

[15]  Sana Ben Hamida,et al.  Selecting the Best Forecasting-Implied Volatility Model Using Genetic Programming , 2009, Adv. Decis. Sci..

[16]  M. O'Neill,et al.  Genetic Programming for Dynamic Environments , 2007 .

[17]  Donald P. Chiras,et al.  The information content of option prices and a test of market efficiency , 1978 .

[18]  Peter Nordin,et al.  An On-Line Method to Evolve Behavior and to Control a Miniature Robot in Real Time with Genetic Programming , 1996, Adapt. Behav..

[19]  Mark J. Willis,et al.  Using a tree structured genetic algorithm to perform symbolic regression , 1995 .

[20]  Peter Ross,et al.  Small Populations over Many Generations can beat Large Populations over Few Generations in Genetic P , 1997 .

[21]  Christopher Gathercole,et al.  An investigation of supervised learning in genetic programming , 1998 .

[22]  陳樹衡,et al.  Using Genetic Programming to Model Volatility in Financial Time Series , 1997 .

[23]  R. Baillie,et al.  Fractionally integrated generalized autoregressive conditional heteroskedasticity , 1996 .

[24]  Peter Ross,et al.  Dynamic Training Subset Selection for Supervised Learning in Genetic Programming , 1994, PPSN.

[25]  Wolfgang Banzhaf,et al.  Genetic Programming: An Introduction , 1997 .

[26]  M. Dacorogna,et al.  Volatilities of different time resolutions — Analyzing the dynamics of market components , 1997 .

[27]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[28]  Tzu-Wen Kuo,et al.  Genetic Programming and Financial Trading: How Much About "What We Know" , 2008 .

[29]  Maarten Keijzer,et al.  Scaled Symbolic Regression , 2004, Genetic Programming and Evolvable Machines.

[30]  T. Bollerslev,et al.  Generalized autoregressive conditional heteroskedasticity , 1986 .

[31]  Tony Wong,et al.  Volatility forecasting using time series data mining and evolutionary computation techniques , 2007, GECCO '07.

[32]  Vidroha Debroy,et al.  Genetic Programming , 1998, Lecture Notes in Computer Science.

[33]  Manfred Gilli,et al.  Heuristic optimisation in financial modelling , 2010, Annals of Operations Research.

[34]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[35]  Campbell R. Harvey,et al.  S&P 100 Index Option Volatility , 1991 .

[36]  Wolfgang Banzhaf,et al.  Dynamic Subset Selection Based on a Fitness Case Topology , 2004, Evolutionary Computation.

[37]  木村 正行,et al.  Robustness of robot programs generated by genetic programming , 1996 .

[38]  Stephen Taylor,et al.  Forecasting S&P 100 Volatility: The Incremental Information Content of Implied Volatilities and High Frequency Index Returns , 2000 .