Ensemble Methods for Jump-Diffusion Models of Power Prices

We propose a machine learning-based methodology which makes use of ensemble methods with the aims (i) of treating missing data in time series with irregular observation times and detecting anomalies in the observed time behavior; (ii) of defining suitable models of the system dynamics. We applied this methodology to US wholesale electricity price time series that are characterized by missing data, high and stochastic volatility, jumps and pronounced spikes. For missing data, we provide a repair approach based on the missForest algorithm, an imputation algorithm which is completely agnostic about the data distribution. To identify anomalies, i.e., turbulent movements of power prices in which jumps and spikes are observed, we took into account the no-gap reconstructed electricity price time series, and then we detected anomalous regions using the isolation forest algorithm, an anomaly detection method that isolates anomalies instead of profiling normal data points as in the most common techniques. After removing anomalies, the additional gaps will be newly filled by the missForest imputation algorithm. In this way, a complete and clean time series describing the stable dynamics of power prices can be obtained. The decoupling between the stable motion and the turbulent motion allows us to define suitable jump-diffusion models of power prices and to provide an estimation procedure that uses the full information contained in both the stable and the turbulent dynamics.

[1]  Jinliang Zhang,et al.  Day-ahead electricity price forecasting by a new hybrid method , 2012, Comput. Ind. Eng..

[2]  Pablo Laguna,et al.  Improved heart rate variability signal analysis from the beat occurrence times according to the IPFM model , 2000, IEEE Transactions on Biomedical Engineering.

[3]  K. French Stock returns and the weekend effect , 1980 .

[4]  Zhi-Hua Zhou,et al.  Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[5]  P. Tankov,et al.  MULTI-FACTOR JUMP-DIFFUSION MODELS OF ELECTRICITY PRICES , 2008 .

[6]  Dimitris Bertsimas,et al.  From Predictive Methods to Missing Data Imputation: An Optimization Approach , 2017, J. Mach. Learn. Res..

[7]  D. Duffie,et al.  Simulated Moments Estimation of Markov Models of Asset Prices , 1990 .

[8]  Rahim Mahmoudvand,et al.  Missing value imputation in time series using Singular Spectrum Analysis , 2016 .

[9]  J. Marrero,et al.  Comparison of imputation methods for missing laboratory data in medicine , 2013, BMJ Open.

[10]  Wim Sweldens,et al.  The lifting scheme: a construction of second generation wavelets , 1998 .

[11]  Chen Wang,et al.  Time Series Data Imputation: A Survey on Deep Learning Approaches , 2020, ArXiv.

[12]  Irma J. Terpenning,et al.  STL : A Seasonal-Trend Decomposition Procedure Based on Loess , 1990 .

[13]  Peter Tiño,et al.  Uncovering delayed patterns in noisy and irregularly sampled time series: An astronomy application , 2009, Pattern Recognit..

[14]  Małgorzata Misztal,et al.  Some Remarks on the Data Imputation Using "missForest" Method , 2013 .

[15]  Robert F. Mudde,et al.  Estimation of turbulence power spectra for bubbly flows from Laser Doppler Anemometry signals , 2005 .

[16]  Peter Bühlmann,et al.  MissForest - non-parametric missing value imputation for mixed-type data , 2011, Bioinform..

[17]  Richard J. Povinelli,et al.  Data Improving in Time Series Using ARX and ANN Models , 2017, IEEE Transactions on Power Systems.

[18]  Grant Foster,et al.  Wavelets for period analysis of unevenly sampled time series , 1996 .

[19]  R. Weron Electricity price forecasting: A review of the state-of-the-art with a look into the future , 2014 .

[20]  Robert Jenssen,et al.  Time Series Cluster Kernel for Learning Similarities between Multivariate Time Series with Missing Data , 2017, Pattern Recognit..

[21]  Carole Thiebaut,et al.  Time-Scale and Time-Frequency Analyses of Irregularly Sampled Astronomical Time Series , 2005, EURASIP J. Adv. Signal Process..

[22]  Sm Robeson Influence of spatial sampling and interpolation on estimates of air temperature change , 1994 .

[23]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[24]  Eduardo S. Schwartz,et al.  Electricity Prices and Power Derivatives: Evidence from the Nordic Power Exchange , 2000 .

[25]  H. Geman,et al.  Understanding the Fine Structure of Electricity Prices , 2004 .

[26]  E Afrifa-Yamoah,et al.  Missing data imputation of high‐resolution temporal climate time series data , 2020, Meteorological Applications.

[27]  J. Kurths,et al.  Comparison of correlation analysis techniques for irregularly sampled time series , 2011 .

[28]  Zhiwen Yu,et al.  End-to-End Incomplete Time-Series Modeling From Linear Memory of Latent Variables , 2020, IEEE Transactions on Cybernetics.

[29]  James D. Hamilton A New Approach to the Economic Analysis of Nonstationary Time Series and the Business Cycle , 1989 .

[30]  D. Percival,et al.  Wavelet variance analysis for gappy time series , 2010 .

[31]  David M Kreindler,et al.  The effects of the irregular sample and missing data in time series analysis. , 2006, Nonlinear dynamics, psychology, and life sciences.

[32]  R. Huisman,et al.  Regime Jumps in Electricity Prices , 2001 .

[33]  Carlo Mari Short-term stochastic movements of electricity prices and long-term investments in power generating technologies , 2021, Energy Systems.

[34]  P. Broersen Spectral Analysis of Irregularly Sampled Data with Time Series Models , 2009 .

[35]  M. Kegnenlezom,et al.  A jump-diffusion model for pricing electricity under price-cap regulation , 2019, Mathematical Sciences.

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  Thomas G. Dietterich,et al.  A Meta-Analysis of the Anomaly Detection Problem , 2015 .

[38]  Á. Cartea,et al.  Pricing in Electricity Markets: A Mean Reverting Jump Diffusion Model with Seasonality , 2005 .

[39]  Benjamin Guedj,et al.  Pycobra: A Python Toolbox for Ensemble Learning and Visualisation , 2017, J. Mach. Learn. Res..

[40]  A. Gelman Method of Moments Using Monte Carlo Simulation , 1995 .

[41]  Carlo Mari Regime-switching characterization of electricity prices dynamics , 2006 .

[42]  R. Weron,et al.  Modelling Electricity Prices: Jump Diffusion and Regime Switching , 2004 .

[43]  Michael Schulz,et al.  Spectrum: spectral analysis of unevenly spaced paleoclimatic time series , 1997 .

[44]  D. McFadden A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration , 1989 .

[45]  Idris A. Eckley,et al.  A wavelet-based approach for imputation in nonstationary multivariate time series , 2021, Statistics and Computing.

[46]  S. Vicente‐Serrano,et al.  Gap Filling of Monthly Temperature Data and Its Effect on Climatic Variability and Trends , 2019, Journal of Climate.

[47]  J. Carpenter,et al.  Practice of Epidemiology Comparison of Random Forest and Parametric Imputation Models for Imputing Missing Data Using MICE: A CALIBER Study , 2014 .