A multiple support vector machine approach to stock index forecasting with mixed frequency sampling

Abstract The independent variables commonly used to predict the stock price index usually contain data sampled at different frequencies, and simultaneously, there exist multiple outputs. However, most current researches ignore different frequencies among independent variables and multi-output issues. This paper proposes a multiple output support vector machine unrestricted mixed data sampling (MSVM-UMIDAS) approach – which can achieve multiple results for sequential points simultaneously by applying mixed frequency independent variables. We test the in-sample and out-of-sample performances of MSVM-UMIDAS for stock forecasting in terms of (t−1), (t−2) and (t−3) and then compare the performances of the proposed model with those of other models. The results indicate that our model performs better when assessed by four different measurements. Thus, our proposed model is more realistic in practice and an appropriate tool for multi-output and mixed frequency issues for stock price forecasting.

[1]  Luis Alonso,et al.  Multioutput Support Vector Regression for Remote Sensing Biophysical Parameter Estimation , 2011, IEEE Geoscience and Remote Sensing Letters.

[2]  Amir Abbas Najafi,et al.  Polar support vector machine: Single and multiple outputs , 2016, Neurocomputing.

[3]  Zhongyi Hu,et al.  Multiple-output support vector regression with a firefly algorithm for interval-valued stock price index forecasting , 2014, Knowl. Based Syst..

[4]  Ching-Chiang Yeh,et al.  The use of hybrid manifold learning and support vector machines in the prediction of business failure , 2011, Knowl. Based Syst..

[5]  Luc De Raedt,et al.  Top-Down Induction of Clustering Trees , 1998, ICML.

[6]  Arash Ghanbari,et al.  Integration of genetic fuzzy systems and artificial neural networks for stock price forecasting , 2010, Knowl. Based Syst..

[7]  Ana Beatriz Galvão,et al.  Changes in predictive ability with mixed frequency data , 2013 .

[8]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[9]  Angelos Kanas,et al.  Non‐linear forecasts of stock returns , 2003 .

[10]  Tapio Elomaa,et al.  Multi-target regression with rule ensembles , 2012, J. Mach. Learn. Res..

[11]  Grigorios Tsoumakas,et al.  Multi-target Regression via Random Linear Target Combinations , 2014, ECML/PKDD.

[12]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[13]  Einoshin Suzuki,et al.  Bloomy Decision Tree for Multi-objective Classification , 2001, PKDD.

[14]  E. Ghysels,et al.  Série Scientifique Scientific Series Predicting Volatility: Getting the Most out of Return Data Sampled at Different Frequencies , 2022 .

[15]  Gwo-Hshiung Tzeng,et al.  Combining VIKOR-DANP model for glamor stock selection and stock performance improvement , 2014, Knowl. Based Syst..

[16]  J. Friedman,et al.  Predicting Multivariate Responses in Multiple Linear Regression , 1997 .

[17]  Jiahai Wang,et al.  Financial time series prediction using a dendritic neuron model , 2016, Knowl. Based Syst..

[18]  Fernando Pérez-Cruz,et al.  SVM multiregression for nonlinear channel estimation in multiple-input multiple-output systems , 2004, IEEE Transactions on Signal Processing.

[19]  Muhammad Tanveer,et al.  An efficient regularized K-nearest neighbor based weighted twin support vector regression , 2016, Knowl. Based Syst..

[20]  Massimiliano Marcellino,et al.  Midas Vs. Mixed-Frequency VAR: Nowcasting GDP in the Euro Area , 2009 .

[21]  Ming Zeng,et al.  A generalized Mitchell-Dem'yanov-Malozemov algorithm for one-class support vector machine , 2016, Knowl. Based Syst..

[22]  Kimon P. Valavanis,et al.  Surveying stock market forecasting techniques - Part II: Soft computing methods , 2009, Expert Syst. Appl..

[23]  Ping-Feng Pai,et al.  A support vector machine-based model for detecting top management fraud , 2011, Knowl. Based Syst..

[24]  Farrukh Javed,et al.  Importance of the macroeconomic variables for variance prediction: A GARCH-MIDAS approach , 2012 .

[25]  Bernard Zenko,et al.  Learning Classification Rules for Multiple Target Attributes , 2008, PAKDD.

[26]  Craig Hiemstra,et al.  Testing for Linear and Nonlinear Granger Causality in the Stock Price-Volume Relation , 1994 .

[27]  Michael P. Clements,et al.  Forecasting US output growth using leading indicators: an appraisal using MIDAS models , 2009 .

[28]  S. Džeroski,et al.  Using single- and multi-target regression trees and ensembles to model a compound index of vegetation condition , 2009 .

[29]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[30]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[31]  Michael P. Clements,et al.  Macroeconomic Forecasting With Mixed-Frequency Data , 2008 .

[32]  E. Ghysels,et al.  There is a Risk-Return Tradeoff after All , 2004 .

[33]  David G. McMillan,et al.  Non-linear forecasting of stock returns: Does volume help? , 2007 .

[34]  E. Ghysels,et al.  Why Do Absolute Returns Predict Volatility So Well , 2006 .

[35]  Michael D. Bradley,et al.  Forecasting with a nonlinear dynamic model of stock returns and industrial production , 2004 .

[36]  Sandra Stankiewicz,et al.  Forecasting GDP growth using mixed-frequency models with switching regimes , 2015 .

[37]  Gonzalo Rubio Irigoyen,et al.  The Relationship between Risk and Expected Return in Europe , 2005 .

[38]  Zhongyi Hu,et al.  Multi-step-ahead time series prediction using multiple-output support vector regression , 2014, Neurocomputing.

[39]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[40]  Ricardo de A. Araújo A robust automatic phase-adjustment method for financial forecasting , 2012, Knowl. Based Syst..

[41]  Nicholas Sarantis,et al.  Nonlinearities, cyclical behaviour and predictability in stock markets: international evidence , 2001 .

[42]  Massimiliano Marcellino,et al.  Unrestricted mixed data sampling (MIDAS): MIDAS regressions with unrestricted lag polynomials , 2015 .

[43]  Rich Caruana,et al.  Multitask Learning , 1997, Machine Learning.

[44]  Fernando Pérez-Cruz,et al.  Multi-dimensional Function Approximation and Regression Estimation , 2002, ICANN.

[45]  Bo Wang,et al.  When Ensemble Learning Meets Deep Learning: a New Deep Support Vector Machine for Classification , 2016, Knowl. Based Syst..

[46]  Eric Ghysels,et al.  News - Good or Bad - and its Impact on Volatility Predictions over Multiple Horizons , 2008 .

[47]  Eric Ghysels,et al.  Real-time forecasting of the US federal government budget: A simple mixed frequency data regression approach , 2015 .

[48]  Wentao Mao,et al.  A fast and robust model selection algorithm for multi-input multi-output support vector machine , 2014, Neurocomputing.

[49]  Eric Ghysels,et al.  Série Scientifique Scientific Series the Midas Touch: Mixed Data Sampling Regression Models the Midas Touch: Mixed Data Sampling Regression Models* , 2022 .

[50]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[51]  Jason Weston,et al.  Support vector machines for multi-class pattern recognition , 1999, ESANN.

[52]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .