Identifying winners of competitive events: A SVM-based classification model for horserace prediction

The aim of much horserace modelling is to appraise the informational efficiency of betting markets. The prevailing approach involves forecasting the runners' finish positions by means of discrete or continuous response regression models. However, theoretical considerations and empirical evidence suggest that the information contained within finish positions might be unreliable, especially among minor placings. To alleviate this problem, a classification-based modelling paradigm is proposed which relies only on data distinguishing winners and losers. To assess its effectiveness, an empirical experiment is conducted using data from a UK racetrack. The results demonstrate that the classification-based model compares favourably with state-of-the-art alternatives and confirm the reservations of relying on rank ordered finishing data. Simulations are conducted to further explore the origin of the model's success by evaluating the marginal contribution of its constituent parts.

[1]  Leighton Vaughan Williams Information Efficiency in Financial and Betting Markets , 2009 .

[2]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[3]  Steven D. Levitt,et al.  Why are Gambling Markets Organized so Differently from Financial Markets? , 2004 .

[4]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[5]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[6]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[7]  David Edelman,et al.  Adapting support vector machine methods for horserace odds prediction , 2003, Ann. Oper. Res..

[8]  J. J. Kelly A new interpretation of information rate , 1956 .

[9]  Ruth N. Bolton,et al.  Searching for positive returns at the track: a multinomial logic model for handicapping horse races , 1986 .

[10]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[11]  Johnnie E.V. Johnson,et al.  Investigating the roots of the favourite–longshot bias: an analysis of decision making by supply- and demand-side agents in parallel betting markets , 2000 .

[12]  W. F. Benter Computer-Based Horse Race Handicapping and Wagering Systems , 2008 .

[13]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[14]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[15]  Raymond D. Sauer,et al.  The Economics of Wagering Markets , 1998 .

[16]  Leighton Vaughan Williams,et al.  Information Efficiency in Betting Markets: A Survey , 1999 .

[17]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[18]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[19]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[20]  P. Zarembka Frontiers in econometrics , 1973 .

[21]  Ming-Chien Sung,et al.  Information Efficiency in Financial and Betting Markets: Searching for semi-strong form inefficiency in the UK racetrack betting market , 2005 .

[22]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[23]  Johan A. K. Suykens,et al.  Benchmarking Least Squares Support Vector Machine Classifiers , 2004, Machine Learning.

[24]  Randall G. Chapaaan,et al.  Exploiting Rank Ordered Choice Set Data within the Stochastic Utility Model , 1982 .

[25]  Stephen Figlewski Subjective Information and Market Efficiency in a Betting Market , 1979, Journal of Political Economy.

[26]  S. Sathiya Keerthi,et al.  A Fast Dual Algorithm for Kernel Logistic Regression , 2002, 2007 International Joint Conference on Neural Networks.

[27]  W. Ziemba,et al.  Transactions Costs, Extent of Inefficiencies, Entries and Multiple Wagers in a Racetrack Betting Model , 1985 .

[28]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[29]  William T. Ziemba,et al.  Efficiency of Racetrack Betting Markets , 2008 .

[30]  Owen Jones,et al.  Exploring Decision Makers' Use of Price Information in a Speculative Market , 2006, Manag. Sci..

[31]  David Law,et al.  Insider Trading, Herding Behaviour and Market Plungers in the British Horse-Race Betting Market , 2002 .

[32]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[33]  Kristof Coussement,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparing Two Parameter-selection Techniques Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparin , 2022 .

[34]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[35]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[36]  Thorsten Joachims,et al.  A support vector method for multivariate performance measures , 2005, ICML.

[37]  Klaus Obermayer,et al.  Support vector learning for ordinal regression , 1999 .

[38]  Adi Schnytzer,et al.  Inside Information in a Betting Market , 1995 .

[39]  Richard B. Westin,et al.  Transferability of disaggregate mode choice models , 1975 .

[40]  David J. Curry,et al.  Prediction in Marketing Using the Support Vector Machine , 2005 .

[41]  Hsuan-Tien Lin A Study on Sigmoid Kernels for SVM and the Training of non-PSD Kernels by SMO-type Methods , 2005 .