Regression models for forecasting goals and match results in association football

Abstract In the previous literature, two approaches have been used to model match outcomes in association football (soccer): first, modelling the goals scored and conceded by each team; and second, modelling win–draw–lose match results directly. There have been no previous attempts to compare the forecasting performance of these two types of model. This paper aims to fill this gap. Bivariate Poisson regression is used to estimate forecasting models for goals scored and conceded. Ordered probit regression is used to estimate forecasting models for match results. Both types of models are estimated using the same 25-year data set on English league football match outcomes. The best forecasting performance is achieved using a ‘hybrid’ specification, in which goals-based team performance covariates are used to forecast win–draw–lose match results. However, the differences between the forecasting performance of models based on goals data and models based on results data appear to be relatively small.

[1]  Robert Simmons,et al.  Making up the results: the work of the football pools panel, 1963-1997 , 2000 .

[2]  M. Maher Modelling association football scores , 1982 .

[3]  John Goddard,et al.  Forecasting football results and the efficiency of fixed‐odds betting , 2004 .

[4]  S. Coles,et al.  Modelling Association Football Scores and Inefficiencies in the Football Betting Market , 1997 .

[5]  Mark J. Dixon,et al.  A birth process model for association football matches , 1998 .

[6]  D. Karlis,et al.  Analysis of sports data by using bivariate Poisson models , 2003 .

[7]  Mark J. Dixon,et al.  The value of statistical forecasts in the UK association football betting market , 2004 .

[8]  David Law,et al.  The Favourite-Longshot Bias and Market Efficiency in UK Football Betting , 2000 .

[9]  Håvard Rue,et al.  Prediction and retrospective analysis of soccer matches in a league , 2000 .

[10]  Robert Simmons,et al.  Forecasting sport: the behaviour and performance of football tipsters , 2000 .

[11]  Tim Kuypers,et al.  Information and efficiency: an empirical study of a fixed odds betting market , 2000 .

[12]  Paul A. Johnson A test of the normality assumption in the ordered probit model , 1996 .

[13]  Ruud H. Koning,et al.  Balance in competition in Dutch soccer , 2000 .

[14]  Stephen R. Clarke,et al.  Home ground advantage of individual clubs in English soccer , 1995 .

[15]  G. Ridder,et al.  Down to Ten: Estimating the Effect of a Red Card in Soccer , 1994 .

[16]  Paul Glewwe,et al.  A test of the normality assumption in ordered probit model , 1997 .

[17]  Mark J. Dixon,et al.  Dynamic modelling and prediction of English Football League matches for betting , 2002 .

[18]  T. Palva,et al.  Pages 1-19 , 2001 .

[19]  David Dyte,et al.  A ratings based Poisson model for World Cup soccer simulation , 2000, J. Oper. Res. Soc..

[20]  P. Holgate Estimation for the bivariate Poisson distribution , 1964 .

[21]  Vic Barnett,et al.  The Effect of an Artificial Pitch Surface on Home Team Performance in Football (Soccer) , 1993 .

[22]  Stephen Dobson,et al.  The impact of managerial change on team performance in professional sports , 2002 .