Competing Process Hazard Function Models for Player Ratings in Ice Hockey

Evaluating the overall ability of players in the National Hockey League (NHL) is a difficult task. Existing methods such as the famous "plus/minus" statistic have many shortcomings. Standard linear regression methods work well when player substitutions are relatively uncommon and scoring events are relatively common, such as in basketball, but as neither of these conditions exists for hockey, we use an approach that embraces the unique characteristics of the sport. We model the scoring rate for each team as its own semi-Markov process, with hazard functions for each process that depend on the players on the ice. This method yields offensive and defensive player ability ratings which take into account quality of teammates and opponents, the game situation, and other desired factors, that themselves have a meaningful interpretation in terms of game outcomes. Additionally, since the number of parameters in this model can be quite large, we make use of two different shrinkage methods depending on the question of interest: full Bayesian hierarchical models that partially pool parameters according to player position, and penalized maximum likelihood estimation to select a smaller number of parameters that stand out as being substantially different from average. We apply the model to all five-on-five (full-strength) situations for games in five NHL seasons.

[1]  A. P. Dawid,et al.  Selection paradoxes of Bayesian inference , 1994 .

[2]  Andrew C. Thomas The Impact of Puck Possession and Location on Ice Hockey Strategy , 2006 .

[3]  Mike Wright,et al.  Using a Markov process model of an association football match to determine the optimal timing of substitution and tactical decisions , 2002, J. Oper. Res. Soc..

[4]  Brian Macdonald A Regression-Based Adjusted Plus-Minus Statistic for NHL Players , 2010, 1006.4310.

[5]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[6]  Brian Macdonald Adjusted Plus-Minus for NHL Players using Ridge Regression with Goals, Shots, Fenwick, and Corsi , 2012, 1201.0317.

[7]  Shane T. Jensen,et al.  Estimating player contribution in hockey with regularized logistic regression , 2012, 1209.5026.

[8]  L. Brown In-season prediction of batting averages: A field test of empirical Bayes and Bayes methodologies , 2008, 0803.3697.

[9]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[10]  Michael E. Schuckers,et al.  National Hockey League Skater Ratings Based upon All On-Ice Events: An Adjusted Minus/Plus Probability (AMPP) Approach , 2011 .

[11]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[12]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[13]  Patrick D. Larkey,et al.  Bridging Different Eras in Sports , 1999 .

[14]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[15]  J. Sill Improved NBA Adjusted +/- Using Regularization and Out-of-Sample Testing , 2010 .

[16]  Brian Macdonald,et al.  An Expected Goals Model for Evaluating NHL Teams and Players , 2012 .

[17]  Qing Li,et al.  The Bayesian elastic net , 2010 .

[18]  Brian Macdonald,et al.  Adjusted Plus-Minus for NHL Players using , 2012 .

[19]  C ThomasAndrew,et al.  Inter-arrival Times of Goals in Ice Hockey , 2007 .

[20]  D. Cox Regression Models and Life-Tables , 1972 .

[21]  Donald B. Rubin,et al.  Validation of Software for Bayesian Models Using Posterior Quantiles , 2006 .

[22]  Andrew C. Thomas Inter-arrival Times of Goals in Ice Hockey , 2007 .

[23]  C. Stein,et al.  Estimation with Quadratic Loss , 1992 .

[24]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[25]  Chris Hans Elastic Net Regression Modeling With the Orthant Normal Prior , 2011 .

[26]  T. Swartz,et al.  Strategies for Pulling the Goalie in Hockey , 2010 .

[27]  D.,et al.  Regression Models and Life-Tables , 2022 .