Case Study in Evaluating Time Series Prediction Models Using the Relative Mean Absolute Error

ABSTRACT Statistical prediction models inform decision-making processes in many real-world settings. Prior to using predictions in practice, one must rigorously test and validate candidate models to ensure that the proposed predictions have sufficient accuracy to be used in practice. In this article, we present a framework for evaluating time series predictions, which emphasizes computational simplicity and an intuitive interpretation using the relative mean absolute error metric. For a single time series, this metric enables comparisons of candidate model predictions against naïve reference models, a method that can provide useful and standardized performance benchmarks. Additionally, in applications with multiple time series, this framework facilitates comparisons of one or more models’ predictive performance across different sets of data. We illustrate the use of this metric with a case study comparing predictions of dengue hemorrhagic fever incidence in two provinces of Thailand. This example demonstrates the utility and interpretability of the relative mean absolute error metric in practice, and underscores the practical advantages of using relative performance metrics when evaluating predictions.

[1]  Yihui Xie,et al.  Dynamic Documents with R and knitr, Second Edition , 2015 .

[2]  Yihui Xie,et al.  Dynamic Documents with R and knitr , 2015 .

[3]  F. Ellis McKenzie,et al.  Influenza Forecasting in Human Populations: A Scoping Review , 2014, PloS one.

[4]  Phillip T. Koshute,et al.  Prediction of High Incidence of Dengue in the Philippines , 2014, PLoS neglected tropical diseases.

[5]  T. Scott,et al.  The Complex Relationship between Weather and Dengue Virus Transmission in Thailand , 2013, The American journal of tropical medicine and hygiene.

[6]  Alicia Karspeck,et al.  Real-Time Influenza Forecasts during the 2012–2013 Season , 2013, Nature Communications.

[7]  Dylan B. George,et al.  Big Data Opportunities for Global Infectious Disease Surveillance , 2013, PLoS medicine.

[8]  J. Shaman,et al.  Forecasting seasonal outbreaks of influenza , 2012, Proceedings of the National Academy of Sciences.

[9]  Anna L. Buczak,et al.  A data-driven epidemiological prediction method for dengue outbreaks using local and remote sensing data , 2012, BMC Medical Informatics and Decision Making.

[10]  J. Rocklöv,et al.  Forecast of Dengue Incidence Using Temperature and Rainfall , 2012, PLoS neglected tropical diseases.

[11]  L. Held,et al.  Modeling seasonality in space‐time infectious disease surveillance data , 2012, Biometrical journal. Biometrische Zeitschrift.

[12]  S. Wood Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models , 2011 .

[13]  L. Held,et al.  Predictive Model Assessment for Count Data , 2009, Biometrics.

[14]  Michael A. Johansson,et al.  Multiyear Climate Variability and Dengue—El Niño Southern Oscillation, Weather, and Dengue Incidence in Puerto Rico, Mexico, and Thailand: A Longitudinal Data Analysis , 2009, PLoS medicine.

[15]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[16]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[17]  J. Grego,et al.  Fast stable direct fitting and smoothness selection for generalized additive models , 2006, 0709.3906.

[18]  Spyros Makridakis,et al.  The M3-Competition: results, conclusions and implications , 2000 .

[19]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[20]  Fred L. Collopy,et al.  Error Measures for Generalizing About Forecasting Methods: Empirical Comparisons , 1992 .

[21]  A. H. Murphy,et al.  Skill Scores Based on the Mean Square Error and Their Relationships to the Correlation Coefficient , 1988 .

[22]  THE WORLD HEALTH ORGANIZATION , 1954 .

[23]  Thailand. Samnakngān Khana Kammakān Phatthanākān Sētthakit l Chāt Preliminary report : the 2010 Population and Housing Census , 2011 .

[24]  S. Zeger,et al.  Markov regression models for time series: a quasi-likelihood approach. , 1988, Biometrics.