Use of R as a Toolbox for Mathematical Statistics Exploration

The R language, a freely available environment for statistical computing and graphics is widely used in many fields. This “expert-friendly” system has a powerful command language and programming environment, combined with an active user community. We discuss how R is ideal as a platform to support experimentation in mathematical statistics, both at the undergraduate and graduate levels. Using a series of case studies and activities, we describe how R can be used in a mathematical statistics course as a toolbox for experimentation. Examples include the calculation of a running average, maximization of a nonlinear function, resampling of a statistic, simple Bayesian modeling, sampling from multivariate normal, and estimation of power. These activities, often requiring only a few dozen lines of code, offer students the opportunity to explore statistical concepts and experiment. In addition, they provide an introduction to the framework and idioms available in this rich environment.

[1]  William F. Eddy,et al.  Graduate Education in Computational Statistics , 1987 .

[2]  D. Harville Matrix Algebra From a Statistician's Perspective , 1998 .

[3]  John F. Monahan Teaching Statistical Computing at North Carolina State University , 2004 .

[4]  James E. Gentle Courses in Statistical Computing and Computational Statistics , 2004 .

[5]  Deborah Nolan,et al.  Stat Labs: Mathematical Statistics Through Applications , 2001 .

[6]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[7]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[8]  J. Fox Nonparametric Regression Appendix to An R and S-PLUS Companion to Applied Regression , 2002 .

[9]  Deborah Nolan,et al.  Teaching statistics theory through applications , 1999 .

[10]  J. Goodnight A Tutorial on the SWEEP Operator , 1979 .

[11]  Jenny A. Baglivo,et al.  Computer algebra systems: Maple and Mathematica , 1995 .

[12]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[13]  Brian D. Ripley,et al.  Modern Applied Statistics with S Fourth edition , 2002 .

[14]  Cleve B. Moler,et al.  Numerical computing with MATLAB , 2004 .

[15]  Kenneth Lange Computational Statistics and Optimization Theory at UCLA , 2004 .

[16]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[17]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[18]  Peter Dalgaard,et al.  Introductory statistics with R , 2002, Statistics and computing.

[19]  P. Diggle,et al.  Analysis of Longitudinal Data , 2003 .

[20]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[21]  William N. Venables,et al.  S Programming , 2000 .

[22]  U. Ligges Review of An R and S-PLUS companion to applied regression by J. Fox, Sage Publications, Thousand Oaks, California 2002 , 2003 .

[23]  Jenny A. Baglivo MathematicaLaboratories for Mathematical Statistics: Emphasizing Simulation and Computer Intensive Methods , 2005 .

[24]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[25]  N. Makris,et al.  A review and new report of medial temporal lobe dysfunction as a vulnerability indicator for schizophrenia: a magnetic resonance imaging morphometric family study of the parahippocampal gyrus. , 2003, Schizophrenia bulletin.

[26]  Helen Arnold,et al.  Hitchhiker's guide to the galaxy , 2006, SIGGRAPH '06.

[27]  Ronald A. Thisted,et al.  Elements of statistical computing , 1986 .

[28]  Margaret S. Pepe,et al.  Receiver Operating Characteristic Methodology , 2000 .

[29]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[30]  J. Rice Mathematical Statistics and Data Analysis , 1988 .

[31]  Eric R. Ziegel,et al.  Counterexamples in Probability and Statistics , 1986 .

[32]  P. Diggle,et al.  Analysis of Longitudinal Data. , 1997 .

[33]  Peter Colman,et al.  Analysis of Longitudinal Data(second edition) Diggle P, Heagarty P, Liang K-Y, Zeger S(2002)ISBN 0198524846; 396 pages;£40.00,$85.00 Oxford University Press; , 2004 .

[34]  Thomas L. Moore Teaching Statistics: Resources for Undergraduate Instructors , 2001 .