Paper information - Understanding Statistics by Using Spreadsheets to Generate and Analyze Samples


The random number and probability distribution functions in Excel allow the user to easily generate samples that simulate data typical of any kind of biomedical study. The act of generating the samples should provide the user with an implicit understanding of fundamental statistical concepts, including variables, probability, independence, sampling variation, linear modeling, random error, fixed effects, random effects, and individual responses. Analysis of the samples, which is essentially an attempt to recover the formulae that generated the samples, should reinforce these concepts and develop others related to statistical inference, including bias, confidence limits, statistical significance, and chances of benefit and harm. The spreadsheets accompanying this article provide examples of generation and analysis of data for reliability and validity studies and for simple and covariate-adjusted comparisons of group means without and with repeated measurement. An example is also given for generation of a binary variable for data simulating events, such as the occurrence of injuries, but the analysis by generalized linear modeling is currently not available in these spreadsheets.
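The generate-then-recover idea described in the abstract can be sketched outside a spreadsheet. The following is a minimal illustration in Python, not the method used in the accompanying spreadsheets: the function names, the parameter values, and the choice of a simple linear model with normally distributed random error are all assumptions made for this example. Here `rng.gauss` plays roughly the role that a combination such as `NORM.INV(RAND(), mean, sd)` would play in Excel.

```python
import random
import statistics


def generate_sample(n, intercept, slope, error_sd, seed=0):
    """Simulate n subjects from y = intercept + slope * x + random error.

    Hypothetical helper: the model and its parameters are illustrative only.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    xs = [rng.gauss(0.0, 1.0) for _ in range(n)]  # predictor variable
    ys = [intercept + slope * x + rng.gauss(0.0, error_sd) for x in xs]
    return xs, ys


def fit_line(xs, ys):
    """Recover intercept and slope by ordinary least squares."""
    mean_x = statistics.fmean(xs)
    mean_y = statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


if __name__ == "__main__":
    # Generate a sample with known "true" values, then try to recover them.
    xs, ys = generate_sample(n=50, intercept=10.0, slope=2.0, error_sd=1.0)
    est_intercept, est_slope = fit_line(xs, ys)
    print(f"true intercept 10.0, estimated {est_intercept:.2f}")
    print(f"true slope      2.0, estimated {est_slope:.2f}")
```

Rerunning with different `seed` values shows sampling variation: the estimates scatter around the true values, and the scatter shrinks as `n` grows or `error_sd` falls.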

Will G. Hopkins
