Doing Bayesian Data Analysis: A Tutorial with R, JAGS, and Stan

There is an explosion of interest in Bayesian statistics, primarily because recently created computational methods have finally made Bayesian analysis obtainable to a wide audience. Doing Bayesian Data Analysis: A Tutorial with R, JAGS, and Stan provides an accessible approach to Bayesian data analysis, as material is explained clearly with concrete examples. The book begins with the basics, including essential concepts of probability and random sampling, and gradually progresses to advanced hierarchical modeling methods for realistic data. Included are step-by-step instructions on how to conduct Bayesian data analyses in the popular and free software R and WinBugs. This book is intended for first-year graduate students or advanced undergraduates. It provides a bridge between undergraduate training and modern Bayesian methods for data analysis, which is becoming the accepted research standard. Knowledge of algebra and basic calculus is a prerequisite. New to this Edition (partial list): * There are all new programs in JAGS and Stan. The new programs are designed to be much easier to use than the scripts in the first edition. In particular, there are now compact high-level scripts that make it easy to run the programs on your own data sets. This new programming was a major undertaking by itself.* The introductory Chapter 2, regarding the basic ideas of how Bayesian inference re-allocates credibility across possibilities, is completely rewritten and greatly expanded.* There are completely new chapters on the programming languages R (Ch. 3), JAGS (Ch. 8), and Stan (Ch. 14). The lengthy new chapter on R includes explanations of data files and structures such as lists and data frames, along with several utility functions. (It also has a new poem that I am particularly pleased with.) The new chapter on JAGS includes explanation of the RunJAGS package which executes JAGS on parallel computer cores. The new chapter on Stan provides a novel explanation of the concepts of Hamiltonian Monte Carlo. The chapter on Stan also explains conceptual differences in program flow between it and JAGS.* Chapter 5 on Bayes' rule is greatly revised, with a new emphasis on how Bayes' rule re-allocates credibility across parameter values from prior to posterior. The material on model comparison has been removed from all the early chapters and integrated into a compact presentation in Chapter 10.* What were two separate chapters on the Metropolis algorithm and Gibbs sampling have been consolidated into a single chapter on MCMC methods (as Chapter 7). There is extensive new material on MCMC convergence diagnostics in Chapters 7 and 8. There are explanations of autocorrelation and effective sample size. There is also exploration of the stability of the estimates of the HDI limits. New computer programs display the diagnostics, as well.* Chapter 9 on hierarchical models includes extensive new and unique material on the crucial concept of shrinkage, along with new examples.* All the material on model comparison, which was spread across various chapters in the first edition, in now consolidated into a single focused chapter (Ch. 10) that emphasizes its conceptualization as a case of hierarchical modeling.* Chapter 11 on null hypothesis significance testing is extensively revised. It has new material for introducing the concept of sampling distribution. It has new illustrations of sampling distributions for various stopping rules, and for multiple tests.* Chapter 12, regarding Bayesian approaches to null value assessment, has new material about the region of practical equivalence (ROPE), new examples of accepting the null value by Bayes factors, and new explanation of the Bayes factor in terms of the Savage-Dickey method.* Chapter 13, regarding statistical power and sample size, has an extensive new section on sequential testing, and making the research goal be precision of estimation instead of rejecting or accepting a particular value.* Chapter 15, which introduces the generalized linear model, is fully revised, with more complete tables showing combinations of predicted and predictor variable types.* Chapter 16, regarding estimation of means, now includes extensive discussion of comparing two groups, along with explicit estimates of effect size.* Chapter 17, regarding regression on a single metric predictor, now includes extensive examples of robust regression in JAGS and Stan. New examples of hierarchical regression, including quadratic trend, graphically illustrate shrinkage in estimates of individual slopes and curvatures. The use of weighted data is also illustrated.* Chapter 18, on multiple linear regression, includes a new section on Bayesian variable selection, in which various candidate predictors are probabilistically included in the regression model.* Chapter 19, on one-factor ANOVA-like analysis, has all new examples, including a completely worked out example analogous to analysis of covariance (ANCOVA), and a new example involving heterogeneous variances.* Chapter 20, on multi-factor ANOVA-like analysis, has all new examples, including a completely worked out example of a split-plot design that involves a combination of a within-subjects factor and a between-subjects factor.* Chapter 21, on logistic regression, is expanded to include examples of robust logistic regression, and examples with nominal predictors.* There is a completely new chapter (Ch. 22) on multinomial logistic regression. This chapter fills in a case of the generalized linear model (namely, a nominal predicted variable) that was missing from the first edition.* Chapter 23, regarding ordinal data, is greatly expanded. New examples illustrate single-group and two-group analyses, and demonstrate how interpretations differ from treating ordinal data as if they were metric.* There is a new section (25.4) that explains how to model censored data in JAGS.* Many exercises are new or revised. * Accessible, including the basics of essential concepts of probability and random sampling* Examples with R programming language and JAGS software* Comprehensive coverage of all scenarios addressed by non-Bayesian textbooks: t-tests, analysis of variance (ANOVA) and comparisons in ANOVA, multiple regression, and chi-square (contingency table analysis)* Coverage of experiment planning* R and JAGS computer programming code on website* Exercises have explicit purposes and guidelines for accomplishment* Provides step-by-step instructions on how to conduct Bayesian data analyses in the popular and free software R and WinBugs

[1]  B. Mallick VARIABLE SELECTION FOR REGRESSION MODELS , 2016 .

[2]  Mehdi Korki,et al.  Bayesian Hypothesis Testing for Block Sparse , 2015 .

[3]  M. Lee,et al.  Bayesian Cognitive Modeling: A Practical Course , 2014 .

[4]  Peter Willett,et al.  What is a tutorial , 2013 .

[5]  Maurizio Ferconi,et al.  The Theory That Would Not Die , 2013 .

[6]  J. Kruschke Bayesian estimation supersedes the t test. , 2013, Journal of experimental psychology. General.

[7]  Effect Size and Sample Size Planning , 2013 .

[8]  John K Kruschke,et al.  Posterior predictive checks can and should be Bayesian: comment on Gelman and Shalizi, 'Philosophy and the practice of Bayesian statistics'. , 2013, The British journal of mathematical and statistical psychology.

[9]  Zhiyong Zhang,et al.  Bayesian Inference and Application of Robust Growth Curve Models Using Student's t Distribution , 2013 .

[10]  H. Moriya,et al.  The Power of Kawaii: Viewing Cute Images Promotes a Careful Behavior and Narrows Attentional Focus , 2012, PloS one.

[11]  John K. Kruschke,et al.  The Time Has Come : Bayesian Methods for Data Analysis in the Organizational Sciences , 2012 .

[12]  J. Rothman,et al.  Sexual Deprivation Increases Ethanol Intake in Drosophila , 2012, Science.

[13]  Martyn Plummer,et al.  JAGS Version 3.3.0 user manual , 2012 .

[14]  Nial Friel,et al.  Estimating the evidence – a review , 2011, 1111.1957.

[15]  Mark Andrews,et al.  Doing Bayesian Data Analysis: A Tutorial Using R and BUGS by John K. Kruschk , 2011 .

[16]  Radford M. Neal MCMC Using Hamiltonian Dynamics , 2011, 1206.1901.

[17]  Z. Dienes Bayesian Versus Orthodox Statistics: Which Side Are You On? , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.

[18]  John K Kruschke,et al.  Bayesian Assessment of Null Values Via Parameter Estimation and Model Comparison , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.

[19]  Cody Ding Incorporating our own knowledge in data analysis: Is this the right time? , 2011 .

[20]  J. Kruschke Doing Bayesian Data Analysis: A Tutorial with R and BUGS , 2010 .

[21]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[22]  Mark A. Pitt,et al.  Adaptive Design Optimization: A Mutual Information-Based Approach to Model Discrimination in Cognitive Science , 2010, Neural Computation.

[23]  Christopher J. Nachtsheim,et al.  Split-Plot Designs: What, Why, and How , 2009 .

[24]  Jeff Miller What is the probability of replicating a statistically significant effect? , 2009, Psychonomic bulletin & review.

[25]  A. Gelman,et al.  Correlations and Multiple Comparisons in Functional Imaging: A Statistical Perspective (Commentary on Vul et al., 2009) , 2009, Perspectives on psychological science : a journal of the Association for Psychological Science.

[26]  S. Ghosal,et al.  Convergence properties of sequential Bayesian D-optimal designs , 2009 .

[27]  G. Cumming,et al.  Confidence intervals : better answers to better questions. , 2009 .

[28]  J. Kruschke Highlighting : A Canonical Experiment , 2009 .

[29]  M. Aitkin,et al.  Bayes factors: Prior sensitivity and model generalizability , 2008 .

[30]  James Carifio,et al.  Resolving the 50‐year debate around using and misusing Likert scales , 2008, Medical education.

[31]  Jie W Weiss,et al.  Bayesian Statistical Inference for Psychological Research , 2008 .

[32]  J. Kruschke Bayesian approaches to associative learning: From passive to active learning , 2008, Learning & behavior.

[33]  Z. Diénès Understanding Psychology as a Science: An Introduction to Scientific and Statistical Inference , 2008 .

[34]  E. Wagenmakers A practical solution to the pervasive problems ofp values , 2007, Psychonomic bulletin & review.

[35]  Zehao Shen,et al.  Ecological applications of multilevel analysis of variance. , 2007, Ecology.

[36]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[37]  L H Damgaard,et al.  Technical note: how to use Winbugs to draw inferences in animal models. , 2007, Journal of animal science.

[38]  N. Jenkins Ergogenic, metabolic, and perceptual effects of low doses of caffeine , 2007 .

[39]  T. Griffiths,et al.  Iterated learning: Intergenerational knowledge transmission reveals inductive biases , 2007, Psychonomic bulletin & review.

[40]  Glenn Shafer,et al.  Scientific Reasoning: The Bayesian Approach (3rd ed.), Colin Howson and Peter Urbach , 2007 .

[41]  Fulvio De Santis,et al.  Using historical data for Bayesian sample size determination , 2007 .

[42]  Persi Diaconis,et al.  c ○ 2007 Society for Industrial and Applied Mathematics Dynamical Bias in the Coin Toss ∗ , 2022 .

[43]  James G. Scott,et al.  An exploration of aspects of Bayesian multiple testing , 2006 .

[44]  Robert F Krueger,et al.  How money buys happiness: genetic and environmental processes linking finances and life satisfaction. , 2006, Journal of personality and social psychology.

[45]  D. Berry Bayesian clinical trials , 2006, Nature Reviews Drug Discovery.

[46]  Kenneth Rice,et al.  FDR and Bayesian Multiple Comparisons Rules , 2006 .

[47]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[48]  P. Frijters,et al.  How Important is Methodology for the Estimates of the Determinants of Happiness? , 2004 .

[49]  Gerd Gigerenzer,et al.  The Null Ritual. What You Always Wanted to Know About Null Hypothesis Testing, but Were Afraid to Ask , 2004 .

[50]  Mu Zhu The Counter-intuitive Non-informative Prior for the Bernoulli Family , 2004 .

[51]  M. C. Jones,et al.  A skew extension of the t‐distribution, with applications , 2003 .

[52]  Qi-Man Shao,et al.  A Monte Carlo gap test in computing HPD regions , 2003 .

[53]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[54]  Deborah Nolan,et al.  You Can Load a Die, But You Can't Bias a Coin , 2002 .

[55]  Mark C. Greenwood,et al.  Workshop Statistics: Discovery With Data, A Bayesian Approach , 2002, Technometrics.

[56]  Fei Wang,et al.  A simulation-based approach to Bayesian sample size determination for performance under a given model and for separating models , 2002 .

[57]  P. Rogers,et al.  Effects of low doses of caffeine on cognitive performance, mood and thirst in low and higher caffeine consumers , 2000, Psychopharmacology.

[58]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[59]  D. Berry,et al.  Bayesian perspectives on multiple comparisons , 1999 .

[60]  L. Kupper,et al.  Sample size determination for multiple comparison studies treating confidence interval width as random. , 1999, Statistics in medicine.

[61]  Matthew N. O. Sadiku,et al.  A Tutorial on Simulation of Queueing Models , 1999 .

[62]  Ming-Hui Chen,et al.  Monte Carlo Estimation of Bayesian Credible and HPD Intervals , 1999 .

[63]  Ulrich Hoffrage,et al.  Simplifying Bayesian Inference: The General Case , 1999 .

[64]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo in Practice: A Roundtable Discussion , 1998 .

[65]  L Rosa,et al.  A close look at therapeutic touch. , 1998, JAMA.

[66]  C. Adcock Sample size determination : a review , 1997 .

[67]  D. Lindley The choice of sample size , 1997 .

[68]  A. O’Hagan,et al.  Properties of intrinsic and fractional Bayes factors , 1997 .

[69]  Robert J. Steidl,et al.  Statistical power analysis in wildlife research , 1997 .

[70]  I. J. Myung,et al.  Applying Occam’s razor in modeling cognition: A Bayesian approach , 1997 .

[71]  Len Thomas,et al.  Retrospective Power Analysis , 1997 .

[72]  Robert P. Abelson,et al.  On the Surprising Longevity of Flogged Horses: Why There Is a Case for the Significance Test , 1997 .

[73]  F. Schmidt Statistical Significance Testing and Cumulative Knowledge in Psychology: Implications for Training of Researchers , 1996 .

[74]  Rob J Hyndman,et al.  Computing and Graphing Highest Density Regions , 1996 .

[75]  Gerd Gigerenzer,et al.  How to Improve Bayesian Reasoning Without Instruction: Frequency Formats , 1995 .

[76]  K. Chaloner,et al.  Bayesian Experimental Design: A Review , 1995 .

[77]  L. Joseph,et al.  Sample Size Calculations for Binomial Proportions Via Highest Posterior Density Intervals , 1995 .

[78]  L. Joseph,et al.  Some comments on Bayesian sample size determination , 1995 .

[79]  A. Gelfand,et al.  Bayesian Model Choice: Asymptotics and Exact Calculations , 1994 .

[80]  A. James,et al.  Sexual Activity and the Lifespan of Male Fruitflies: A Dataset That Gets Attention , 1994 .

[81]  William L. Hays,et al.  Statistics, 5th ed. , 1994 .

[82]  Leonard Maltin,et al.  Leonard Maltin's Movie and Video Guide , 1993 .

[83]  A. Dale,et al.  A History of Inverse Probability from Thomas Bayes to Karl Pearson , 1993 .

[84]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[85]  T. Pham-Gia,et al.  Sample Size Determination in Bayesian Analysis , 1992 .

[86]  Radford M. Neal An improved acceptance procedure for the hybrid Monte Carlo algorithm , 1992, hep-lat/9208011.

[87]  D. Burmaster,et al.  Bivariate distributions for height and weight of men and women in the United States. , 1992, Risk analysis : an official publication of the Society for Risk Analysis.

[88]  James O. Berger,et al.  Ockham's Razor and Bayesian Analysis , 1992 .

[89]  Javier R. Movellan,et al.  Benefits of gain: speeded learning and minimal hidden layers in back-propagation networks , 1991, IEEE Trans. Syst. Man Cybern..

[90]  Scott E. Maxwell,et al.  Designing Experiments and Analyzing Data: A Model Comparison Perspective , 1990 .

[91]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[92]  Jeremy MG Taylor,et al.  Robust Statistical Modeling Using the t Distribution , 1989 .

[93]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[94]  D D Boos,et al.  Tests and confidence sets for comparing two mean residual life functions. , 1988, Biometrics.

[95]  A. Kennedy,et al.  Hybrid Monte Carlo , 1988 .

[96]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[97]  P. Macaskill,et al.  Stopping rules for clinical trials incorporating clinical opinion. , 1984, Biometrics.

[98]  Paul Slovic,et al.  Multidimensional functional learning (MFL) and some new conceptions of feedback , 1981 .

[99]  C. Dunnett,et al.  Significance testing to establish equivalence between treatments, with special reference to data in the form of 2X2 tables. , 1977, Biometrics.

[100]  W. Westlake,et al.  Symmetrical confidence intervals for bioequivalence trials. , 1976, Biometrics.

[101]  R. Snee Graphical Display of Two-way Contingency Tables , 1974 .

[102]  M. Degroot Optimal Statistical Decisions , 1970 .

[103]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[104]  R. L. Winkler The Assessment of Prior Distributions in Bayesian Analysis , 1967 .

[105]  P. Meehl Theory-Testing in Psychology and Physics: A Methodological Paradox , 1967, Philosophy of Science.

[106]  William C. Guenther,et al.  Concepts of statistical inference , 1966 .

[107]  G. Rota The Number of Partitions of a Set , 1964 .

[108]  A. N. Kolmogorov,et al.  Foundations of the theory of probability , 1960 .

[109]  R. Duncan Luce,et al.  Individual Choice Behavior , 1959 .

[110]  F. J. Anscombe,et al.  Fixed-Sample-Size Analysis of Sequential Observations , 1954 .

[111]  C. Eriksen,et al.  Effects of failure stress upon skilled performance. , 1952, Journal of experimental psychology.

[112]  S S Stevens,et al.  On the Theory of Scales of Measurement. , 1946, Science.

[113]  C. I. Bliss,et al.  THE METHOD OF PROBITS. , 1934, Science.

[114]  Student,et al.  THE PROBABLE ERROR OF A MEAN , 1908 .