Augmenting Data With Published Results in Bayesian Linear Regression

In most research, linear regression analyses are performed without taking into account published results (i.e., reported summary statistics) of similar previous studies. Although the prior density in Bayesian linear regression could accommodate such prior knowledge, formal models for doing so are absent from the literature. The goal of this article is therefore to develop a Bayesian model in which a linear regression analysis on current data is augmented with the reported regression coefficients (and standard errors) of previous studies. Two versions of this model are presented. The first version incorporates previous studies through the prior density and is applicable when the current and all previous studies are exchangeable. The second version models all studies in a hierarchical structure and is applicable when studies are not exchangeable. Both versions of the model are assessed using simulation studies. Performance for each in estimating the regression coefficients is consistently superior to using current data alone and is close to that of an equivalent model that uses the data from previous studies rather than reported regression coefficients. Overall the results show that augmenting data with results from previous studies is viable and yields significant improvements in the parameter estimation.

[1]  David J. Spiegelhalter,et al.  An Overview of the Bayesian Approach , 2004 .

[2]  Richard D Riley,et al.  Multivariate meta‐analysis: the effect of ignoring within‐study correlation , 2009 .

[3]  Feng Chen,et al.  Combining Previously Published Studies with Current Data in Bayesian Logistic Regression Model: An Example for Identifying Susceptibility Genes Related to Lung Cancer in Humans , 2009, Journal of toxicology and environmental health. Part A.

[4]  Xiao-Li Meng,et al.  Modeling covariance matrices in terms of standard deviations and correlations, with application to shrinkage , 2000 .

[5]  Adrian F. M. Smith,et al.  Bayesian Statistics 5. , 1998 .

[6]  D. Spiegelhalter,et al.  Disease Mapping With WinBUGS and MLwiN, Bayesian Approaches to Clinical Trials and Health Care Evaluation , 2004 .

[7]  M. A. Best Bayesian Approaches to Clinical Trials and Health‐Care Evaluation , 2005 .

[8]  Kerrie Mengersen,et al.  Multivariate meta‐analysis , 2003, Statistics in medicine.

[9]  Scott M. Lynch,et al.  Introduction to Applied Bayesian Statistics and Estimation for Social Scientists , 2007 .

[10]  S. Maxwell The persistence of underpowered studies in psychological research: causes, consequences, and remedies. , 2004, Psychological methods.

[11]  D. Borsboom,et al.  The poor availability of psychological research data for reanalysis. , 2006, The American psychologist.

[12]  Julian P T Higgins,et al.  Recent developments in meta‐analysis , 2008, Statistics in medicine.

[13]  J. Bernardo The Concept of Exchangeability and its Applications , 2001 .

[14]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[15]  S. Raudenbush,et al.  A multivariate mixed linear model for meta-analysis. , 1996 .

[16]  J. Ibrahim,et al.  Power prior distributions for regression models , 2000 .

[17]  Vincent Buskens,et al.  Combining Statistical Evidence From Several Studies , 2013 .

[18]  Theo Stijnen,et al.  Combining multiple outcome measures in a meta‐analysis: an application , 2003, Statistics in medicine.