Don't Let the Truth Get in the Way of a Good Story: An Illustration of Citation Bias in Epidemiologic Research

The production of scientific knowledge is susceptible to bias at every stage of the process, from what questions are asked by the investigator, to which method is chosen to gather data, to which analyses are conducted (e.g., “P-hacking,” wherein the method of statistical analysis and the degrees of freedom are manipulated until they yield statistically significant results) (1). Even after completion of a study, authors sometimes choose not to submit their work for publication because they are not satisfied with the results (i.e., the “file drawer” problem) (1), or they encounter difficulties with getting results published because of reviewer or editorial bias (“publication bias”) (2–4). Although prepublication biases have been well described in epidemiologic textbooks, postpublication biases, such as selective citation, have been less well documented. “Citation bias” occurs when scientists selectively cite papers based upon risk estimates that conform to their preconceived notions (5). When researchers have a bias in favor of “X causing Y,” they are more likely to cite papers that found evidence to support their view. Conversely, when researchers harbor a bias against a hypothesized association, they may selectively cite papers that report null findings. Here, we use research on job strain and the risk of coronary heart disease to examine factors that influence citations in peer-reviewed literature. In addition to the risk estimate for job strain relative to no job strain in each study, we take into account the impact factor of the publishing journal, which is an indicator of its prestige.

[1]  A. Steptoe,et al.  Work-Related Psychosocial Factors and the Development of Ischemic Heart Disease: A Systematic Review , 2009, Cardiology in review.

[2]  Ritva Luukkonen,et al.  Work stress and risk of cardiovascular mortality: prospective cohort study of industrial employees , 2002, BMJ : British Medical Journal.

[3]  J. Johnson,et al.  Combined effects of job strain and social isolation on cardiovascular disease morbidity and mortality in a random sample of the Swedish male working population. , 1989, Scandinavian journal of work, environment & health.

[4]  M Marmot,et al.  Job strain, job demands, decision latitude, and risk of coronary heart disease within the Whitehall II study , 2003, Journal of epidemiology and community health.

[5]  R. Rosenthal,et al.  Selective publication of antidepressant trials and its influence on apparent efficacy. , 2008, The New England journal of medicine.

[6]  R. Wears,et al.  Journal prestige, publication bias, and other characteristics associated with citation of published studies in peer-reviewed journals. , 2002, JAMA.

[7]  G. Colditz,et al.  A prospective study of job strain and coronary heart disease in US women. , 2002, International journal of epidemiology.

[8]  T. Perneger,et al.  Citation bias favoring statistically significant studies was present in medical research. , 2013, Journal of clinical epidemiology.

[9]  J. P. Flynn,et al.  Twenty-year perspective of randomized controlled trials for surgery of chronic nonspecific low back pain: citation bias and tangential knowledge. , 2013, The spine journal : official journal of the North American Spine Society.

[10]  S. Vernon,et al.  Decision latitude, psychologic demand, job strain, and coronary heart disease in the Western Electric Study. , 1994, American journal of epidemiology.

[11]  M. Kivimäki,et al.  Cumulative meta-analysis of job strain and CHD. , 2014, Epidemiology.

[12]  A. LaCroix,et al.  Occupational strain and the incidence of coronary heart disease. , 1989, American journal of epidemiology.

[13]  E. Eaker,et al.  Myocardial infarction and coronary death among women: psychosocial predictors from a 20-year follow-up of women in the Framingham Study. , 1992, American journal of epidemiology.

[14]  Andrew Steptoe,et al.  Job strain as a risk factor for coronary heart disease: a collaborative meta-analysis of individual participant data , 2012, The Lancet.

[15]  Leif D. Nelson,et al.  P-Curve: A Key to the File Drawer , 2013, Journal of experimental psychology. General.