Invited commentary: Off-roading with social epidemiology--exploration, causation, translation.

Population health improvements are the most relevant yardstick against which to evaluate the success of social epidemiology. In coming years, social epidemiology must increasingly emphasize research that facilitates translation into health improvements, with continued focus on macro-level social determinants of health. Given the evidence that the effects of social interventions often differ across population subgroups, systematic and transparent exploration of the heterogeneity of health determinants across populations will help inform effective interventions. This research should consider both biological and social risk factors and effect modifiers. We also recommend that social epidemiologists take advantage of recent revolutionary improvements in data availability and computing power to examine new hypotheses and expand our repertoire of study designs. Better data and computing power should facilitate underused analytic approaches, such as instrumental variables, simulation studies and models of complex systems, and sensitivity analyses of model biases. Many data-driven machine-learning approaches are also now computationally feasible and likely to improve both prediction models and causal inference in social epidemiology. Finally, we emphasize the importance of specifying exposures corresponding with realistic interventions and policy options. Effect estimates for directly modifiable, clearly defined health determinants are most relevant for building translational social epidemiology to reduce disparities and improve population health.

[1]  M Alan Brookhart,et al.  Evaluating Short-Term Drug Effects Using a Physician-Specific Prescribing Preference as an Instrumental Variable , 2006, Epidemiology.

[2]  Atul J Butte,et al.  Systematic evaluation of environmental factors: persistent pollutants and nutrients correlated with serum lipid levels , 2012, International journal of epidemiology.

[3]  Y. Abu-Mostafa Machines that Think for Themselves , 2012 .

[4]  F. Hu,et al.  Depression and risk of stroke morbidity and mortality: a meta-analysis and systematic review. , 2011, JAMA.

[5]  M. Glymour,et al.  Does childhood schooling affect old age memory or mental status? Using state schooling laws as natural experiments , 2008, Journal of Epidemiology & Community Health.

[6]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[7]  E. Keeler,et al.  Do longer postpartum stays reduce newborn readmissions? Analysis using instrumental variables. , 2000, Health services research.

[8]  Bruce G. Link,et al.  Six paths for the future of social epidemiology. , 2013, American journal of epidemiology.

[9]  Joseph P. Newhouse,et al.  Does More Intensive Treatment of Acute Myocardial Infarction in the Elderly Reduce Mortality? Analysis Using Instrumental Variables , 1995 .

[10]  Rafael Lozano,et al.  Modeling causes of death: an integrated approach using CODEm , 2012, Population Health Metrics.

[11]  M. Hernán Invited commentary: hypothetical interventions to define causal effects--afterthought or prerequisite? , 2005, American journal of epidemiology.

[12]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[13]  N. Adler,et al.  Commentary: it's not all means and genes--socio-economic position, variation and genetic confounding. , 2010, International journal of epidemiology.

[14]  N. Risch,et al.  Genomic Priorities and Public Health , 2003, Science.

[15]  M G Marmot,et al.  Social/economic status and disease. , 1987, Annual review of public health.

[16]  J. M. Oakes,et al.  The (mis)estimation of neighborhood effects: causal inference for a practicable social epidemiology. , 2004, Social science & medicine.

[17]  Aik Choon Tan,et al.  Ensemble machine learning on gene expression data for cancer classification. , 2003, Applied bioinformatics.

[18]  M. McQueen,et al.  Gene–environment interactions related to body mass: School policies and social context as environmental moderators , 2012, Journal of theoretical politics.

[19]  M. Lai,et al.  Risk groups defined by Recursive Partitioning Analysis of patients with colorectal adenocarcinoma treated with colorectal resection , 2012, BMC Medical Research Methodology.

[20]  A. Hartz,et al.  A comparison of observational studies and randomized, controlled trials , 2000, American journal of ophthalmology.

[21]  Mary Cushman,et al.  Estrogen plus progestin and the risk of coronary heart disease. , 2003, The New England journal of medicine.

[22]  L. Berkman,et al.  Social epidemiology: social determinants of health in the United States: are we losing ground? , 2009, Annual review of public health.

[23]  N. Adler,et al.  U.S. disparities in health: descriptions, causes, and mechanisms. , 2008, Annual review of public health.

[24]  Carole Ober,et al.  Gene-environment interactions in human disease: nuisance or opportunity? , 2011, Trends in genetics : TIG.

[25]  J. Boardman State-level moderation of genetic tendencies to smoke. , 2009, American journal of public health.

[26]  Robert A. Legenstein,et al.  Combining predictions for accurate recommender systems , 2010, KDD.

[27]  T. Osypuk Invited commentary: integrating a life-course perspective and social theory to advance research on residential segregation and health. , 2013, American journal of epidemiology.

[28]  David R. Williams Socioeconomic Differentials in Health: A Review and Redirection , 1990 .

[29]  T. Bruckner,et al.  Positive income shocks and accidental deaths among Cherokee Indians: a natural experiment. , 2011, International journal of epidemiology.

[30]  J Carpenter,et al.  Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. , 2000, Statistics in medicine.

[31]  Sebastian Schneeweiss,et al.  Instrumental variable methods in comparative safety and effectiveness research , 2010, Pharmacoepidemiology and drug safety.

[32]  I. Kawachi,et al.  Education and Inequalities in Risk Scores for Coronary Heart Disease and Body Mass Index: Evidence for a Population Strategy , 2012, Epidemiology.

[33]  J. Concato,et al.  Randomized, controlled trials, observational studies, and the hierarchy of research designs. , 2000, The New England journal of medicine.

[34]  S. Syme,et al.  Incorporating socioeconomic factors into U.S. health policy: addressing the barriers. , 2002, Health affairs.

[35]  Robert Plomin,et al.  Evidence for a strong genetic influence on childhood adiposity despite the force of the obesogenic environment. , 2008, The American journal of clinical nutrition.

[36]  Sander Greenland,et al.  Modern Epidemiology 3rd edition , 1986 .

[37]  A. Roux,et al.  Commentary: causes of incidence and causes of cases--a Durkheimian perspective on Rose. , 2001, International journal of epidemiology.

[38]  Peter Bühlmann,et al.  Predicting causal effects in large-scale systems from observational data , 2010, Nature Methods.

[39]  Maximilian D. Schmeiser,et al.  Expanding Wallets and Waistlines: The Impact of Family Income on the BMI of Women and Men Eligible for the Earned Income Tax Credit , 2008, Health economics.

[40]  J. Ioannidis,et al.  Systematic Review of the Empirical Evidence of Study Publication Bias and Outcome Reporting Bias , 2008, PloS one.

[41]  G. Tutz,et al.  An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. , 2009, Psychological methods.

[42]  M. Glymour,et al.  Frailty modifies effectiveness of psychosocial intervention in recovery from stroke , 2007, Clinical rehabilitation.

[43]  Cha Zhang,et al.  Ensemble Machine Learning , 2012 .

[44]  Timothy L Lash,et al.  Semi-Automated Sensitivity Analysis to Assess Systematic Errors in Observational Data , 2003, Epidemiology.

[45]  Timothy L Lash,et al.  A method to automate probabilistic sensitivity analyses of misclassified binary variables. , 2005, International journal of epidemiology.

[46]  M. Glymour,et al.  Differential mental health effects of neighborhood relocation among youth in vulnerable families: results from a randomized trial. , 2012, Archives of general psychiatry.

[47]  C Jencks,et al.  Heredity, environment, and public policy reconsidered. , 1980, American sociological review.

[48]  M. Glymour,et al.  Gender and Crime Victimization Modify Neighborhood Effects on Adolescent Mental Health , 2012, Pediatrics.

[49]  Douglas L. Miller,et al.  The Effects of Housing and Neighborhood Conditions on Child Mortality , 2011, Journal of health economics.

[50]  W. Evans,et al.  The Effect of Income on Mortality: Evidence from the Social Security Notch , 2006, The Review of Economics and Statistics.

[51]  S. Ebrahim,et al.  'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease? , 2003, International journal of epidemiology.

[52]  M. Glymour,et al.  Review of publication bias in studies on publication bias: Here's a proposal for editors that may help reduce publication bias , 2005, BMJ : British Medical Journal.

[53]  T. Osypuk,et al.  Do Social Policies Influence the Health of Women and their Children , 2013 .

[54]  Sandro Galea,et al.  Estimated deaths attributable to social factors in the United States. , 2011, American journal of public health.

[55]  E. Lander,et al.  The mystery of missing heritability: Genetic interactions create phantom heritability , 2012, Proceedings of the National Academy of Sciences.

[56]  D. Rehkopf,et al.  Effects of Prenatal Poverty on Infant Health , 2010, American sociological review.

[57]  James M. Robins,et al.  Observational Studies Analyzed Like Randomized Experiments: An Application to Postmenopausal Hormone Therapy and Coronary Heart Disease , 2008, Epidemiology.

[58]  Adele Cutler,et al.  An application of Random Forests to a genome-wide association dataset: Methodological considerations & new findings , 2010, BMC Genetics.

[59]  I. Deary,et al.  Education reduces the effects of genetic susceptibilities to poor physical health. , 2010, International journal of epidemiology.

[60]  S. Rose Mortality risk score prediction in an elderly population using machine learning. , 2013, American journal of epidemiology.

[61]  G. Smith Epidemiology, epigenetics and the 'Gloomy Prospect': embracing randomness in population health research and practice. , 2011, International journal of epidemiology.

[62]  Cha Zhang,et al.  Ensemble Machine Learning: Methods and Applications , 2012 .

[63]  M. Hernán,et al.  Does obesity shorten life? The importance of well-defined interventions to answer causal questions , 2008, International Journal of Obesity.

[64]  Atul J. Butte,et al.  An Environment-Wide Association Study (EWAS) on Type 2 Diabetes Mellitus , 2010, PloS one.