Assessment of skewed exposure in case‐control studies with pooling

Pooling‐based strategies that combine samples from multiple participants for laboratory assays have been proposed for epidemiologic investigations of biomarkers to address issues including cost, efficiency, detection, and when minimal sample volume is available. A modification of the standard logistic regression model has been previously described to allow use with pooled data; however, this model makes assumptions regarding exposure distribution and logit‐linearity of risk (i.e., constant odds ratio) that can be violated in practice. We were motivated by a nested case‐control study of miscarriage and inflammatory factors with highly skewed distributions to develop a more flexible model for analysis of pooled data. Using characteristics of the gamma distribution and the relation between models of binary outcome conditional on exposure and of exposure conditional on outcome, we use a modified logistic regression to accommodate nonlinearity because of unequal shape parameters in gamma distributed exposure for cases and controls. Using simulations, we compare our approach with existing methods for logistic regression for pooled data considering: (1) constant and dose‐dependent effects; (2) gamma and log‐normal distributed exposure; (3) effect size; and (4) the proportions of biospecimens pooled. We show that our approach allows estimation of odds ratios that vary with exposure level, yet has minimal loss of efficiency compared with existing approaches when exposure effects are dose‐invariant. Our model performed similarly to a maximum likelihood estimation approach in terms of bias and efficiency, and provides an easily implemented approach for estimation with pooled biomarker data when effects may not be constant across exposure. Copyright © 2012 John Wiley & Sons, Ltd.

[1]  A. Sterrett On the Detection of Defective Members of Large Populations , 1957 .

[2]  J. Cornfield Joint dependence of risk of coronary heart disease on serum cholesterol and systolic blood pressure: a discriminant function analysis. , 1962, Federation proceedings.

[3]  D D Baird,et al.  Incidence of early loss of pregnancy. , 1988, The New England journal of medicine.

[4]  T. Young,et al.  The association of reproductive and menstrual characteristics and colon and rectal cancer risk in Wisconsin women. , 1995, Annals of epidemiology.

[5]  C R Weinberg,et al.  Using Pooled Exposure Assessment to Improve Efficiency in Case‐Control Studies , 1999, Biometrics.

[6]  M. Andersen,et al.  Biological regulation of receptor-hormone complex concentrations in relation to dose-response assessments for endocrine-active compounds. , 1999, Toxicological sciences : an official journal of the Society of Toxicology.

[7]  K. Kalache,et al.  Elevated monocyte chemotactic protein-1 in amniotic fluid is a risk factor for pregnancy loss , 2002, The journal of maternal-fetal & neonatal medicine : the official journal of the European Association of Perinatal Medicine, the Federation of Asia and Oceania Perinatal Societies, the International Society of Perinatal Obstetricians.

[8]  Melvin E Andersen,et al.  Molecular circuits, biological switches, and nonlinear dose-response relationships. , 2002, Environmental health perspectives.

[9]  Enrique F Schisterman,et al.  Roc Curve Analysis for Biomarkers Based on Pooled Assessments , 2022 .

[10]  T. Theoharides,et al.  High levels of intrauterine corticotropin-releasing hormone, urocortin, tryptase, and interleukin-8 in spontaneous abortions. , 2003, Endocrinology.

[11]  R. Canfield,et al.  Intellectual Impairment in Children with Blood Lead Concentrations below 10 μ g per Deciliter , 2003 .

[12]  Shu-zheng Liu Nonlinear Dose-Response Relationship in the Immune System following Exposure to Ionizing Radiation: Mechanisms and Implications , 2003, Nonlinearity in biology, toxicology, medicine.

[13]  J. Hardy The Collaborative Perinatal Project: lessons and legacy. , 2003, Annals of epidemiology.

[14]  R. Canfield,et al.  Intellectual Impairment in Children with Blood Lead Concentrations below 10 μg per Deciliter , 2003 .

[15]  Rebecca L. Jones,et al.  Identification of chemokines important for leukocyte recruitment to the human endometrium at the times of embryo implantation and menstruation. , 2004, The Journal of clinical endocrinology and metabolism.

[16]  Shu-Dong Zhang,et al.  Bioinformatics Original Paper Effect of Pooling Samples on the Efficiency of Comparative Studies Using Microarrays , 2022 .

[17]  Enrique F Schisterman,et al.  Efficient Design and Analysis of Biospecimens with Measurements Subject to Detection Limit , 2006, Biometrical journal. Biometrische Zeitschrift.

[18]  I. Bergdahl What is the meaning of non-linear dose-response relationships between blood lead concentrations and IQ? , 2006, Neurotoxicology.

[19]  P. Sampson,et al.  Response to: "What is the meaning of non-linear dose-response relationships between blood lead concentrations and IQ?". , 2006, Neurotoxicology.

[20]  Carol Bigelow,et al.  Modeling Nonlinear Dose-Response Relationships in Epidemiologic Studies: Statistical Approaches and Practical Challenges , 2005, Dose-response : a publication of International Hormesis Society.

[21]  Enrique F Schisterman,et al.  Pooling biospecimens and limits of detection: effects on ROC curve analysis. , 2006, Biostatistics.

[22]  Enrique F Schisterman,et al.  Circulating chemokine levels and miscarriage. , 2007, American journal of epidemiology.

[23]  B. Whitcomb,et al.  Circulating levels of cytokines during pregnancy: thrombopoietin is elevated in miscarriage. , 2008, Fertility and sterility.

[24]  Albert Vexler,et al.  To pool or not to pool, from whether to when: applications of pooling to biospecimens subject to a limit of detection. , 2008, Paediatric and perinatal epidemiology.

[25]  R. Lyles,et al.  A Fresh Look at the Discriminant Function Approach for Estimating Crude or Adjusted Odds Ratios , 2009, American Statistician.

[26]  B. Whitcomb,et al.  Maternal serum granulocyte colony-stimulating factor levels and spontaneous preterm birth. , 2009, Journal of women's health.

[27]  Enrique F Schisterman,et al.  Hybrid pooled–unpooled design for cost‐efficient measurement of biomarkers , 2010, Statistics in medicine.

[28]  Colin A. Johnson,et al.  Mutation analysis of 18 nephronophthisis associated ciliopathy disease genes using a DNA pooling and next generation sequencing strategy , 2010, Journal of Medical Genetics.

[29]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .