Case-control studies for rare diseases: improved estimation of several risks and of feature dependences

To capture how a disease depends on several risk factors, one challenge is to combine model-based estimation with evidence-based arguments. Standard case-control methods estimate the dependence of a rare disease on several regressors via logistic regression. In a case-control study, the sampling design yields samples from two different populations, cases and controls; for the set of regressors in each logistic regression, these samples are then pooled and treated as given observations. However, it is the differences between the independence structures of the regressors for cases and for controls that can improve logistic regression estimates and point to the important feature dependences specific to the diseased. A case-control study on laryngeal cancer serves as an illustration.
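As background for why logistic regression is usable with case-control samples at all, the following minimal sketch illustrates a standard fact: when controls are sampled independently of exposure, the odds ratio in the expected case-control table equals the population odds ratio. It does not reproduce the approach of this paper. The counts, the sampling fraction, and the function names are hypothetical and chosen only for illustration.

```python
def odds_ratio(cases_exposed, cases_unexposed, controls_exposed, controls_unexposed):
    """Odds ratio of a 2x2 exposure-by-disease table."""
    return (cases_exposed * controls_unexposed) / (cases_unexposed * controls_exposed)


def case_control_expected(pop, control_fraction):
    """Expected 2x2 table when all cases are kept and non-cases are
    sampled at `control_fraction`, independently of exposure."""
    return (
        pop["cases_exposed"],
        pop["cases_unexposed"],
        pop["noncases_exposed"] * control_fraction,
        pop["noncases_unexposed"] * control_fraction,
    )


# Hypothetical population counts for a rare disease (illustrative only).
population = {
    "cases_exposed": 200,
    "cases_unexposed": 450,
    "noncases_exposed": 99_800,
    "noncases_unexposed": 899_550,
}

or_population = odds_ratio(
    population["cases_exposed"],
    population["cases_unexposed"],
    population["noncases_exposed"],
    population["noncases_unexposed"],
)

# Assumed sampling fraction for controls; it cancels in the odds ratio.
or_case_control = odds_ratio(*case_control_expected(population, control_fraction=0.001))

print(or_population, or_case_control)
```

The sampling fraction multiplies both control cells and therefore cancels, which is why only the odds ratio, and not the absolute risk, is recoverable from such a design without further information.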
