Mixture Model Based Group Inference in Fused Genotype and Phenotype Data

The analysis of genetic diseases has classically aimed at establishing direct links between a cause, a genetic variation, and an effect, the observable deviation in phenotype. For complex diseases, which are caused by multiple factors and show widely varying phenotypes, this approach is unlikely to succeed. One example is attention deficit hyperactivity disorder (ADHD), where phenotypic variation is expected to arise from the overlapping effects of several distinct genetic mechanisms. The classical statistical models for overlapping subgroups are mixture models, essentially convex combinations of density functions, which allow both the inference of descriptive models from data and the deduction of groups. An extension of conventional mixtures with attractive properties for clustering is the context-specific independence (CSI) framework. CSI automatically adapts model complexity to avoid overfitting and yields a highly descriptive model.
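To make the phrase "convex combinations of density functions" concrete, the following is a minimal sketch of a conventional two-component mixture on a single continuous feature. It is not the CSI framework and not the authors' implementation; the Gaussian component family, the parameter values, and all function names are assumptions chosen purely for illustration. The density is the weighted sum of the component densities, and groups are deduced by assigning each observation to the component with the highest posterior probability.

```python
import math

# Hypothetical parameters for illustration only: two overlapping subgroups.
WEIGHTS = [0.6, 0.4]   # mixing weights, non-negative and summing to 1
MEANS = [0.0, 3.0]     # component means
STDS = [1.0, 1.0]      # component standard deviations


def gaussian_pdf(x, mean, std):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def mixture_pdf(x, weights, means, stds):
    """Mixture density: a convex combination of the component densities."""
    return sum(w * gaussian_pdf(x, m, s)
               for w, m, s in zip(weights, means, stds))


def responsibilities(x, weights, means, stds):
    """Posterior probability of each component given the observation x."""
    joint = [w * gaussian_pdf(x, m, s)
             for w, m, s in zip(weights, means, stds)]
    total = sum(joint)
    return [j / total for j in joint]


def assign_group(x, weights, means, stds):
    """Index of the component with the highest posterior probability."""
    post = responsibilities(x, weights, means, stds)
    return max(range(len(post)), key=post.__getitem__)


if __name__ == "__main__":
    for x in (-0.5, 1.5, 3.2):
        post = responsibilities(x, WEIGHTS, MEANS, STDS)
        group = assign_group(x, WEIGHTS, MEANS, STDS)
        print(x, [round(p, 3) for p in post], group)
```

In practice the parameters would be estimated from data, typically with the EM algorithm, rather than fixed by hand as they are here.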
