Phenotypic Complexity, Measurement Bias, and Poor Phenotypic Resolution Contribute to the Missing Heritability Problem in Genetic Association Studies

Background The variance explained by genetic variants as identified in (genome-wide) genetic association studies is typically small compared to family-based heritability estimates. Explanations of this ‘missing heritability’ have been mainly genetic, such as genetic heterogeneity and complex (epi-)genetic mechanisms. Methodology We used comprehensive simulation studies to show that three phenotypic measurement issues also provide viable explanations of the missing heritability: phenotypic complexity, measurement bias, and phenotypic resolution. We identify the circumstances in which the use of phenotypic sum-scores and the presence of measurement bias lower the power to detect genetic variants. In addition, we show how the differential resolution of psychometric instruments (i.e., whether the instrument includes items that resolve individual differences in the normal range or in the clinical range of a phenotype) affects the power to detect genetic variants. Conclusion We conclude that careful phenotypic data modelling can improve the genetic signal, and thus the statistical power to identify genetic variants by 20–99%.

[1]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[2]  D. Borsboom,et al.  Comorbidity: A network perspective , 2010, Behavioral and Brain Sciences.

[3]  S. Faraone,et al.  Molecular genetics of attention deficit hyperactivity disorder. , 2010, The Psychiatric clinics of North America.

[4]  John P A Ioannidis,et al.  Beyond genome-wide association studies: genetic heterogeneity and individual predisposition to cancer. , 2010, Trends in genetics : TIG.

[5]  Gonçalo R. Abecasis,et al.  Functional Gene Group Analysis Reveals a Role of Synaptic Heterotrimeric G Proteins in Cognitive Ability , 2010, American journal of human genetics.

[6]  M. Neale,et al.  An integrated phenomic approach to multivariate allelic association , 2010, European Journal of Human Genetics.

[7]  K. Kendler,et al.  Deconstructing major depression: a validation study of the DSM-IV symptomatic criteria , 2010, Psychological Medicine.

[8]  Jun Li,et al.  Steroid 5-{alpha}-reductase Type 2 (SRD5a2) gene polymorphisms and risk of prostate cancer: a HuGE review. , 2010, American journal of epidemiology.

[9]  L. Xian,et al.  Polymorphisms in the promoter regions of matrix metalloproteinases 1 and 3 and cancer risk: a meta-analysis of 50 case-control studies. , 2010, Mutagenesis.

[10]  D. Clayton,et al.  Genome-wide association study and meta-analysis finds over 40 loci affect risk of type 1 diabetes , 2009, Nature Genetics.

[11]  Zhen-hua Hu,et al.  Differential effects of NOD2 polymorphisms on colorectal cancer risk: a meta-analysis , 2010, International Journal of Colorectal Disease.

[12]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[13]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.

[14]  M. Rietschel,et al.  Dissecting the phenotype in genome-wide association studies of psychiatric illness , 2009, British Journal of Psychiatry.

[15]  Paola Sebastiani,et al.  Genome‐wide association studies and the genetic dissection of complex traits , 2009, American journal of hematology.

[16]  W. Johnson,et al.  Group differences in the heritability of items and test scores , 2009, Proceedings of the Royal Society B: Biological Sciences.

[17]  Manuel A. R. Ferreira,et al.  Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. , 2009, American journal of human genetics.

[18]  Benjamin M. Neale,et al.  Genome-wide association studies in ADHD , 2009, Human Genetics.

[19]  M. Zheng,et al.  ATG16L1 T300A polymorphism and Crohn’s disease susceptibility: evidence from 13,022 cases and 17,532 controls , 2009, Human Genetics.

[20]  N. Wray,et al.  Genomewide Association for Major Depressive Disorder: A possible role for the presynaptic protein Piccolo , 2008, Molecular Psychiatry.

[21]  Martin Lawn,et al.  A new lease of life for Thomson's bonds model of intelligence. , 2009, Psychological review.

[22]  B. Maher Personal genomes: The case of the missing heritability , 2008, Nature.

[23]  D. Borsboom Latent Variable Theory , 2008 .

[24]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[25]  Conor V. Dolan,et al.  Power Calculations Using Exact Data Simulation: A Useful Tool for Genetic Study Designs , 2007, Behavior genetics.

[26]  B. Maher,et al.  The case of the missing heritability , 2008 .

[27]  D. Posthuma,et al.  Across the continuum of attention skills: a twin study of the SWAN ADHD rating scale. , 2007, Journal of child psychology and psychiatry, and allied disciplines.

[28]  R. Plomin,et al.  Genetic Support for the Dual Nature of Attention Deficit Hyperactivity Disorder: Substantial Genetic Overlap Between the Inattentive and Hyperactive–impulsive Components , 2007, Journal of abnormal child psychology.

[29]  John P.A. Ioannidis,et al.  Non-Replication and Inconsistency in the Genome-Wide Association Setting , 2007, Human Heredity.

[30]  Dorret I. Boomsma,et al.  Variance Decomposition Using an IRT Measurement Model , 2007, Behavior genetics.

[31]  D. Boomsma,et al.  Longitudinal genetic study of verbal and nonverbal IQ from early childhood to young adulthood , 2007 .

[32]  L. Cardon,et al.  Designing candidate gene and genome-wide case–control association studies , 2007, Nature Protocols.

[33]  H.L.J. van der Maas,et al.  A dynamical model of general intelligence: the positive manifold of intelligence by mutualism. , 2006, Psychological review.

[34]  P. Lichtenstein,et al.  Genetic contributions to the development of ADHD subtypes from childhood to adolescence. , 2006, Journal of the American Academy of Child and Adolescent Psychiatry.

[35]  M. Neale,et al.  Problems with using sum scores for estimating variance components: contamination and measurement noninvariance. , 2005, Twin research and human genetics : the official journal of the International Society for Twin Studies.

[36]  M. Neale,et al.  Implications of absence of measurement invariance for detecting sex limitation and genotype by environment interaction. , 2004, Twin research : the official journal of the International Society for Twin Studies.

[37]  P. Fayers Item Response Theory for Psychologists , 2004, Quality of Life Research.

[38]  D. Borsboom,et al.  The Theoretical Status of Latent Variables , 2003 .

[39]  P. Vernon,et al.  Application of Hierarchical Genetic Models to Raven and WAIS Subtests: A Dutch Twin Study , 2002, Behavior genetics.

[40]  Robert Plomin,et al.  Genetics and general cognitive ability (g) , 2002, Trends in Cognitive Sciences.

[41]  D. Posthuma,et al.  Perceptual Speed and IQ Are Associated Through Common Genetic Factors , 2001, Behavior genetics.

[42]  R. Plomin,et al.  Genetic and environmental covariation between verbal and nonverbal cognitive development in infancy. , 2000, Child development.

[43]  R. P. McDonald,et al.  Test Theory: A Unified Treatment , 1999 .

[44]  Gideon J. Mellenbergh,et al.  Measurement precision in test score and item response models , 1996 .

[45]  D. Fulker,et al.  Multivariate genetic analysis of Wechsler Intelligence Scale for Children—Revised (WISC-R) factors , 1995, Behavior genetics.

[46]  W. Meredith Measurement invariance, factor analysis and factorial invariance , 1993 .

[47]  T. Achenbach Manual for the child behavior checklist/4-18 and 1991 profile , 1991 .

[48]  R. Lennox,et al.  Conventional wisdom on measurement: A structural equation perspective. , 1991 .

[49]  D. Sörbom Model modification , 1989 .

[50]  Gideon J. Mellenbergh,et al.  Item bias and item response theory , 1989 .

[51]  P. Eykhoff,et al.  Model building and parameter estimation as means for intelligent measurement , 1988 .

[52]  Erling B. Andersen,et al.  Sufficient statistics and latent trait models , 1977 .