How to estimate the measurement error variance associated with ancestry proportion estimates.

To show how the variance of the measurement error (ME) associated with individual ancestry proportion estimates can be estimated, especially when the number of ancestral populations (k) is greater than 2. We extend existing internal consistency measures to estimate the ME variance, and we compare these estimates with the ME variance estimated by use of the repeated measurement (RM) approach. Both approaches work by dividing the genotyped markers into subsets. We examine the effect of the number of subsets and of the allocation of markers to each subset on the performance of each approach. We used simulated data for all comparisons. Independently of the value of k, the measures of internal reliability provided less biased and more precise estimates of the ME variance than did those obtained with the RM approach. Both methods tend to perform better when a large number of subsets of markers with similar sizes are considered. Our results will facilitate the use of ME correction methods to address the ME problem in individual ancestry proportion estimates. Our method will improve the ability to control for type I error inflation and loss of power in association tests and other genomic research involving ancestry estimates.

[1]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[2]  Michael Miller Coefficient alpha: A basic introduction from the perspectives of classical test theory and structural equation modeling , 1995 .

[3]  J. Long,et al.  Information on ancestry from genetic markers , 2004, Genetic epidemiology.

[4]  P. Donnelly,et al.  Case-control studies of association in structured or admixed populations. , 2001, Theoretical population biology.

[5]  D. Allison,et al.  Correcting for Measurement Error in Individual Ancestry Estimates in Structured Association Tests , 2007, Genetics.

[6]  M. R. Novick,et al.  Coefficient alpha and the reliability of composite measurements. , 1967, Psychometrika.

[7]  D. Armor Theta Reliability and Factor Scaling , 1973 .

[8]  N. Risch,et al.  Estimation of individual admixture: Analytical and study design considerations , 2005, Genetic epidemiology.

[9]  D. Allison,et al.  Estimating African American admixture proportions by use of population-specific alleles. , 1998, American journal of human genetics.

[10]  Edward G. Carmines,et al.  Assessing the Reliability of Linear Composites , 1980 .

[11]  M. J. Allen Introduction to Measurement Theory , 1979 .

[12]  D. Paul ASYMPTOTICS OF SAMPLE EIGENSTRUCTURE FOR A LARGE DIMENSIONAL SPIKED COVARIANCE MODEL , 2007 .

[13]  Michael I. Jordan,et al.  A randomization test for controlling population stratification in whole-genome association studies. , 2007, American journal of human genetics.

[14]  Scott M. Williams,et al.  A high-density admixture map for disease gene discovery in african americans. , 2004, American journal of human genetics.

[15]  Kimmo Vehkalahti,et al.  Measurement errors in multivariate measurement scales , 2005 .

[16]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[17]  Rui Mei,et al.  Recent genetic selection in the ancestral admixture of Puerto Ricans. , 2007, American journal of human genetics.

[18]  Arnold Zellner,et al.  Introduction to Measurement with Theory , 2009 .

[19]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[20]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[21]  J. Pritchard,et al.  Use of unlinked genetic markers to detect population stratification in association studies. , 1999, American journal of human genetics.

[22]  T. Mark Beasley,et al.  Regional Admixture Mapping and Structured Association Testing: Conceptual Unification and an Extensible General Linear Model , 2006, PLoS genetics.

[23]  M. W. Richardson,et al.  The theory of the estimation of test reliability , 1937 .

[24]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[25]  D. Allison,et al.  Multiple Imputation to Correct for Measurement Error in Admixture Estimates in Genetic Structured Association Testing , 2009, Human Heredity.

[26]  M. Shriver,et al.  Ancestral proportions and their association with skin pigmentation and bone mineral density in Puerto Rican women from New York city , 2004, Human Genetics.

[27]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[28]  L. Crocker,et al.  Introduction to Classical and Modern Test Theory , 1986 .