Ancestral Components of Admixed Genomes in a Mexican Cohort

For most of the world, human genome structure at a population level is shaped by interplay between ancient geographic isolation and more recent demographic shifts, factors that are captured by the concepts of biogeographic ancestry and admixture, respectively. The ancestry of non-admixed individuals can often be traced to a specific population in a precise region, but current approaches for studying admixed individuals generally yield coarse information in which genome ancestry proportions are identified according to continent of origin. Here we introduce a new analytic strategy for this problem that allows fine-grained characterization of admixed individuals with respect to both geographic and genomic coordinates. Ancestry segments from different continents, identified with a probabilistic model, are used to construct and study “virtual genomes” of admixed individuals. We apply this approach to a cohort of 492 parent–offspring trios from Mexico City. The relative contributions from the three continental-level ancestral populations—Africa, Europe, and America—vary substantially between individuals, and the distribution of haplotype block length suggests an admixing time of 10–15 generations. The European and Indigenous American virtual genomes of each Mexican individual can be traced to precise regions within each continent, and they reveal a gradient of Amerindian ancestry between indigenous people of southwestern Mexico and Mayans of the Yucatan Peninsula. This contrasts sharply with the African roots of African Americans, which have been characterized by a uniform mixing of multiple West African populations. We also use the virtual European and Indigenous American genomes to search for the signatures of selection in the ancestral populations, and we identify previously known targets of selection in other populations, as well as new candidate loci. The ability to infer precise ancestral components of admixed genomes will facilitate studies of disease-related phenotypes and will allow new insight into the adaptive and demographic history of indigenous people.

[1]  Scott M. Williams,et al.  The Genetic Structure and History of Africans and African Americans , 2009, Science.

[2]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[3]  D. Absher,et al.  Characterizing the admixed African ancestry of African Americans , 2009, Genome Biology.

[4]  Pardis C Sabeti,et al.  Detecting recent positive selection in the human genome from haplotype structure , 2002, Nature.

[5]  H. Takayanagi Osteoimmunology: shared mechanisms and crosstalk between the immune and bone systems , 2007, Nature Reviews Immunology.

[6]  Anders Albrechtsen,et al.  Natural Selection and the Distribution of Identity-by-Descent in the Human Genome , 2010, Genetics.

[7]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[8]  Joseph K. Pickrell,et al.  Signals of recent positive selection in a worldwide sample of human populations. , 2009, Genome research.

[9]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[10]  N. Risch,et al.  Reconstructing genetic ancestry blocks in admixed individuals. , 2006, American journal of human genetics.

[11]  Stephen L. Hauser,et al.  Genome-wide patterns of population structure and admixture in West Africans and African Americans , 2009, Proceedings of the National Academy of Sciences.

[12]  Carlos D Bustamante,et al.  Localizing Recent Adaptive Evolution in the Human Genome , 2007, PLoS genetics.

[13]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[14]  Warren W. Kretzschmar,et al.  Balancing Selection Maintains a Form of ERAP2 that Undergoes Nonsense-Mediated Decay and Affects Antigen Presentation , 2010, PLoS genetics.

[15]  R. Caetano,et al.  Hispanic Americans Baseline Alcohol Survey (HABLAS): alcohol-related problems across Hispanic national groups. , 2009, Journal of studies on alcohol and drugs.

[16]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[17]  P. Gregersen,et al.  Accounting for ancestry: population substructure and genome-wide association studies. , 2008, Human molecular genetics.

[18]  Rui Mei,et al.  Identifying Signatures of Natural Selection in Tibetan and Andean Populations Using Dense Genome Scan Data , 2010, PLoS genetics.

[19]  Or Zuk,et al.  A Composite of Multiple Signals Distinguishes Causal Variants in Regions of Positive Selection , 2010, Science.

[20]  H. Ostrer,et al.  Genome-wide patterns of population structure and admixture among Hispanic/Latino populations , 2010, Proceedings of the National Academy of Sciences.

[21]  D. Reich,et al.  Sensitive Detection of Chromosomal Segments of Distinct Ancestry in Admixed Populations , 2009, PLoS genetics.

[22]  N. Risch,et al.  Dissecting complex diseases in complex populations: asthma in latino americans. , 2007, Proceedings of the American Thoracic Society.

[23]  P. Sullivan,et al.  Genome-Wide Association Study Implicates Chromosome 9q21.31 as a Susceptibility Locus for Asthma in Mexican Children , 2009, PLoS genetics.

[24]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[25]  Á. Carracedo,et al.  Charting the ancestry of African Americans. , 2005, American journal of human genetics.

[26]  B. Browning,et al.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. , 2009, American journal of human genetics.

[27]  Amit R. Indap,et al.  A role for clonal inactivation in T cell tolerance to Mls-1a , 2008, Nature.

[28]  Philippa Marrack,et al.  A role for clonal inactivation in T cell tolerance to Mls-1a , 2008, Nature.

[29]  Eduardo Barrientos,et al.  Analysis of genomic diversity in Mexican Mestizo populations to develop genomic medicine in Mexico , 2009, Proceedings of the National Academy of Sciences.

[30]  N. Risch,et al.  Estimation of individual admixture: Analytical and study design considerations , 2005, Genetic epidemiology.

[31]  Lin Geng,et al.  A genetic interaction network of five genes for human polycystic kidney and liver diseases defines polycystin-1 as the central determinant of cyst formation , 2011, Nature Genetics.

[32]  M. Shriver,et al.  Admixture in the Hispanics of the San Luis Valley, Colorado, and its implications for complex trait gene mapping , 2004, Annals of human genetics.

[33]  P. Visscher,et al.  Whole-genome genetic diversity in a sample of Australians with deep Aboriginal ancestry. , 2010, American journal of human genetics.

[34]  Mattias Jakobsson,et al.  Genetic Variation and Population Structure in Native Americans , 2007, PLoS genetics.

[35]  R. Nielsen,et al.  Inference of Historical Changes in Migration Rate From the Lengths of Migrant Tracts , 2009, Genetics.

[36]  Xiaofeng Zhu,et al.  Interrogating local population structure for fine mapping in genome-wide association studies , 2010, Bioinform..

[37]  M. Feldman,et al.  Genetic Structure of Human Populations , 2002, Science.

[38]  Rui Mei,et al.  Recent genetic selection in the ancestral admixture of Puerto Ricans. , 2007, American journal of human genetics.

[39]  L. Borrell,et al.  Self-Reported Diabetes in Hispanic Subgroup, Non-Hispanic Black, and Non-Hispanic White Populations: National Health Interview Survey, 1997–2005 , 2009, Public health reports.

[40]  Juha Karhunen,et al.  Principal Component Analysis for Large Scale Problems with Lots of Missing Values , 2007, ECML.

[41]  R. Tibshirani,et al.  PATHWISE COORDINATE OPTIMIZATION , 2007, 0708.1485.