Genome-wide signatures of male-mediated migration shaping the Indian gene pool

Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India.

Li Jin | Laxmi Parida | Tatiana V Tatarinova | Jaume Bertranpetit | David Comas | Matthew C Dulik | Lluis Quintana-Murci | Nirav C Merchant | Nirav C. Merchant | Alan Cooper | Colin Renfrew | Chris Tyler-Smith | Petr Triska | R. J. Mitchell | R John Mitchell | V. S. Arun | Himla Soodyall | C. Tyler-Smith | Asif Javed | E. Matisoo-Smith | J. Ziegle | R. Pitchappan | E. Balanovska | O. Balanovsky | D. Comas | Marta Melé | J. Bertranpetit | C. Renfrew | L. Parida | A. Royyuru | T. Tatarinova | D. Platt | Marc Haber | A. Cooper | W. Haak | T. Schurr | H. Soodyall | L. Quintana-Murci | P. Zalloua | Li Jin | G. ArunKumar | Shilin Li | P. Tříska | Andrew C. Clarke | M. Vilar | F. R. Santos | Bennett Greenspan | R. Spencer Wells | R. Wells | Asif Javed | Wolfgang Haak | M. Dulik | Daniel E Platt | Marc Haber | R Spencer Wells | Pierre A Zalloua | Ramasamy Pitchappan | Ajay K Royyuru | Angela Hobbs | Fabrício R Santos | Oleg Balanovsky | Marta Melé | Andrew C Clarke | Janet S Ziegle | Shilin Li | Elena Balanovska | GaneshPrasad ArunKumar | Jeff Duty | Debra Rollo | Adhikarla Syama | Varatharajan Santhakumari Arun | Valampuri John Kavitha | Bennett Greenspan | Christina J Elena Oleg Jaume Andrew C David Alan Clio SI Mat Adlera Balanovska Balanovsky Bertranpet | Christina J Adlera | Clio SI Der Sarkissian | Jill B Gaieski | Angela Hobbs | Matthew E Kaplan | Begoña Martínez-Cruz | Elizabeth A Matisoo-Smith | Amanda C Owings | Daniela R Lacerda | Theodore G Schurr | David F Soria Hernanz | Pandikumar Swamikrishnan | Pedro Paulo Vieira | Miguel G Vilar | R Spencer Wells | David F. Soria Hernanz | A. Owings | A. Syama | M. Kaplan | D. R. Lacerda | Clio Der Sarkissian | Pandikumar Swamikrishnan | V. Kavitha | J. Duty | B. Martínez-Cruz | P. Vieira | Debra Rollo | Christina J Elena Oleg Jaume Andrew C David Alan Clio SI Mat Adlera Balanovska Balanovsky Bertranpet

[1]  M. Stoneking,et al.  Mitochondrial DNA analysis reveals diverse histories of tribal populations from India , 2003, European Journal of Human Genetics.

[2]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[3]  Kenneth Lange,et al.  Enhancements to the ADMIXTURE algorithm for individual ancestry estimation , 2011, BMC Bioinformatics.

[4]  Maido Remm,et al.  Population genetic structure in Indian Austroasiatic speakers: the role of landscape barriers and sex-specific admixture. , 2011, Molecular biology and evolution.

[5]  Joseph K. Pickrell,et al.  Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data , 2012, PLoS genetics.

[6]  Bonnie Berger,et al.  Genetic evidence for recent population mixture in India. , 2013, American journal of human genetics.

[7]  R. Trivedi,et al.  Genetic Imprints of Pleistocene Origin of Indian Populations: A Comprehensive Phylogeographic Sketch of Indian Y-Chromosomes , 2008 .

[8]  T. Zerjal,et al.  The Eurasian Heartland: A continental perspective on Y-chromosome diversity , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Eran Halperin,et al.  A model-based approach for analysis of spatial structure in genetic data , 2012, Nature Genetics.

[10]  R. J. Herrera,et al.  Separating the post-Glacial coancestry of European and Asian Y chromosomes within haplogroup R1a , 2010, European Journal of Human Genetics.

[11]  Shuhua Xu,et al.  Human Migration through Bottlenecks from Southeast Asia into East Asia during Last Glacial Maximum Revealed by Y Chromosomes , 2011, PloS one.

[12]  P. Majumder The Human Genetic History of South Asia , 2010, Current Biology.

[13]  G. Chaubey,et al.  BMC Evolutionary Biology BioMed Central Research article Phylogeography of mtDNA haplogroup R7 in the Indian peninsula , 2022 .

[14]  L. Singh,et al.  Y-chromosome evidence suggests a common paternal heritage of Austro-Asiatic populations , 2007, BMC Evolutionary Biology.

[15]  Klaus Mehnert,et al.  The Southern Route , 1942 .

[16]  Peter A Underhill,et al.  New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. , 2008, Genome research.

[17]  Mehedi Hassan,et al.  Differential Evolution Approach to Detect Recent Admixture , 2015 .

[18]  R. J. Herrera,et al.  The Himalayas as a directional barrier to gene flow. , 2007, American journal of human genetics.

[19]  Q. Kong,et al.  The dazzling array of basal branches in the mtDNA macrohaplogroup M from India as inferred from complete genomes. , 2006, Molecular biology and evolution.

[20]  Hans-Jürgen Bandelt,et al.  Phylogeny of mitochondrial DNA macrohaplogroup N in India, based on complete sequencing: implications for the peopling of South Asia. , 2004, American journal of human genetics.

[21]  David W. McAlpin Proto Elamo Dravidian: The Evidence and Its Implications , 1981 .

[22]  P. Majumder Ethnic populations of India as seen from an evolutionary perspective , 2001, Journal of Biosciences.

[23]  R. Villems,et al.  Ancient DNA from European Early Neolithic Farmers Reveals Their Near Eastern Affinities , 2010, PLoS biology.

[24]  P. Luisi,et al.  Population and genomic lessons from genetic analysis of two Indian populations , 2014, Human Genetics.

[25]  V. Usik,et al.  The Southern Route “Out of Africa”: Evidence for an Early Expansion of Modern Humans into Arabia , 2011, Science.

[26]  V. S. Arun,et al.  Population Differentiation of Southern Indian Male Lineages Correlates with Agricultural Expansions Predating the Caste System , 2012, PloS one.

[27]  L. Singh,et al.  Austro-Asiatic Tribes of Northeast India Provide Hitherto Missing Genetic Link between South and Southeast Asia , 2007, PloS one.

[28]  Alkes L. Price,et al.  Reconstructing Indian Population History , 2009, Nature.

[29]  Christopher A. Edmonds,et al.  Polarity and temporality of high-resolution y-chromosome distributions in India identify both indigenous and exogenous expansions and reveal minor genetic influence of Central Asian pastoralists. , 2006, American journal of human genetics.

[30]  Tatiana V Tatarinova,et al.  Differential Evolution approach to detect recent admixture , 2015, bioRxiv.

[31]  Luca Pagani,et al.  The GenoChip: A New Tool for Genetic Anthropology , 2012, Genome biology and evolution.

[32]  L. Cavalli-Sforza Human evolution and its relevance for genetic epidemiology. , 2007, Annual review of genomics and human genetics.

[33]  Ajay K. Royyuru,et al.  Geographic population structure analysis of worldwide human populations infers their biogeographical origins , 2014, Nature Communications.

[34]  P. Majumder,et al.  Ethnic India: a genomic view, with special reference to peopling and structure. , 2003, Genome research.

[35]  Thibaut Jombart,et al.  adegenet: a R package for the multivariate analysis of genetic markers , 2008, Bioinform..

[36]  S. Mastana,et al.  Genetic variability of transferrin subtypes in the populations of India. , 1998, Human biology.

[37]  M. Metspalu,et al.  A prehistory of Indian Y chromosomes: evaluating demic diffusion scenarios. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[38]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[39]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[40]  Shirley A. Miller,et al.  A simple salting out procedure for extracting DNA from human nucleated cells. , 1988, Nucleic acids research.

[41]  C. Tyler-Smith,et al.  A Worldwide Survey of Human Male Demographic History Based on Y-SNP and Y-STR Data from the HGDP–CEPH Populations , 2009, Molecular biology and evolution.

[42]  K. A. Nilakanta Sastri,et al.  A History of South India: From Prehistoric Times to the Fall of Vijayanagar , 1955 .

[43]  John Novembre,et al.  Perspectives on human population structure at the cusp of the sequencing era. , 2011, Annual review of genomics and human genetics.

[44]  P. Underhill,et al.  The genetic heritage of the earliest settlers persists both in Indian tribal and caste populations. , 2003, American journal of human genetics.