GeneOnEarth: Fitting Genetic PC Plots on the Globe

Principal component (PC) plots have become widely used to summarize genetic variation of individuals in a sample. The similarity between genetic distance in PC plots and geographical distance has shown to be quite impressive. However, in most situations, individual ancestral origins are not precisely known or they are heterogeneously distributed; hence, they are hardly linked to a geographical area. We have developed GeneOnEarth, a user-friendly web-based tool to help geneticists to understand whether a linear isolation-by-distance model may apply to a genetic data set; thus, genetic distances among a set of individuals resemble geographical distances among their origins. Its main goal is to allow users to first apply a by-view Procrustes method to visually learn whether this model holds. To do that, the user can choose the exact geographical area from an on line 2D or 3D world map by using, respectively, Google Maps or Google Earth, and rotate, flip, and resize the images. GeneOnEarth can also compute the optimal rotation angle using Procrustes analysis and assess statistical evidence of similarity when a different rotation angle has been chosen by the user. An online version of GeneOnEarth is available for testing and using purposes at >http://bios.ugr.es/GeneOnEarth.

[1]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[2]  Michael F. Seldin,et al.  Analysis of East Asia Genetic Substructure Using Genome-Wide SNP Arrays , 2008, PloS one.

[3]  Ole A. Andreassen,et al.  The Impact of Divergence Time on the Nature of Population Structure: An Example from Iceland , 2009, PLoS genetics.

[4]  Stephen L. Hauser,et al.  Genome-wide patterns of population structure and admixture in West Africans and African Americans , 2009, Proceedings of the National Academy of Sciences.

[5]  Anuj Srivastava,et al.  Statistical Shape Analysis , 2014, Computer Vision, A Reference Guide.

[6]  Eran Halperin,et al.  A model-based approach for analysis of spatial structure in genetic data , 2012, Nature Genetics.

[7]  Christian Gieger,et al.  Correlation between Genetic and Geographic Structure in Europe , 2008, Current Biology.

[8]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[9]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[10]  Jinchuan Xing,et al.  Fine-scaled human genetic structure revealed by SNP microarrays. , 2009, Genome research.

[11]  Noah A. Rosenberg,et al.  A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations , 2012, PLoS genetics.

[12]  M. Seielstad,et al.  Genetic structure of the Han Chinese population revealed by genome-wide SNP variation. , 2009, American journal of human genetics.

[13]  W. Marsden I and J , 2012 .

[14]  Nicholas A. Johnson,et al.  Ancestral Components of Admixed Genomes in a Mexican Cohort , 2011, PLoS genetics.

[15]  Zachary A. Szpiech,et al.  Statistical Applications in Genetics and Molecular Biology Comparing Spatial Maps of Human Population-Genetic Variation Using Procrustes Analysis , 2011 .

[17]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[18]  John Novembre,et al.  The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. , 2008, American journal of human genetics.

[19]  Snigdhansu Chatterjee,et al.  Procrustes Problems , 2005, Technometrics.

[20]  Alkes L. Price,et al.  New approaches to population stratification in genome-wide association studies , 2010, Nature Reviews Genetics.

[21]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[22]  Jinchuan Xing,et al.  Toward a more uniform sampling of human genetic diversity: a survey of worldwide populations by high-density genotyping. , 2010, Genomics.