WIDDE: a Web-Interfaced next generation database for genetic diversity exploration, with a first application in cattle

BackgroundThe advent and democratization of next generation sequencing and genotyping technologies lead to a huge amount of data for the characterization of population genetic diversity in model and non model-species. However, efficient storage, management, cross-analyzing and exploration of such dense genotyping datasets remain challenging. This is particularly true for the bovine species where many SNP datasets have been generated in various cattle populations with different genotyping tools.DescriptionWe developed WIDDE, a Web-Interfaced Next Generation Database that stands as a generic tool applicable to a wide range of species and marker types (http://widde.toulouse.inra.fr). As a first illustration, we hereby describe its first version dedicated to cattle biodiversity, which includes a large and evolving cattle genotyping dataset for over 750,000 SNPs available on 129 (89 public) different cattle populations representative of the world-wide bovine genetic diversity and on 7 outgroup bovid species. This version proposes an optional marker and individual filtering step, an export of genotyping data in different popular formats, and an exploration of genetic diversity through a principal component analysis. Users can also explore their own genotyping data together with data from WIDDE, assign their samples to WIDDE populations based on distance assignment method and supervised clustering, and estimate their ancestry composition relative to the populations represented in the database.ConclusionThe cattle version of WIDDE represents to our knowledge the first database dedicated to cattle biodiversity and SNP genotyping data that will be very useful for researchers interested in this field. As a generic tool applicable to a wide range of marker types, WIDDE is overall intended to the genetic diversity exploration of any species and will be extended to other species shortly. The structure makes it easy to include additional output formats and new tools dedicated to genetic diversity exploration.

[1]  I. Jackson,et al.  Signatures of Diversifying Selection in European Pig Breeds , 2013, PLoS genetics.

[2]  N. Risch,et al.  Estimation of individual admixture: Analytical and study design considerations , 2005, Genetic epidemiology.

[3]  J. Cornuet,et al.  GENECLASS2: a software for genetic assignment and first-generation migrant detection. , 2004, The Journal of heredity.

[4]  Bertrand Servin,et al.  Genome-Wide Analysis of the World's Sheep Breeds Reveals High Levels of Historic Mixture and Strong Recent Selection , 2012, PLoS biology.

[5]  G Luikart,et al.  New methods employing multilocus genotypes to select or exclude populations as origins of individuals. , 1999, Genetics.

[6]  Mathieu Gautier,et al.  A Quasi-Exclusive European Ancestry in the Senepol Tropical Cattle Breed Highlights the Importance of the slick Locus in Tropical Adaptation , 2012, PloS one.

[7]  Mathieu Gautier,et al.  The Genome Response to Artificial Selection: A Case Study in Dairy Cattle , 2009, PloS one.

[8]  M. Blaxter,et al.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing , 2011, Nature Reviews Genetics.

[9]  G. Abecasis,et al.  A note on exact tests of Hardy-Weinberg equilibrium. , 2005, American journal of human genetics.

[10]  Jun Wang,et al.  Design and Characterization of a 52K SNP Chip for Goats , 2014, PloS one.

[11]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[12]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[13]  Mathieu Gautier,et al.  A whole genome Bayesian scan for adaptive genetic divergence in West African cattle , 2009, BMC Genomics.

[14]  M. P. Heaton,et al.  Worldwide Patterns of Ancestry, Divergence, and Admixture in Domesticated Cattle , 2013, PLoS genetics.

[15]  Mathieu Gautier,et al.  Insights into the Genetic History of French Cattle from Dense SNP Data on 47 Worldwide Breeds , 2010, PloS one.

[16]  Timothy P. L. Smith,et al.  Development and Characterization of a High Density SNP Genotyping Assay for Cattle , 2009, PloS one.

[17]  M. Gautier,et al.  Adaptive admixture in the West African bovine hybrid zone: insight from the Borgou population , 2014, Molecular ecology.

[18]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[19]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[20]  Robert D Schnabel,et al.  Resolving the evolution of extant and extinct ruminants with high-throughput phylogenomics , 2009, Proceedings of the National Academy of Sciences.

[21]  Mathieu Gautier,et al.  Footprints of selection in the ancestral admixture of a New World Creole cattle breed , 2011, Molecular ecology.

[22]  J. Cornuet,et al.  Computer Note GENECLASS 2 : A Software for Genetic Assignment and First-Generation Migrant Detection , 2004 .

[23]  K. Worley,et al.  The Genome Sequence of Taurine Cattle: A Window to Ruminant Biology and Evolution , 2009, Science.