JWAS: Julia implementation of Whole-genome Analyses Software

Genome-wide high-density molecular markers (e.g., SNPs) are increasingly being incorporated into animal and plant breeding programs to speed up genetic improvement through genomic prediction. The same statistical models can also be used for genome-wide association studies. Bayesian multiple-regression methods are widely used in genomic prediction with complete genomic data (all phenotyped individuals in the analysis are genotyped). These methods have been extended to accommodate incomplete genomic data (some phenotyped animals not genotyped), simultaneously using all available pedigree, phenotypic and genomic information (“single-step" Bayesian methods). We have developed a well-documented software tool called JWAS (acronym for "Julia Whole-genome Analysis Software”) in a relatively new scientific programming language, Julia, which approaches the computing speed of compiled languages such as C, C++ or Fortran, but has the benefits of dynamic languages such as R or Python. JWAS is a well-documented software platform based on Julia and an interactive Jupyter notebook for analyses of general univariate and multivariate Bayesian mixed effects models. These models are especially useful for, but not limited to, routine single-trait and multi-trait genomic prediction and genome-wide association studies using either complete or incomplete genomic data ("single-step" methods). Currently, JWAS provides broad scope of analyses, e.g., a wide collection of Bayesian methods for whole-genome analyses, including shrinkage estimation and variable selection methods. We believe the friendly user interface and fast computing speed of JWAS will provide power and convenience for users in both industry and academia to analyse large datasets. Further, as a well-documented open-source software tool, JWAS will also be used by a group of active community members, who will contribute to the source code and help maintain the project. Junior scientists can understand and learn the methodologies for whole-genome analyses by using JWAS and reading the tutorials and source code. JWAS is available with source code and documentation at http://QTL.rocks.

[1]  Rohan L. Fernando,et al.  Extension of the bayesian alphabet for genomic selection , 2011, BMC Bioinformatics.

[2]  Jack Dekkers,et al.  Comparison of alternative approaches to single-trait genomic prediction using genotyped and non-genotyped Hanwoo beef cattle , 2017, Genetics Selection Evolution.

[3]  R. Fernando,et al.  Genomic prediction of simulated multibreed and purebred performance using observed fifty thousand single nucleotide polymorphism genotypes. , 2010, Journal of animal science.

[4]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[5]  Jack Dekkers,et al.  Application of Whole-Genome Prediction Methods for Genome-Wide Association Studies: A Bayesian Approach , 2014 .

[6]  José Crossa,et al.  Prediction of Genetic Values of Quantitative Traits in Plant Breeding Using Pedigree and Molecular Markers , 2010, Genetics.

[7]  Ignacy Misztal,et al.  Experiences with a single-step genome evaluation. , 2013, Poultry science.

[8]  M Erbe,et al.  Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels. , 2012, Journal of dairy science.

[9]  Rohan L Fernando,et al.  A class of Bayesian methods to combine large numbers of genotyped and non-genotyped animals for whole-genome analyses , 2014, Genetics Selection Evolution.

[10]  Brian E. Granger,et al.  IPython: A System for Interactive Scientific Computing , 2007, Computing in Science & Engineering.

[11]  Rohan L Fernando,et al.  An efficient exact method to obtain GBLUP and single-step GBLUP when the genomic relationship matrix is singular , 2016, Genetics Selection Evolution.

[12]  A. Maw Poultry Science. , 1961, Science.

[13]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[14]  D. Gianola Priors in Whole-Genome Regression: The Bayesian Alphabet Returns , 2013, Genetics.

[15]  I Misztal,et al.  A relationship matrix including full pedigree and genomic information. , 2009, Journal of dairy science.

[16]  M. Goddard,et al.  Genetic Architecture of Complex Traits and Accuracy of Genomic Prediction: Coat Colour, Milk-Fat Percentage, and Type in Holstein Cattle as Contrasting Model Traits , 2010, PLoS genetics.

[17]  R. Fernando,et al.  Computational strategies for alternative single-step Bayesian regression models with large numbers of genotyped and non-genotyped animals , 2016, Genetics Selection Evolution.