Selectiongain: an R package for optimizing multi-stage selection

Multi-stage selection is practised in numerous fields of the life sciences and particularly in breeding. A special characteristic of multi-stage selection is that candidates are evaluated in successive stages with increasing intensity and efforts, and only a fraction of the superior candidates is selected and promoted to the next stage. For the optimum design of such selection programs, the selection gain $$\varDelta G(y)$$ΔG(y) plays a central role. It can be calculated by integration of a truncated multivariate normal distribution. While mathematical formulas for calculating $$\varDelta G(y)$$ΔG(y) and $$\psi (y)$$ψ(y), the variance among the selected candidates, were developed a long time ago, solutions and software for numerical calculations were not available. We developed the R package selectiongain for efficient and precise calculation of $$\varDelta G(y)$$ΔG(y) and $$\psi (y)$$ψ(y) for (i) a given matrix $$\varvec{\varSigma }^{*}$$Σ∗ of correlations among the unobservable target character and the selection criteria and (ii) given coordinates $$\mathbf Q $$Q of the truncation point or the selected fractions $$\varvec{\alpha }$$α in each stage. In addition, our software can be used for optimizing multi-stage selection programs under a given total budget and different costs of evaluating the candidates in each stage. Besides a detailed description of the functions of the software, the package is illustrated with two examples.

[1]  William H. Press,et al.  Book-Review - Numerical Recipes in Pascal - the Art of Scientific Computing , 1989 .

[2]  D. Anderson,et al.  Algorithms for minimization without derivatives , 1974 .

[3]  Anthony J. Hayter,et al.  The evaluation of general non‐centred orthant probabilities , 2003 .

[4]  B. S. Dhillon,et al.  Hybrid maize breeding with doubled haploids: V. Selection strategies for testcross performance with variable sizes of crosses and S1 families , 2010, Theoretical and Applied Genetics.

[5]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[6]  A. Genz,et al.  Numerical computation of multivariate t-probabilities with application to power calculation of multiple contrasts , 1999 .

[7]  W. Press,et al.  Numerical Recipes in Fortran: The Art of Scientific Computing.@@@Numerical Recipes in C: The Art of Scientific Computing. , 1994 .

[8]  J. Reif,et al.  Hybrid maize breeding with doubled haploids: III. Efficiency of early testing prior to doubled haploid production in two-stage selection for testcross performance , 2007, Theoretical and Applied Genetics.

[9]  T. Hothorn,et al.  Multivariate Normal and t Distributions , 2016 .

[10]  G. Terrell,et al.  Iterated grid search algorithm on unimodal criteria , 1997 .

[11]  William H. Press,et al.  Numerical Recipes: FORTRAN , 1988 .

[12]  D. Falconer,et al.  Introduction to Quantitative Genetics. , 1962 .

[13]  J. Villeneuve,et al.  Selection of a multiple drug-resistant hepatitis B virus strain in a liver-transplanted patient. , 2006, Gastroenterology.

[14]  R. Brent Table errata: Algorithms for minimization without derivatives (Prentice-Hall, Englewood Cliffs, N. J., 1973) , 1975 .

[15]  R. Frankham Introduction to quantitative genetics (4th edn): by Douglas S. Falconer and Trudy F.C. Mackay Longman, 1996. £24.99 pbk (xv and 464 pages) ISBN 0582 24302 5 , 1996 .

[16]  W. G. Cochran,et al.  Improvement by Means of Selection , 1951 .

[17]  W. Muir,et al.  Multistage selection for maximum economic return with an application to beef cattle breeding. , 1995, Journal of animal science.

[18]  Jianjun Shi,et al.  Quality control and improvement for multistage systems: A survey , 2009 .

[19]  Torsten Hothorn,et al.  mvtnorm: New numerical algorithm for multivariate normal probabilities , 2009 .

[20]  G. M. Tallis The Moment Generating Function of the Truncated Multi‐Normal Distribution , 1961 .

[21]  M. West-Eberhard Sexual Selection, Social Competition, and Speciation , 1983, The Quarterly Review of Biology.

[22]  D. Falconer Introduction to quantitative genetics. 1. ed. , 1984 .

[23]  William H. Press,et al.  Numerical recipes in C. The art of scientific computing , 1987 .

[24]  Wei Yan,et al.  Evolving robust GP solutions for hedge fund stock selection in emerging markets , 2011, Soft Comput..

[25]  M. Lynch,et al.  Genetics and Analysis of Quantitative Traits , 1996 .

[26]  B. S. Dhillon,et al.  Best linear unbiased prediction and optimum allocation of test resources in maize breeding with doubled haploids , 2011, Theoretical and Applied Genetics.

[27]  Torsten Hothorn,et al.  New Numerical Algorithm for Multivariate Normal Probabilities in Package mvtnorm , 2009 .