Genetical and Mathematical Properties of Similarity and Dissimilarity Coefficients Applied in Plant Breeding and Seed Bank Management

A proper choice of a dissimilarity measure is important in surveys investigating genetic relationships among germplasm with molecular marker data. The objective of our study was to examine 10 dissimilarity coefficients widely used in germplasm surveys, with special focus on applications in plant breeding and seed banks. In particular, we (i) investigated the genetical and mathematical properties of these coefficients, (ii) examined consequences of these properties for different areas of application in plant breeding and seed banks, and (iii) determined relationships between these 10 coefficients. The genetical and mathematical concepts of the coefficients were described in detail. A Procrustes analysis of a published data set consisting of seven CIMMYT maize populations demonstrated close affinity between Euclidean, Rogers', modified Rogers', and Cavalli-Sforza and Edwards' distance on one hand, and Nei's standard and Reynolds dissimilarity on the other hand. Our investigations show that genetical and mathematical properties of dissimilarity measures are of crucial importance when choosing a genetic dissimilarity coefficient for analyzing molecular marker data. The presented results assist experimenters to extract the maximum amount of information from genetic data and, thus, facilitate the interpretation of findings from molecular marker studies on a theoretically sound basis.

[1]  Yoshio Tateno,et al.  Accuracy of estimated phylogenetic trees from molecular data , 1983, Journal of Molecular Evolution.

[2]  J. Reif,et al.  Genetic distance based on simple sequence Repeats and Heterosis in Tropical Maize Populations , 2003 .

[3]  Seyed Abolghasem Mohammadi,et al.  Analysis of Genetic Diversity in Crop Plants—Salient Statistical Tools and Considerations , 2003 .

[4]  S. Kresovich,et al.  Molecular and Historical Aspects of Corn Belt Dent Diversity , 2003 .

[5]  T. V. Hintum,et al.  Marker-assisted rationalisation of genetic resource collections: a case study in flax using AFLPs , 2001, Theoretical and Applied Genetics.

[6]  R. Jansen,et al.  A comment on codominant scoring of AFLP markers. , 2001, Genetics.

[7]  V. Lombard,et al.  Genetic Relationships and Fingerprinting of Rapeseed Cultivars by AFLP: Consequences for Varietal Registration , 2000 .

[8]  H. Piepho,et al.  Codominant analysis of banding data from a dominant marker system by normal mixtures. , 2000, Genetics.

[9]  M. Bohn,et al.  PLABSIM: software for simulation of marker-assisted backcrossing. , 2000, The Journal of heredity.

[10]  E. Brummer,et al.  Capturing heterosis in forage crop cultivar development , 1999 .

[11]  P. Dubreuil,et al.  Genetic diversity within and among maize populations: a comparison between isozyme and nuclear RFLP loci , 1998, Theoretical and Applied Genetics.

[12]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[13]  A. Melchinger,et al.  Diversity and Relationships among U.S. Maize Inbreds Revealed by Restriction Fragment Length Polymorphisms , 1991 .

[14]  J. Gower,et al.  Metric and Euclidean properties of dissimilarity coefficients , 1986 .

[15]  B S Weir,et al.  Estimation of the coancestry coefficient: basis for a short-term genetic distance. , 1983, Genetics.

[16]  M. Goodman Races of maize. 6: Isozyme variation among races of maize in Bolivia , 1983 .

[17]  M. Nei,et al.  Mathematical model for studying genetic variation in terms of restriction endonucleases. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[18]  M. Nei The theory of genetic distance and evolution of human races , 1978, Japanese Journal of Human Genetics.

[19]  J. Gower Generalized procrustes analysis , 1975 .

[20]  Masatoshi Nei,et al.  Genetic Distance between Populations , 1972, The American Naturalist.

[21]  L. Cavalli-Sforza,et al.  PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES , 1967, Evolution; international journal of organic evolution.

[22]  J. Crow,et al.  THE NUMBER OF ALLELES THAT CAN BE MAINTAINED IN A FINITE POPULATION. , 1964, Genetics.

[23]  R. Abelson,et al.  The additive constant problem in multidimensional scaling , 1956 .

[24]  M. Kimura,et al.  Process Leading to Quasi-Fixation of Genes in Natural Populations Due to Random Fluctuation of Selection Intensities. , 1954, Genetics.

[25]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .