SNP mining in C. clementina BAC end sequences; transferability in the Citrus genus (Rutaceae), phylogenetic inferences and perspectives for genetic mapping

BackgroundWith the increasing availability of EST databases and whole genome sequences, SNPs have become the most abundant and powerful polymorphic markers. However, SNP chip data generally suffers from ascertainment biases caused by the SNP discovery and selection process in which a small number of individuals are used as discovery panels. The ongoing International Citrus Genome Consortium sequencing project of the highly heterozygous Clementine and sweet orange genomes will soon result in the release of several hundred thousand SNPs. The primary goals of this study were: (i) to estimate the transferability within the genus Citrus of SNPs discovered from Clementine BACend sequencing (BES), (ii) to estimate bias associated with the very narrow discovery panel, and (iii) to evaluate the usefulness of the Clementine-derived SNP markers for diversity analysis and comparative mapping studies between the different cultivated Citrus species.ResultsFifty-four accessions covering the main Citrus species and 52 interspecific hybrids between pummelo and Clementine were genotyped on a GoldenGate array platform using 1,457 SNPs mined from Clementine BES and 37 SNPs identified between and within C. maxima, C. medica, C. reticulata and C. micrantha. Consistent results were obtained from 622 SNP loci. Of these markers, 116 displayed incomplete transferability primarily in C. medica, C. maxima and wild Citrus species. The two primary biases associated with the SNP mining in Clementine were an overestimation of the C. reticulata diversity and an underestimation of the interspecific differentiation. However, the genetic stratification of the gene pool was high, with very frequent significant linkage disequilibrium. Furthermore, the shared intraspecific polymorphism and accession heterozygosity were generally enough to perform interspecific comparative genetic mapping.ConclusionsA set of 622 SNP markers providing consistent results was selected. Of the markers mined from Clementine, 80.5% were successfully transferred to the whole Citrus gene pool. Despite the ascertainment biases in relation to the Clementine origin, the SNP data confirm the important stratification of the gene pools around C. maxima, C. medica and C. reticulata as well as previous hypothesis on the origin of secondary species. The implemented SNP marker set will be very useful for comparative genetic mapping in Citrus and genetic association in C. reticulata.

[1]  J. Kumamoto,et al.  Contribution to the Origin of the Grapefruit, Citrus paradisi (Rutaceae) , 1982 .

[2]  S. Wright Evolution and the Genetics of Populations, Volume 3: Experimental Results and Evolutionary Deductions , 1977 .

[3]  M. Machado,et al.  Development of genetic maps of the citrus varieties ‘Murcott’ tangor and ‘Pêra’ sweet orange by using fluorescent AFLP markers , 2010, Journal of Applied Genetics.

[4]  Patrick Wincker,et al.  Transferability of the EST-SSRs developed on Nules clementine (Citrus clementina Hort ex Tan) to other Citrus species and their effectiveness for genetic mapping , 2008, BMC Genomics.

[5]  A. Brookes The essence of SNPs. , 1999, Gene.

[6]  R. Krueger,et al.  Assessing genetic diversity and population structure in a citrus germplasm collection utilizing simple sequence repeat markers (SSRs) , 2006, Theoretical and Applied Genetics.

[7]  R. Scora,et al.  On the History and Origin of Citrus , 1975 .

[8]  C. Bustamante,et al.  Development of genome-wide SNP assays for rice , 2010 .

[9]  E. Pahlich,et al.  A rapid DNA isolation procedure for small quantities of fresh leaf tissue , 1980 .

[10]  H. C. Barrett,et al.  A Numerical Taxonomic Study of Affinity Relationships in Cultivated Citrus and Its Close Relatives , 1976 .

[11]  J. Pina,et al.  The diversification of Citrus clementina Hort. ex Tan., a vegetatively propagated crop species. , 2001, Molecular phylogenetics and evolution.

[12]  Kejun Liu,et al.  PowerMarker: an integrated analysis environment for genetic marker analysis , 2005, Bioinform..

[13]  K. Saitoh,et al.  Application of single nucleotide polymorphisms to non‐model species: a technical review , 2010, Molecular ecology resources.

[14]  M. Roose,et al.  Phylogenetic relationships within the genus Citrus (Rutaceae) and related genera as revealed by RFLP and RAPD analysis , 1998, Theoretical and Applied Genetics.

[15]  B. Dickson,et al.  High-resolution, high-throughput SNP mapping in Drosophila melanogaster , 2008, Nature Methods.

[16]  Xavier Perrier,et al.  Genetic Diversity of Cultivated Tropical Plants , 2003 .

[17]  M. Morgante,et al.  Corn and humans: recombination and linkage disequilibrium in two genomes of similar size. , 2004, Trends in genetics : TIG.

[18]  D. Conrad,et al.  A high-resolution survey of deletion polymorphism in the human genome , 2006, Nature Genetics.

[19]  Carlos D Bustamante,et al.  Ascertainment bias in studies of human genome-wide polymorphism. , 2005, Genome research.

[20]  T. A. Hall,et al.  BIOEDIT: A USER-FRIENDLY BIOLOGICAL SEQUENCE ALIGNMENT EDITOR AND ANALYSIS PROGRAM FOR WINDOWS 95/98/ NT , 1999 .

[21]  E. Nicolosi,et al.  Citrus phylogeny and genetic origin of important species as investigated by molecular markers , 2000, Theoretical and Applied Genetics.

[22]  Amanda J. Garris,et al.  Population structure and its effect on haplotype diversity and linkage disequilibrium surrounding the xa5 locus of rice (Oryza sativa L.). , 2003, Genetics.

[23]  Dustin A. Cartwright,et al.  A Dense Single-Nucleotide Polymorphism-Based Genetic Linkage Map of Grapevine (Vitis vinifera L.) Anchoring Pinot Noir Bacterial Artificial Chromosome Contigs , 2007, Genetics.

[24]  M. Talón,et al.  Development of genomic resources for Citrus clementina: Characterization of three deep-coverage BAC libraries and analysis of 46,000 BAC end sequences , 2008, BMC Genomics.

[25]  P. Ingvarsson Nucleotide Polymorphism and Linkage Disequilibrium Within and Among Natural Populations of European Aspen (Populus tremula L., Salicaceae) , 2005, Genetics.

[26]  O. Gulsen,et al.  A new citrus linkage map based on SRAP, SSR, ISSR, POGP, RGA and RAPD markers , 2010, Euphytica.

[27]  X. Huang,et al.  CAP3: A DNA sequence assembly program. , 1999, Genome research.

[28]  M. Guerra,et al.  Karyotype diversity and the origin of grapefruit , 2007, Chromosome Research.

[29]  Q. He,et al.  AFLP ANALYSIS AND THE TAXONOMY OF CITRUS , 2007 .

[30]  I. Abdurakhmonov,et al.  Application of Association Mapping to Understanding the Genetic Diversity of Plant Germplasm Resources , 2008, International journal of plant genomics.

[31]  R. Visser,et al.  Construction of a 10,000-Marker Ultradense Genetic Recombination Map of Potato: Providing a Framework for Accelerated Gene Isolation and a Genomewide Physical Map , 2006, Genetics.

[32]  M. Nei Analysis of gene diversity in subdivided populations. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[33]  O. Gulsen,et al.  Genetic diversity and relationships within Citrus and related genera based on sequence related amplified polymorphism markers (SRAPs) , 2009 .

[34]  P. Ollitrault,et al.  Recovery and characterization of a Citrus clementina Hort. ex Tan. 'Clemenules' haploid plant selected to establish the reference whole Citrus genome sequence , 2009, BMC Plant Biology.

[35]  Riccardo Velasco,et al.  A SNP transferability survey within the genus Vitis , 2008, BMC Plant Biology.

[36]  A. Berno,et al.  High-throughput polymorphism screening and genotyping with high-density oligonucleotide arrays. , 1999, Genetic analysis : biomolecular engineering.

[37]  P. Ollitrault,et al.  EVALUATION OF GENETIC RELATIONSHIPS IN CITRUS GENUS BY MEANS OF SEQUENCE TAGGED MICROSATELLITES , 2001 .

[38]  P. Oefner,et al.  The extent of linkage disequilibrium in Arabidopsis thaliana , 2002, Nature Genetics.

[39]  S. Wright,et al.  Evolution and the Genetics of Populations: Volume 2, The Theory of Gene Frequencies , 1968 .

[40]  John Novembre,et al.  Ascertainment bias in spatially structured populations: a case study in the eastern fence lizard. , 2007, The Journal of heredity.

[41]  W. T. Swingle,et al.  The botany of citrus and its wild relatives of the orange subfamily (family Rutaceae, subfamily Aurantioideae) , 1943 .

[42]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[43]  P. Snelgrove,et al.  Evaluating SNP ascertainment bias and its impact on population assignment in Atlantic cod, Gadus morhua , 2011, Molecular ecology resources.

[44]  R. Nielsen,et al.  Ascertainment biases in SNP chips affect measures of population divergence. , 2010, Molecular biology and evolution.

[45]  J. Pina,et al.  Genetic diversity in the orange subfamily Aurantioideae. II. Genetic relationships among genera and species , 1996, Theoretical and Applied Genetics.

[46]  M. Talón,et al.  A reference linkage Map of C. clementina based on SNPs, SSRs and indels (P477) , 2011 .

[47]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[48]  V. Malladi,et al.  The effect of SNP discovery method and sample size on estimation of population genetic data for Chinese and Indian rhesus macaques (Macaca mulatta) , 2011, Primates.

[49]  C. Carlson,et al.  Direct detection of null alleles in SNP genotyping data. , 2006, Human Molecular Genetics.

[50]  Pardis C Sabeti,et al.  Common deletion polymorphisms in the human genome , 2006, Nature Genetics.

[51]  E S Buckler,et al.  Structure of linkage disequilibrium and phenotypic associations in the maize genome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[52]  P. Ollitrault,et al.  New universal mitochondrial PCR markers reveal new information on maternal citrus phylogeny , 2011, Tree Genetics & Genomes.

[53]  M. N. Rao,et al.  EST-SSR genetic maps for Citrus sinensis and Poncirus trifoliata , 2007, Tree Genetics & Genomes.

[54]  M. Daly,et al.  Transferability of tag SNPs in genetic association studies in multiple populations , 2006, Nature Genetics.

[55]  Sewall Wright,et al.  The theory of gene frequencies , 1969 .

[56]  Y. Rogers,et al.  Genetic Bit Analysis: a solid phase method for typing single nucleotide polymorphisms. , 1994, Nucleic acids research.

[57]  C. Carlson,et al.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. , 2004, American journal of human genetics.

[58]  G. P. Bernet,et al.  Comparative genome-wide segregation analysis and map construction using a reciprocal cross design to facilitate citrus germplasm utilization , 2010, Molecular Breeding.

[59]  E. Buckler,et al.  Structure of linkage disequilibrium in plants. , 2003, Annual review of plant biology.