Variation block-based genomics method for crop plants

Background: In contrast with wild species, the genomes of cultivated crops consist of reshuffled recombination blocks that arose through crossing and selection. Accordingly, recombination block-based genomics analysis can be an effective approach for screening target loci for agricultural traits.

Results: We propose the variation block method, a three-step process for detecting and comparing recombination blocks. The first step detects variations by comparing the short-read DNA sequences of each cultivar with the reference genome of the target crop. Next, sequence blocks with variation patterns are examined and defined. The boundaries between the variation-containing sequence blocks are regarded as recombination sites. All the assumed recombination sites in the cultivar set are used to split the genomes, and the resulting sequence regions are termed variation blocks. Finally, the genomes are compared using the variation blocks. The variation block method accurately identified recurring recombination blocks and successfully represented block-level diversity in the publicly available genomes of 31 soybean and 23 rice accessions. The practicality of this approach was demonstrated by the identification of a putative locus determining soybean hilum color.

Conclusions: We suggest that the variation block method is an efficient genomics method for the recombination block-level comparison of crop genomes. We expect that this method will facilitate the development of crop genomics by bringing genomics technologies to the field of crop breeding.
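The three steps described in the Results can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that variant positions have already been called against the reference (step one), and it uses a simple gap rule to define variation-containing segments, which the abstract does not specify. The function names, the `max_gap` parameter, and the toy data are all hypothetical.

```python
from bisect import bisect_left


def variant_segments(positions, max_gap):
    """Group variant positions into segments (assumed rule: a gap larger
    than max_gap between consecutive variants starts a new segment)."""
    segments = []
    for pos in sorted(positions):
        if segments and pos - segments[-1][1] <= max_gap:
            segments[-1][1] = pos  # extend the current segment
        else:
            segments.append([pos, pos])  # open a new segment
    return [tuple(s) for s in segments]


def pooled_breakpoints(variants_by_sample, max_gap):
    """Treat every segment boundary in every sample as an assumed
    recombination site and pool them across the cultivar set."""
    points = set()
    for positions in variants_by_sample.values():
        for start, end in variant_segments(positions, max_gap):
            points.add(start)    # a block begins at the first variant
            points.add(end + 1)  # the next block begins after the last variant
    return sorted(points)


def variation_blocks(breakpoints, chrom_length):
    """Split a chromosome (1-based, inclusive coordinates) at the pooled sites."""
    edges = sorted({1, chrom_length + 1, *breakpoints})
    return [(a, b - 1) for a, b in zip(edges, edges[1:])]


def block_profile(positions, blocks):
    """Label each block 'V' if the sample carries a variant in it, else 'R'
    (reference-like), so genomes can be compared block by block."""
    ordered = sorted(positions)
    labels = []
    for start, end in blocks:
        i = bisect_left(ordered, start)
        has_variant = i < len(ordered) and ordered[i] <= end
        labels.append("V" if has_variant else "R")
    return "".join(labels)


# Toy data (hypothetical): variant positions on one 100-bp chromosome.
variants_by_sample = {
    "A": [10, 12, 15, 60, 62],
    "B": [10, 12, 15],
    "C": [60, 62, 90],
}
blocks = variation_blocks(pooled_breakpoints(variants_by_sample, max_gap=5), 100)
profiles = {name: block_profile(pos, blocks)
            for name, pos in variants_by_sample.items()}
```

In this toy case, samples A and B share the variant block at positions 10 to 15, while A and C share the block at 60 to 62, so the block-level profiles expose which segments recur across accessions. A real analysis would need a more careful block definition (for example, one based on variant density) and per-chromosome handling.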
