Discovery of Genome-Wide DNA Polymorphisms in a Landrace Cultivar of Japonica Rice by Whole-Genome Sequencing

Molecular breeding approaches are of growing importance to crop improvement. However, closely related cultivars generally used for crossing material lack sufficient known DNA polymorphisms due to their genetic relatedness. Next-generation sequencing allows the identification of a massive number of DNA polymorphisms such as single nucleotide polymorphisms (SNPs) and insertions–deletions (InDels) between highly homologous genomes. Using this technology, we performed whole-genome sequencing of a landrace of japonica rice, Omachi, which is used for sake brewing and is an important source for modern cultivars. A total of 229 million reads, each comprising 75 nucleotides of the Omachi genome, was generated with 45-fold coverage and uniquely mapped to 89.7% of the Nipponbare genome, a closely related cultivar. We identified 132,462 SNPs, 16,448 insertions and 19,318 deletions between the Omachi and Nipponbare genomes. An SNP array was designed to validate 731 selected SNPs, resulting in validation rates of 95 and 88% for the Omachi and Nipponbare genomes, respectively. Among the 577 SNPs validated in both genomes, 532 are entirely new SNP markers not previously reported between related rice cultivars. We also validated InDels on a part of chromosome 2 as DNA markers and successfully genotyped five japonica rice cultivars. Our results present the methodology and extensive data on SNPs and InDels available for whole-genome genotyping and marker-assisted breeding. The polymorphism information between Omachi and Nipponbare is available at NGRC_Rice_Omachi (http://www.nodai-genome.org/oryza_sativa_en.html).

[1]  M. Yano,et al.  Core single-nucleotide polymorphisms—a tool for genetic analysis of the Japanese rice population , 2010 .

[2]  C. Bustamante,et al.  Development of genome-wide SNP assays for rice , 2010 .

[3]  M. Yano,et al.  Fine definition of the pedigree haplotypes of closely related rice cultivars by means of genome-wide discovery of single-nucleotide polymorphisms , 2010, BMC Genomics.

[4]  D. Palm,et al.  Discovery and application of insertion-deletion (INDEL) polymorphisms for QTL mapping of early life-history traits in Atlantic salmon , 2010, BMC Genomics.

[5]  Sebastian Bauer,et al.  Microindel detection in short-read sequence data , 2010, Bioinform..

[6]  Kenneth L. McNally,et al.  Genomewide SNP variation reveals relationships among landraces and modern varieties of rice , 2009, Proceedings of the National Academy of Sciences.

[7]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[8]  R. Fernando,et al.  Factors Affecting Accuracy From Genomic Selection in Populations Derived From Multiple Inbred Lines: A Barley Case Study , 2009, Genetics.

[9]  Thomas Altmann,et al.  SNP identification in crop plants. , 2009, Current opinion in plant biology.

[10]  Sang Hong Lee,et al.  Predicting Unobserved Phenotypes for Complex Traits from Whole-Genome SNP Data , 2008, PLoS genetics.

[11]  D. Mackill,et al.  Molecular Markers and Their Use in Marker-Assisted Selection in Rice , 2008 .

[12]  Wenying Xu,et al.  Case study for identification of potentially indel-caused alternative expression isoforms in the rice subspecies japonica and indica by integrative genome analysis. , 2008, Genomics.

[13]  Yoshihiro Kawahara,et al.  The Rice Annotation Project Database (RAP-DB): 2008 update , 2007, Nucleic Acids Res..

[14]  Miron Livny,et al.  Validation of rice genome sequence by optical mapping , 2007, BMC Genomics.

[15]  K. Gunderson,et al.  Indel arrays: an affordable alternative for genotyping. , 2007, The Plant journal : for cell and molecular biology.

[16]  Kanako O. Koyanagi,et al.  Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana. , 2007, Genome research.

[17]  Yasuyuki Fujii,et al.  The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa ssp. japonica genome information , 2005, Nucleic Acids Res..

[18]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[19]  C. Nakamura,et al.  QTL Analysis for Plant and Grain Characters of Sake-brewing Rice Using a Doubled Haploid Population , 2002 .

[20]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[21]  A. Rafalski Applications of single nucleotide polymorphisms in crop genetics. , 2002, Current opinion in plant biology.

[22]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[23]  Cai-guo Xu,et al.  Comparative analysis of microsatellite DNA polymorphism in landraces and cultivars of rice , 1994, Molecular and General Genetics MGG.

[24]  Bjarni J. Vilhjálmsson,et al.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines , 2010 .

[25]  Claude-Alain H. Roten,et al.  Theoretical and practical advances in genome halving , 2004 .