Genome sequencing of rice subspecies and genetic analysis of recombinant lines reveals regional yield- and quality-associated loci

BackgroundTwo of the most widely cultivated rice strains are Oryza sativa indica and O. sativa japonica, and understanding the genetic basis of their agronomic traits is of importance for crop production. These two species are highly distinct in terms of geographical distribution and morphological traits. However, the relationship among genetic background, ecological conditions, and agronomic traits is unclear.ResultsIn this study, we performed the de novo assembly of a high-quality genome of SN265, a cultivar that is extensively cultivated as a backbone japonica parent in northern China, using single-molecule sequencing. Recombinant inbred lines (RILs) derived from a cross between SN265 and R99 (indica) were re-sequenced and cultivated in three distinct ecological conditions. We identify 79 QTLs related to 15 agronomic traits. We found that several genes underwent functional alterations when the ecological conditions were changed, and some alleles exhibited contracted responses to different genetic backgrounds. We validated the involvement of one candidate gene, DEP1, in determining panicle length, using CRISPR/Cas9 gene editing.ConclusionsThis study provides information on the suitable environmental conditions, and genetic background, for functional genes in rice breeding. Moreover, the public availability of the reference genome of northern japonica SN265 provides a valuable resource for plant biologists and the genetic improvement of crops.

[1]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[2]  Michael Y. Galperin,et al.  The COG database: new developments in phylogenetic classification of proteins from complete genomes , 2001, Nucleic Acids Res..

[3]  Susan R. Wessler,et al.  MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences , 2010, Nucleic acids research.

[4]  James C. W. Locke,et al.  Phytochromes function as thermosensors in Arabidopsis , 2016, Science.

[5]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[6]  Zhao Xu,et al.  LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons , 2007, Nucleic Acids Res..

[7]  Qian Qian,et al.  Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm , 2011, Nature Genetics.

[8]  Jens Keilwagen,et al.  Using intron position conservation for homology-based gene prediction , 2016, Nucleic acids research.

[9]  Nansheng Chen,et al.  Genblasta: Enabling Blast to Identify Homologous Gene Sequences , 2022 .

[10]  M. Yano,et al.  Rice gibberellin-insensitive dwarf mutant gene Dwarf 1 encodes the alpha-subunit of GTP-binding protein. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[11]  G. S. Khush,et al.  Green revolution: A mutant gibberellin-synthesis gene in rice , 2002, Nature.

[12]  Enrique Blanco,et al.  Using geneid to Identify Genes , 2002, Current protocols in bioinformatics.

[13]  R. Ishikawa,et al.  Phytochrome B regulates Heading date 1 (Hd1)-mediated expression of rice florigen Hd3a and critical day length in rice , 2011, Molecular Genetics and Genomics.

[14]  Jonathan E. Allen,et al.  Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments , 2007, Genome Biology.

[15]  Nansheng Chen,et al.  Using RepeatMasker to Identify Repetitive Elements in Genomic Sequences , 2009, Current protocols in bioinformatics.

[16]  Q. Qian,et al.  Cytokinin Oxidase Regulates Rice Grain Production , 2005, Science.

[17]  Jianmin Wan,et al.  DTH8 Suppresses Flowering in Rice, Influencing Plant Height and Yield Potential Simultaneously1[W][OA] , 2010, Plant Physiology.

[18]  R. Durbin,et al.  GeneWise and Genomewise. , 2004, Genome research.

[19]  Russell E. Durrett,et al.  Assembly and diploid architecture of an individual human genome via single-molecule technologies , 2015, Nature Methods.

[20]  Tao Huang,et al.  Genomic architecture of heterosis for yield traits in rice , 2016, Nature.

[21]  J. Bennetzen,et al.  Reply: A unified classification system for eukaryotic transposable elements should reflect their phylogeny , 2009, Nature Reviews Genetics.

[22]  Kaworu Ebana,et al.  Deletion in a gene associated with grain size increased yields during rice domestication , 2008, Nature Genetics.

[23]  Ping Li,et al.  A Natural Allele of a Transcription Factor in Rice Confers Broad-Spectrum Blast Resistance , 2017, Cell.

[24]  Qun Xu,et al.  Pan-genome analysis highlights the extent of genomic variation in cultivated and wild rice , 2018, Nature Genetics.

[25]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[26]  Shoshi Kikuchi,et al.  Genome-wide analysis of NAC transcription factor family in rice. , 2010, Gene.

[27]  J. Bennetzen,et al.  A unified classification system for eukaryotic transposable elements , 2007, Nature Reviews Genetics.

[28]  Rod A Wing,et al.  Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63 , 2016, Proceedings of the National Academy of Sciences.

[29]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[30]  Steven Salzberg,et al.  TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders , 2004, Bioinform..

[31]  Qian Qian,et al.  Natural variation at the DEP1 locus enhances grain yield in rice , 2009, Nature Genetics.

[32]  Qifa Zhang,et al.  Genome-wide association studies of 14 agronomic traits in rice landraces , 2010, Nature Genetics.

[33]  Quan Xu,et al.  Deciphering the Environmental Impacts on Rice Quality for Different Rice Cultivated Areas , 2018, Rice.

[34]  Kenneth L. McNally,et al.  Genomic variation in 3,010 diverse accessions of Asian cultivated rice , 2018, Nature.

[35]  Zhijun Cheng,et al.  Isolation and initial characterization of GW5, a major QTL associated with rice grain width and weight , 2008, Cell Research.

[36]  L. Tang,et al.  The contribution of intersubspecific hybridization to the breeding of super-high-yielding japonica rice in northeast China , 2012, Theoretical and Applied Genetics.

[37]  Siu-Ming Yiu,et al.  SOAP2: an improved ultrafast tool for short read alignment , 2009, Bioinform..

[38]  S. Koren,et al.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation , 2016, bioRxiv.

[39]  Eli J. Fine,et al.  DNA targeting specificity of RNA-guided Cas9 nucleases , 2013, Nature Biotechnology.

[40]  J. Bennetzen,et al.  A unified classification system for eukaryotic transposable elements should reflect their phylogeny. , 2009 .

[41]  Quan Xu,et al.  The DENSE AND ERECT PANICLE 1 (DEP1) gene offering the potential in the breeding of high-yielding rice , 2016, Breeding science.

[42]  W. Shen,et al.  SET DOMAIN GROUP 708, a histone H3 lysine 36-specific methyltransferase, controls flowering time in rice (Oryza sativa). , 2016, The New phytologist.

[43]  Ian Korf,et al.  Gene finding in novel genomes , 2004, BMC Bioinformatics.

[44]  Q. Qian,et al.  Breeding high-yield superior quality hybrid super rice by rational design , 2016 .

[45]  Wei Liu,et al.  A Robust CRISPR/Cas9 System for Convenient, High-Efficiency Multiplex Genome Editing in Monocot and Dicot Plants. , 2015, Molecular plant.

[46]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[47]  Eugene W. Myers,et al.  PILER : identification and classification of genomic repeats , 2005 .

[48]  Rolf Apweiler,et al.  InterProScan - an integration platform for the signature-recognition methods in InterPro , 2001, Bioinform..

[49]  Narmada Thanki,et al.  CDD: a Conserved Domain Database for the functional annotation of proteins , 2010, Nucleic Acids Res..

[50]  A. Fujiyama,et al.  A map of rice genome variation reveals the origin of cultivated rice , 2012, Nature.

[51]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[52]  Xiaowu Wang,et al.  Construction and Analysis of High-Density Linkage Map Using High-Throughput Sequencing Data , 2014, PloS one.

[53]  Yan Li,et al.  Sequencing and de novo assembly of a near complete indica rice genome , 2017, Nature Communications.

[54]  Nansheng Chen,et al.  Using RepeatMasker to Identify Repetitive Elements in Genomic Sequences , 2009, Current protocols in bioinformatics.

[55]  M. Matsuoka,et al.  A protocol for Agrobacterium-mediated transformation in rice , 2007, Nature Protocols.

[56]  Mario Stanke,et al.  Gene prediction with a hidden Markov model and a new intron submodel , 2003, ECCB.

[57]  L. Xiong,et al.  Conserved miR164-targeted NAC genes negatively regulate drought resistance in rice , 2014, Journal of experimental botany.

[58]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[59]  Stephen M. Mount,et al.  Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis , 2006, BMC Genomics.

[60]  M. Yano,et al.  A pair of floral regulators sets critical day length for Hd3a florigen expression in rice , 2010, Nature Genetics.

[61]  Sean R. Eddy,et al.  Infernal 1.1: 100-fold faster RNA homology searches , 2013, Bioinform..

[62]  D. Schwartz,et al.  Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data , 2013, Rice.

[63]  Huanming Yang,et al.  SNP detection for massively parallel whole-genome resequencing. , 2009, Genome research.

[64]  Pavel A. Pevzner,et al.  De novo identification of repeat families in large genomes , 2005, ISMB.