RNA-Seq Uncovers SNPs and Alternative Splicing Events in Asian Lotus (Nelumbo nucifera)

RNA-Seq is an efficient way to comprehensively identify single nucleotide polymorphisms (SNPs) and alternative splicing (AS) events in expressed genes. In this study, we sequenced the transcriptomes of four Asian lotus (Nelumbo nucifera) cultivars on the Illumina HiSeq 2000 platform to identify SNPs and AS events in lotus. A total of 505 million paired-end RNA-Seq reads were generated from the four cultivars, of which 86% mapped to the lotus reference genome. Combining the four datasets, we identified 357,689 putative SNPs, an average density of one SNP per 2.2 kb. These SNPs were located in 1,253 scaffolds and 15,016 expressed genes. A/G and C/T were the two major SNP types in the Asian lotus transcriptome. In parallel, 177,540 AS events were detected in the four cultivars, distributed across 64% of the expressed genes of lotus. The predominant AS type was alternative 5’ first exon, which accounted for 41.2% of all observed AS events, whereas exon skipping accounted for only 4.3%. Gene Ontology analysis was conducted to characterize the functions of the genes containing SNPs and AS events. Validation of selected SNPs and AS events showed that 74% of the SNPs and 80% of the AS events were reliable, which indicates that RNA-Seq is an efficient approach to uncover gene-associated SNPs and AS events. The large number of SNPs and AS events identified in this study will facilitate further genetic and functional genomics research in lotus.
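
The SNP summary statistics reported above (SNP type counts and SNPs per kb) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names and the toy SNP list are hypothetical. The abstract does not state the sequence length over which density was computed, so the span used below is simply the one implied by the reported figures (357,689 SNPs × 2.2 kb ≈ 787 Mb).

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def snp_type(ref, alt):
    """Return an orientation-free label such as 'A/G' or 'C/T'."""
    return "/".join(sorted((ref.upper(), alt.upper())))


def snp_class(ref, alt):
    """Classify a biallelic SNP as a transition or a transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or ref not in BASES or alt not in BASES:
        raise ValueError(f"not a valid SNP: {ref}>{alt}")
    pair = {ref, alt}
    # Purine<->purine or pyrimidine<->pyrimidine changes are transitions.
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


def kb_per_snp(total_bp, n_snps):
    """Average spacing between SNPs, in kilobases."""
    return total_bp / n_snps / 1000


# Hypothetical toy input: (reference allele, alternate allele) pairs.
toy_snps = [("A", "G"), ("G", "A"), ("C", "T"), ("A", "C"), ("T", "C")]
type_counts = Counter(snp_type(r, a) for r, a in toy_snps)
class_counts = Counter(snp_class(r, a) for r, a in toy_snps)

# Assumption: span implied by the abstract's own numbers, not a stated value.
implied_span_bp = 357_689 * 2_200
density = kb_per_snp(implied_span_bp, 357_689)
```

A/G and C/T are the two transition types, so the predominance of these two SNP types corresponds to an excess of transitions over transversions among the called SNPs.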
