Variant Phasing and Haplotypic Expression from Single-molecule Long-read Sequencing in Maize

Haplotype phasing of genetic variants is important for interpreting the maize genome, for population genetic analysis, and for functional genomic analysis of allelic activity. Accurate methods for phasing full-length isoforms are therefore essential for functional genomic studies. Here, we performed isoform-level phasing in maize, using two inbred lines and their reciprocal crosses, based on single-molecule full-length cDNA sequencing. To phase and analyze full-length transcripts in hybrids and their parents, we developed a tool called IsoPhase. Using this tool, we validated the majority of called SNPs against matching short-read data and identified cases of allele-specific expression at the gene and isoform levels. Our results revealed that maize parental and hybrid lines exhibit different splicing activities. After phasing 6,847 genes in two reciprocal hybrids using embryo, endosperm, and root tissues, we annotated the SNPs and identified large-effect genes. In addition, based on single-molecule sequencing, we identified parent-of-origin isoforms in maize hybrids, novel isoforms that differ between parental and hybrid lines, and imprinted genes in different tissues. Finally, we characterized variation in cis- and trans-regulatory effects. Our study provides measures of haplotypic expression that could increase power and accuracy in studies of allelic expression.
