Differential BUM-HMM: a robust statistical modelling approach for detecting RNA flexibility changes in high-throughput structure probing data

Combining RNA structure probing with high-throughput sequencing technologies has greatly enhanced our ability to analyze RNA structure at transcriptome scale. However, the high level of noise and variability encountered in these data call for the development of computational tools that robustly extract RNA structural information. Here we present diffBUM-HMM, a noise-aware model that enables accurate detection of RNA flexibility and conformational changes from high-throughput RNA structure-probing data. DiffBUM-HMM is compatible with a wide variety of high-throughput RNA structure probing data, taking into consideration biological variation, sequence coverage and sequence representation biases. We demonstrate that, compared to the existing approaches, diffBUM-HMM displays higher sensitivity while calling virtually no false positives. DiffBUM-HMM analysis of ex vivo and in vivo Xist SHAPE-MaP data detected many more RNA structural differences, involving mostly single-stranded nucleotides located at or near protein-binding sites. Collectively, our analyses demonstrate the value of diffBUM-HMM for quantitatively detecting RNA structural changes and reinforce the notion that RNA structure probing is a very powerful tool for identifying protein-binding sites.

[1]  Howard Y. Chang,et al.  Genome-wide measurement of RNA secondary structure in yeast , 2010, Nature.

[2]  Stuart Aitken,et al.  Snapshots of pre-rRNA structural flexibility reveal eukaryotic 40S assembly dynamics at nucleotide resolution , 2014, Nucleic acids research.

[3]  S. Belikov,et al.  Maturation of the 90S pre-ribosome requires Mrd1 dependent U3 snoRNA and 35S pre-rRNA structural rearrangements , 2018, Nucleic acids research.

[4]  Howard Y. Chang,et al.  Structural imprints in vivo decode RNA regulatory mechanisms , 2015, Nature.

[5]  E. Heard,et al.  Noncoding RNAs and epigenetic mechanisms during X-chromosome inactivation. , 2014, Annual review of cell and developmental biology.

[6]  Cole Trapnell,et al.  Multiplexed RNA structure characterization with selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) , 2011, Proceedings of the National Academy of Sciences.

[7]  K. Weeks,et al.  Principles for targeting RNA with drug-like small molecules , 2018, Nature Reviews Drug Discovery.

[8]  K. Weeks,et al.  Time-resolved RNA SHAPE chemistry. , 2008, Journal of the American Chemical Society.

[9]  D. Haussler,et al.  FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing , 2010, Nature Methods.

[10]  Niranjan Nagarajan,et al.  Genome-wide identification of natural RNA aptamers in prokaryotes and eukaryotes , 2018, Nature Communications.

[11]  Kaoru Inoue,et al.  SHAPE reveals transcript-wide interactions, complex structural domains, and protein interactions across the Xist lncRNA in living cells , 2016, Proceedings of the National Academy of Sciences.

[12]  Grzegorz Kudla,et al.  PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription termination regulates expression of hundreds of protein coding genes in yeast , 2014, Genome Biology.

[13]  J. Weissman,et al.  DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo , 2016, Nature Methods.

[14]  Christina D. Smolke,et al.  Biomedical Applications of RNA-Based Devices. , 2017, Current opinion in biomedical engineering.

[15]  Y. Zhang,et al.  In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features , 2013, Nature.

[16]  Steven Busan,et al.  RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP) , 2014, Nature Methods.

[17]  Howard Y. Chang,et al.  RNA SHAPE analysis in living cells. , 2013, Nature chemical biology.

[18]  C. Thermes,et al.  Library preparation methods for next-generation sequencing: tone down the bias. , 2014, Experimental cell research.

[19]  Guido Sanguinetti,et al.  Robust statistical modeling improves sensitivity of high-throughput RNA structure probing experiments , 2016, Nature Methods.

[20]  Qiangfeng Cliff Zhang,et al.  Landscape and variation of RNA secondary structure across the human transcriptome , 2014, Nature.

[21]  S. Aviran,et al.  dStruct: identifying differentially reactive regions from RNA structurome profiling data , 2019, Genome Biology.

[22]  Kyle E. Watters,et al.  SHAPE-Seq 2.0: systematic optimization and extension of high-throughput chemical probing of RNA secondary structure with next generation sequencing , 2014, Nucleic acids research.

[23]  K. Weeks,et al.  Physical and Functional Analysis of Viral RNA Genomes by SHAPE. , 2019, Annual review of virology.

[24]  K. Weeks,et al.  Selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE): quantitative RNA structure analysis at single nucleotide resolution , 2006, Nature Protocols.

[25]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[26]  Aharon Nachshon,et al.  Virus-Induced Changes in mRNA Secondary Structure Uncover cis-Regulatory Elements that Directly Control Gene Expression. , 2018, Molecular cell.

[27]  Manolis Kellis,et al.  Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo , 2013, Nature.

[28]  Bo Li,et al.  Metrics for rapid quality control in RNA structure probing experiments , 2016, Bioinform..

[29]  K. Weeks,et al.  A fast-acting reagent for accurate analysis of RNA secondary and tertiary structure by SHAPE chemistry. , 2007, Journal of the American Chemical Society.

[30]  D. Murdoch,et al.  P-Values are Random Variables , 2008 .

[31]  Roland Marquet,et al.  Structural and Functional Motifs in Influenza Virus RNAs , 2018, Front. Microbiol..

[32]  S. Cockroft,et al.  High-throughput RNA structure probing reveals critical folding events during early 60S ribosome assembly in yeast , 2017, Nature Communications.

[33]  J. Woolford,et al.  Mod-seq: high-throughput sequencing for chemical probing of RNA structure , 2014, RNA.

[34]  Qiangfeng Cliff Zhang,et al.  Systematic Discovery of Xist RNA Binding Proteins , 2015, Cell.

[35]  K. Weeks,et al.  Guidelines for SHAPE Reagent Choice and Detection Strategy for RNA Structure Probing Studies. , 2019, Biochemistry.

[36]  Matthew D. Edwards,et al.  Determination of RNA structural diversity and its role in HIV-1 RNA splicing , 2020, Nature.

[37]  Alain Laederach,et al.  Classification of RNA structure change by ‘gazing’ at experimental data , 2017, Bioinform..

[38]  K. Weeks,et al.  Detection of RNA-Protein Interactions in Living Cells with SHAPE. , 2015, Biochemistry.

[39]  S. Volinia,et al.  SNPs and Somatic Mutation on Long Non-Coding RNA: New Frontier in the Cancer Studies? , 2018, High-throughput.