Allele-specific binding of RNA-binding proteins reveals functional genetic variants in the RNA

Allele-specific protein-RNA binding is an essential aspect that may reveal functional genetic variants (GVs) mediating post-transcriptional regulation. Recently, genome-wide detection of in vivo binding of RNA-binding proteins is greatly facilitated by the enhanced crosslinking and immunoprecipitation (eCLIP) method. We developed a new computational approach, called BEAPR, to identify allele-specific binding (ASB) events in eCLIP-Seq data. BEAPR takes into account crosslinking-induced sequence propensity and variations between replicated experiments. Using simulated and actual data, we show that BEAPR largely outperforms often-used count analysis methods. Importantly, BEAPR overcomes the inherent overdispersion problem of these methods. Complemented by experimental validations, we demonstrate that the application of BEAPR to ENCODE eCLIP-Seq data of 154 proteins helps to predict functional GVs that alter splicing or mRNA abundance. Moreover, many GVs with ASB patterns have known disease relevance. Overall, BEAPR is an effective method that helps to address the outstanding challenge of functional interpretation of GVs.Differential binding of RNA-binding proteins mediated by genetic variants (GVs) can influence posttranscriptional regulation. Here, the authors develop BEAPR, a computational approach to identify allele-specific binding events in eCLIP-Seq data.

[1]  M. Rosbash,et al.  A cooperative interaction between U2AF65 and mBBP/SF1 facilitates branchpoint region recognition. , 1998, Genes & development.

[2]  J. Valcárcel,et al.  Inhibition of msl-2 splicing by Sex-lethal reveals interaction between U2AF35 and the 3′ splice site AG , 1999, Nature.

[3]  Thomas Blumenthal,et al.  Both subunits of U2AF recognize the 3′ splice site in Caenorhabditis elegans , 1999, Nature.

[4]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[5]  M. Mann,et al.  Gemin5, a Novel WD Repeat Protein Component of the SMN Complex That Binds Sm Proteins* , 2002, The Journal of Biological Chemistry.

[6]  Henning Urlaub,et al.  Characterization of novel SF3b and 17S U2 snRNP proteins, including a human Prp5p homologue and an SF3b DEAD‐box protein , 2002, The EMBO journal.

[7]  †The International HapMap Consortium The International HapMap Project , 2003, Nature.

[8]  Jernej Ule,et al.  CLIP Identifies Nova-Regulated RNA Networks in the Brain , 2003, Science.

[9]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[10]  J. Beggs,et al.  Prp8 protein: at the heart of the spliceosome. , 2005, RNA.

[11]  T. Hamilton,et al.  Introns Regulate the Rate of Unstable mRNA Decay* , 2007, Journal of Biological Chemistry.

[12]  Zhaojing Meng,et al.  Alterations in Gemin5 expression contribute to alternative mRNA splicing patterns and tumor cell motility. , 2008, Cancer research.

[13]  R. Luehrmann At the heart of the spliceosome. , 2008 .

[14]  T. Glisovic,et al.  RNA‐binding proteins and post‐transcriptional gene regulation , 2008, FEBS letters.

[15]  Eric T. Wang,et al.  Splice Site Strength-Dependent Activity and Genetic Buffering by Poly-G Runs , 2009, Nature Structural &Molecular Biology.

[16]  S. Luo,et al.  High-Resolution Analysis of Parent-of-Origin Allelic Expression in the Mouse Brain , 2010, Science.

[17]  C Joel McManus,et al.  Regulatory divergence in Drosophila revealed by mRNA-seq. , 2010, Genome research.

[18]  Heng Li,et al.  Improving SNP discovery by base alignment quality , 2011, Bioinform..

[19]  J. Lupski,et al.  Human genome sequencing in health and disease. , 2012, Annual review of medicine.

[20]  Sébastien Tempel Using and understanding RepeatMasker. , 2012, Methods in molecular biology.

[21]  Stanley F. Nelson,et al.  Identification of allele-specific alternative mRNA processing via transcriptome sequencing , 2012, Nucleic acids research.

[22]  Julian König,et al.  Analysis of CLIP and iCLIP methods for nucleotide-resolution studies of protein-RNA interactions , 2012, Genome Biology.

[23]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[24]  Gene W. Yeo,et al.  Rbfox proteins regulate alternative mRNA splicing through evolutionarily conserved RNA bridges , 2013, Nature Structural &Molecular Biology.

[25]  A. D. den Hollander,et al.  Genome-wide association study identifies genetic risk underlying primary rhegmatogenous retinal detachment. , 2013, Human molecular genetics.

[26]  Charity W. Law,et al.  voom: precision weights unlock linear model analysis tools for RNA-seq read counts , 2014, Genome Biology.

[27]  Xinshu Xiao,et al.  Analysis and design of RNA sequencing experiments for identifying RNA editing and other single-nucleotide variants. , 2013, RNA.

[28]  S. Gerstberger,et al.  A census of human RNA-binding proteins , 2014, Nature Reviews Genetics.

[29]  P. Sharp,et al.  RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. , 2014, Molecular cell.

[30]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[31]  Yu Zhou,et al.  De Novo Prediction of PTBP1 Binding and Splicing Targets Reveals Unexpected Features of Its RNA Recognition and Function , 2014, PLoS Comput. Biol..

[32]  J. Mata,et al.  Systematic Analysis of the Role of RNA-Binding Proteins in the Regulation of RNA Stability , 2014, PLoS genetics.

[33]  Beiying Liu,et al.  LASS2/TMSG1 inhibits growth and invasion of breast cancer cell in vitro through regulation of vacuolar ATPase activity , 2015, Tumor Biology.

[34]  F. Guo,et al.  The DGCR8 RNA-binding heme domain recognizes primary microRNAs by clamping the hairpin. , 2014, Cell reports.

[35]  Ian J. Deary,et al.  Common genetic variants associated with cognitive performance identified using the proxy-phenotype method , 2014, Proceedings of the National Academy of Sciences.

[36]  D. Rio,et al.  Mechanisms and Regulation of Alternative Pre-mRNA Splicing. , 2015, Annual review of biochemistry.

[37]  S. Choi,et al.  Introns: The Functional Benefits of Introns in Genomes , 2015, Genomics & informatics.

[38]  G. Kempermann Faculty Opinions recommendation of Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. , 2015 .

[39]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[40]  Wei Cheng,et al.  CERS2 Suppresses Tumor Cell Invasion and is Associated with Decreased V‐ATPase and MMP‐2/MMP‐9 Activities in Breast Cancer , 2015, Journal of cellular biochemistry.

[41]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[42]  J. Lupski,et al.  Non-coding genetic variants in human disease. , 2015, Human molecular genetics.

[43]  Xinshu Xiao,et al.  Alternative splicing modulated by genetic variants demonstrates accelerated evolution regulated by highly conserved proteins , 2016, Genome research.

[44]  D. Krakow,et al.  Altered mRNA Splicing, Chondrocyte Gene Expression and Abnormal Skeletal Development due to SF3B4 Mutations in Rodriguez Acrofacial Dysostosis , 2016, PLoS genetics.

[45]  Gene W. Yeo,et al.  Robust transcriptome-wide discovery of RNA binding protein binding sites with enhanced CLIP (eCLIP) , 2016, Nature Methods.

[46]  O. Mühlemann,et al.  Nonsense‐mediated mRNA decay: novel mechanistic insights and biological impact , 2016, Wiley interdisciplinary reviews. RNA.

[47]  Henning Urlaub,et al.  Molecular Architecture of SF3b and Structural Consequences of Its Cancer-Related Mutations. , 2016, Molecular cell.

[48]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[49]  Christian Gieger,et al.  Connecting genetic risk to disease end points through the human blood plasma proteome , 2016, Nature Communications.

[50]  Faculty Opinions recommendation of Nonsense-mediated mRNA decay: novel mechanistic insights and biological impact. , 2017 .

[51]  T. Cooper,et al.  The roles of RNA processing in translating genotype to phenotype , 2016, Nature Reviews Molecular Cell Biology.

[52]  Gene W. Yeo,et al.  A Large-Scale Binding and Functional Map of Human RNA Binding Proteins , 2017, bioRxiv.

[53]  Esko Ukkonen,et al.  Fast motif matching revisited: high‐order PWMs, SNPs and indels , 2016, Bioinform..

[54]  Hanlee P. Ji,et al.  Haplotype-resolved and integrated genome analysis of the cancer cell line HepG2 , 2018, bioRxiv.

[55]  Eric L Van Nostrand,et al.  Sequence, Structure and Context Preferences of Human RNA Binding Proteins , 2017, bioRxiv.

[56]  Eric L Van Nostrand,et al.  Widespread RNA editing dysregulation in brains from autistic individuals , 2018, Nature Neuroscience.

[57]  Zhao Zhang,et al.  PancanQTL: systematic identification of cis-eQTLs and trans-eQTLs in 33 cancer types , 2017, Nucleic Acids Res..

[58]  Nan Yang,et al.  CancerSplicingQTL: a database for genome-wide identification of splicing QTLs in human cancer , 2018, Nucleic Acids Res..

[59]  Hanlee P. Ji,et al.  Haplotype-resolved and integrated genome analysis of the cancer cell line HepG2 , 2019, Nucleic acids research.

[60]  Noah Spies,et al.  Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562. , 2019, Genome research.