Combinatorial Genetics Reveals a Scaling Law for the Effects of Mutations on Splicing

Despite a wealth of molecular knowledge, quantitative laws for accurate prediction of biological phenomena remain rare. Alternative pre-mRNA splicing is an important regulated step in gene expression frequently perturbed in human disease. To understand the combined effects of mutations during evolution, we quantified the effects of all possible combinations of exonic mutations accumulated during the emergence of an alternatively spliced human exon. This revealed that mutation effects scale non-monotonically with the inclusion level of an exon, with each mutation having maximum effect at a predictable intermediate inclusion level. This scaling is observed genome-wide for cis and trans perturbations of splicing, including for natural and disease-associated variants. Mathematical modeling suggests that competition between alternative splice sites is sufficient to cause this non-linearity in the genotype-phenotype map. Combining the global scaling law with specific pairwise interactions between neighboring mutations allows accurate prediction of the effects of complex genotype changes involving >10 mutations.

[1]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[2]  Guido Sanguinetti,et al.  Network of epistatic interactions within a yeast snoRNA , 2016, Science.

[3]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[4]  J. Kinney,et al.  Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence , 2010, Proceedings of the National Academy of Sciences.

[5]  M Cazzola,et al.  Disruption of SF3B1 results in deregulated expression and splicing of key genes and pathways in myelodysplastic syndrome hematopoietic stem and progenitor cells , 2014, Leukemia.

[6]  Nicholas C. Wu,et al.  A Comprehensive Biophysical Description of Pairwise Epistasis throughout an Entire Protein Domain , 2014, Current Biology.

[7]  Z. Yakhini,et al.  Systematic Dissection of the Sequence Determinants of Gene 3’ End Mediated Expression Control , 2015, PLoS genetics.

[8]  Matthias Heinig,et al.  Alternative Splicing Signatures in RNA‐seq Data: Percent Spliced in (PSI) , 2015, Current protocols in human genetics.

[9]  Jay Shendure,et al.  High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis , 2009, Nature Biotechnology.

[10]  P. Phillips Epistasis — the essential role of gene interactions in the structure and evolution of genetic systems , 2008, Nature Reviews Genetics.

[11]  Robert B. Heckendorn,et al.  Should evolutionary geneticists worry about higher-order epistasis? , 2013, Current opinion in genetics & development.

[12]  Ruiqiang Li,et al.  Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells , 2013, Nature Structural &Molecular Biology.

[13]  G. Fiucci,et al.  Three functional soluble forms of the human apoptosis-inducing Fas molecule are produced by alternative splicing. , 1995, Journal of immunology.

[14]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[15]  Z. Darieva,et al.  A Competitive Transcription Factor Binding Mechanism Determines the Timing of Late Cell Cycle-Dependent Gene Expression , 2010, Molecular cell.

[16]  Jingyue Ju,et al.  Quantitative evaluation of all hexamers as exonic splicing elements. , 2011, Genome research.

[17]  Mark Johnson,et al.  NCBI BLAST: a better web interface , 2008, Nucleic Acids Res..

[18]  Dan S. Tawfik,et al.  Stability effects of mutations and protein evolvability. , 2009, Current opinion in structural biology.

[19]  Jingyue Ju,et al.  Saturation mutagenesis reveals manifold determinants of exon definition , 2018, Genome research.

[20]  Dmitry Chudakov,et al.  Local fitness landscape of the green fluorescent protein , 2016, Nature.

[21]  J. Shendure,et al.  The origins, determinants, and consequences of human mutations , 2015, Science.

[22]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[23]  B. Frey,et al.  The human splicing code reveals new insights into the genetic determinants of disease , 2015, Science.

[24]  Jianzhi Zhang,et al.  The fitness landscape of a tRNA gene , 2016, Science.

[25]  F. Allain,et al.  RRM-RNA recognition: NMR or crystallography…and new findings. , 2013, Current opinion in structural biology.

[26]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[27]  Joseph B Hiatt,et al.  Massively parallel functional dissection of mammalian enhancers in vivo , 2012, Nature Biotechnology.

[28]  Ben Lehner Genotype to phenotype: lessons from model organisms for human genetics , 2013, Nature Reviews Genetics.

[29]  Frank J. Poelwijk,et al.  The Context-Dependence of Mutations: A Linkage of Formalisms , 2015, PLoS Comput. Biol..

[30]  T. Mikkelsen,et al.  Rapid dissection and model-based optimization of inducible enhancers in human cells using a massively parallel reporter assay , 2012, Nature Biotechnology.

[31]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[32]  Eran Segal,et al.  Deciphering the rules by which 5′-UTR sequences affect protein expression in yeast , 2013, Proceedings of the National Academy of Sciences.

[33]  D. Duelli,et al.  Targeting RNA splicing for disease therapy , 2013, Wiley interdisciplinary reviews. RNA.

[34]  B. Blencowe,et al.  An atlas of alternative splicing profiles and functional associations reveals new regulatory programs and genes that simultaneously express multiple major isoforms , 2017, Genome Research.

[35]  T. Hastie,et al.  Principal Curves , 2007 .

[36]  Alan Medlar,et al.  Wasabi: An Integrated Platform for Evolutionary Sequence Analysis and Data Visualization. , 2016, Molecular biology and evolution.

[37]  C. Burge,et al.  Evolutionary Dynamics of Gene and Isoform Regulation in Mammalian Tissues , 2012, Science.

[38]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[39]  K. Talbot,et al.  The clinical landscape for SMA in a new therapeutic era , 2017, Gene Therapy.

[40]  C. Smith,et al.  The organization of RNA contacts by PTB for regulation of FAS splicing , 2014, Nucleic acids research.

[41]  J. Valcárcel,et al.  The complete local genotype–phenotype landscape for the alternative splicing of a human exon , 2016, Nature Communications.

[42]  J. König,et al.  Decoding a cancer-relevant splicing decision in the RON proto-oncogene using high-throughput mutagenesis , 2018, Nature Communications.

[43]  O. Gascuel,et al.  SeaView version 4: A multiplatform graphical user interface for sequence alignment and phylogenetic tree building. , 2010, Molecular biology and evolution.

[44]  Sudhir Kumar,et al.  TimeTree: A Resource for Timelines, Timetrees, and Divergence Times. , 2017, Molecular biology and evolution.

[45]  D. Baker,et al.  High Resolution Mapping of Protein Sequence–Function Relationships , 2010, Nature Methods.

[46]  Georg Seelig,et al.  Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences , 2015, Cell.

[47]  J. Valcárcel,et al.  The pathogenicity of splicing defects: mechanistic insights into pre‐mRNA processing inform novel therapeutic approaches , 2015, EMBO reports.

[48]  J. Valcárcel,et al.  The apoptosis-promoting factor TIA-1 is a regulator of alternative pre-mRNA splicing. , 2000, Molecular cell.

[49]  Richard A. Watson,et al.  PERSPECTIVE:SIGN EPISTASIS AND GENETIC CONSTRAINT ON EVOLUTIONARY TRAJECTORIES , 2005 .

[50]  E. Gerhart H. Wagner,et al.  Massive functional mapping of a 5′-UTR by saturation mutagenesis, phenotypic sorting and deep sequencing , 2013, Nucleic acids research.

[51]  Brendan J. Frey,et al.  Deciphering the splicing code , 2010, Nature.

[52]  G. K. Ackers,et al.  Asymmetric Cooperativity in a Symmetric Tetramer: Human Hemoglobin* , 2006, Journal of Biological Chemistry.

[53]  Erik Verschueren,et al.  Integration of Protein Abundance and Structure Data Reveals Competition in the ErbB Signaling Network , 2013, Science Signaling.

[54]  Michael J. Harms,et al.  High-order epistasis shapes evolutionary trajectories , 2017, PLoS Comput. Biol..

[55]  Kamil J. Cygan,et al.  Pathogenic variants that alter protein code often disrupt splicing , 2017, Nature Genetics.

[56]  Ben Lehner,et al.  Pairwise and higher order genetic interactions during the evolution of a tRNA , 2018, Nature.

[57]  N. Barbosa-Morais,et al.  psichomics: graphical application for alternative splicing quantification and analysis , 2018, bioRxiv.

[58]  Ben Lehner,et al.  Molecular mechanisms of epistasis within and between genes. , 2011, Trends in genetics : TIG.

[59]  Jean Thierry-Mieg,et al.  Tissue-specific transcriptome sequencing analysis expands the non-human primate reference transcriptome resource (NHPRTR) , 2014, Nucleic Acids Res..

[60]  Panagiotis K. Papasaikas,et al.  Genome-wide identification of Fas/CD95 alternative splicing regulators reveals links with iron homeostasis. , 2015, Molecular cell.

[61]  G. K. Ackers,et al.  Quantitative model for gene regulation by lambda phage repressor. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[62]  Ben Lehner,et al.  The genetic landscape of a physical interaction , 2018, eLife.

[63]  L. Chasin,et al.  Searching for splicing motifs. , 2007, Advances in experimental medicine and biology.

[64]  B. Frey,et al.  Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing , 2008, Nature Genetics.

[65]  Robert Castelo,et al.  Regulation of Fas alternative splicing by antagonistic effects of TIA-1 and PTB on exon definition. , 2005, Molecular cell.

[66]  Jiajie Zhang,et al.  PEAR: a fast and accurate Illumina Paired-End reAd mergeR , 2013, Bioinform..