New scoring system to identify RNA G-quadruplex folding

G-quadruplexes (G4s) are non-canonical structures involved in many important cellular processes. To date, the prediction of potential G-quadruplex structures (PG4s) has been based almost exclusively on the sequence of interest agreeing with the algorithm Gx-N-1–7-Gx-N1–7-Gx-N1–7-Gx (where x ≥ 3 and N = A, U, G or C). However, many sequences agreeing with this algorithm do not form G4s and are considered false-positive predictions. Here we show the RNA PG4 candidate in the 3′-untranslated region (UTR) of the TTYH1 gene to be one such false positive. Specifically, G4 folding was observed to be inhibited by the presence of multiple-cytosine tracks, located in the candidate’s genomic context, that adopted a Watson–Crick base-paired structure. Clearly, the neighbouring sequences of a PG4 may influence its folding. The secondary structure of 12 PG4 motifs along with either 15 or 50 nucleotides of their upstream and downstream genomic contexts were evaluated by in-line probing. Data permitted the development of a scoring system for the prediction of PG4s taking into account the effect of the neighbouring sequences. The accuracy of this scoring system was assessed by probing 14 other novel PG4 candidates retrieved in human 5′-UTRs. This new scoring system can be used, in combination with the standard algorithm, to better predict the folding of RNA G4s.

[1]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[2]  S. Balasubramanian,et al.  An RNA Hairpin to G-Quadruplex Conformational Transition , 2012, Journal of the American Chemical Society.

[3]  R. Altman,et al.  SAFA: semi-automated footprinting analysis software for high-throughput quantification of nucleic acid footprinting experiments. , 2005, RNA.

[4]  J. Darnell,et al.  Structure-function studies of FMRP RGG peptide recognition of an RNA duplex-quadruplex junction , 2011, Nature Structural &Molecular Biology.

[5]  Shankar Balasubramanian,et al.  Prevalence of quadruplexes in the human genome , 2005, Nucleic acids research.

[6]  Souvik Maiti,et al.  Effect of loops and G-quartets on the stability of RNA G-quadruplexes. , 2013, The journal of physical chemistry. B.

[7]  A. Phan,et al.  Bulges in G-quadruplexes: broadening the definition of G-quadruplex-forming sequences. , 2013, Journal of the American Chemical Society.

[8]  S. Neidle,et al.  Highly prevalent putative quadruplex sequence motifs in human DNA , 2005, Nucleic acids research.

[9]  Atsuko Mizuno,et al.  A Novel Human Cl- Channel Family Related to Drosophila flightless Locus* , 2004, Journal of Biological Chemistry.

[10]  N. Maizels,et al.  Gene function correlates with potential for G4 DNA formation in the human genome , 2006, Nucleic acids research.

[11]  Graziano Pesole,et al.  UTRdb and UTRsite: a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs , 2004, Nucleic Acids Res..

[12]  S. Balasubramanian,et al.  Quantitative visualization of DNA G-quadruplex structures in human cells. , 2013, Nature chemistry.

[13]  J. Beaudoin,et al.  Exploring mRNA 3′-UTR G-quadruplexes: evidence of roles in both alternative polyadenylation and mRNA shortening , 2013, Nucleic acids research.

[14]  S. Vagner,et al.  Essential role for the interaction between hnRNP H/F and a G quadruplex in maintaining p53 pre-mRNA 3'-end processing and function during DNA damage. , 2011, Genes & development.

[15]  Jean-Louis Mergny,et al.  How long is too long? Effects of loop size on G-quadruplex stability , 2010, Nucleic acids research.

[16]  J. Hartig,et al.  Reporter assays for studying quadruplex nucleic acids. , 2012, Methods.

[17]  S. Maiti,et al.  Effect of flanking bases on quadruplex stability and Watson–Crick duplex competition , 2009, The FEBS journal.

[18]  G. Stormo,et al.  Combining SELEX with quantitative assays to rapidly obtain accurate models of protein–DNA interactions , 2005, Nucleic acids research.

[19]  A. Serero,et al.  Formation of pearl-necklace monomorphic G-quadruplexes in the human CEB25 minisatellite. , 2012, Journal of the American Chemical Society.

[20]  F. Major,et al.  RNA G-Quadruplexes in the model plant species Arabidopsis thaliana: prevalence and possible functional roles , 2010, Nucleic acids research.

[21]  Mitali Mukerji,et al.  Genome-wide prediction of G4 DNA as regulatory motifs: role in Escherichia coli global regulation. , 2006, Genome research.

[22]  P. Bolton,et al.  Circular dichroism of quadruplex DNAs: applications to structure, cation effects and ligand binding. , 2007, Methods.

[23]  S. Balasubramanian,et al.  A sequence-independent analysis of the loop length dependence of intramolecular RNA G-quadruplex stability and topology. , 2011, Biochemistry.

[24]  J. Huppert,et al.  Hunting G-quadruplexes. , 2008, Biochimie.

[25]  H. Moine,et al.  G‐quadruplexes in RNA biology , 2012, Wiley interdisciplinary reviews. RNA.

[26]  Julian Leon Huppert,et al.  G-quadruplexes: the beginning and end of UTRs , 2008, Nucleic acids research.

[27]  Oleg Kikin,et al.  QGRS Mapper: a web-based server for predicting G-quadruplexes in nucleotide sequences , 2006, Nucleic Acids Res..

[28]  J. Mergny,et al.  UV Melting of G‐Quadruplexes , 2009, Current protocols in nucleic acid chemistry.

[29]  J. Beaudoin,et al.  In-line probing of RNA G-quadruplexes. , 2013, Methods.

[30]  J. Beaudoin,et al.  5′-UTR G-quadruplex structures acting as translational repressors , 2010, Nucleic acids research.

[31]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[32]  Shankar Balasubramanian,et al.  A sequence-independent study of the influence of short loop lengths on the stability and topology of intramolecular DNA G-quadruplexes. , 2008, Biochemistry.

[33]  S. Maiti,et al.  A thermodynamic overview of naturally occurring intramolecular DNA quadruplexes , 2008, Nucleic acids research.

[34]  D. Ecker,et al.  RNAMotif, an RNA secondary structure definition and search algorithm. , 2001, Nucleic acids research.

[35]  Peter F. Stadler,et al.  RNA Folding Algorithms with G-Quadruplexes , 2012, BSB.