G.U base pairing motifs in ribosomal RNA.

An increasing number of recognition mechanisms in RNA are found to involve G.U base pairs. In order to detect new functional sites of this type, we exhaustively analyzed the sequence alignments and secondary structures of eubacterial and chloroplast 16S and 23S rRNA, seeking positions with high levels of G.U pairs. Approximately 120 such sites were identified and classified according to their secondary structure and sequence environment. Overall biases in the distribution of G.U pairs are consistent with previously proposed structural rules: the side of the wobble pair that is subject to a loss of stacking is preferentially exposed to a secondary structure loop, where stacking is not as essential as in helical regions. However, multiple sites violate these rules and display highly conserved G.U pairs in orientations that could cause severe stacking problems. In addition, three motifs displaying a conserved G.U pair in a specific sequence/structure environment occur at an unusually high frequency. These motifs, of which two had not been reported before, involve sequences 5'UG3' 3'GA5' and 5'UG3' 3'GU5', as well as G.U pairs flanked by a bulge loop 3' of U. The possible structures and functions of these recurrent motifs are discussed.