RNA folding and combinatory landscapes.

In this paper we view the folding of polynucleotide (RNA) sequences as a map that assigns to each sequence a minimum-free-energy pattern of base pairings, known as secondary structure. Considering only the free energy leads to an energy landscape over the sequence space. Taking into account structure generates a less visualizable nonscalar landscape,'' where a sequence space is mapped into a space of discrete shapes.'' We investigate the statistical features of both types of landscapes by computing autocorrelation functions, as well as distributions of energy and structure distances, as a function of distance in sequence space. RNA folding is characterized by very short structure correlation lengths compared to the diameter of the sequence space. The correlation lengths depend strongly on the size and the pairing rules of the underlying nucleotide alphabet. Our data suggest that almost every minimum-free-energy structure is found within a small neighborhood of any random sequence. The interest in such landscapes results from the fact that they govern natural and artificial processes of optimization by mutation and selection. Simple statistical model landscapes, like Kauffman and Levin's [ital n]-[ital k] model [J. Theor. Biol. 128, 11 (1987)], are often used as a proxy for understanding realistic landscapes, likemore » those induced by RNA folding. We make a detailed comparison between the energy landscapes derived from RNA folding and those obtained from the [ital n]-[ital k] model. We derive autocorrelation functions for several variants of the [ital n]-[ital k] model, and briefly summarize work on its fine structure. The comparison leads to an estimate for [ital k]=7--8, independent of [ital n], where [ital n] is the chain length. While the scaling behaviors agree, the fine structure is considerably different in the two cases.« less

[1]  S. Kirkpatrick,et al.  Solvable Model of a Spin-Glass , 1975 .

[2]  Jerrold R. Griggs,et al.  Algorithms for Loop Matchings , 1978 .

[3]  G. Oster,et al.  Theoretical studies of clonal selection: minimal antibody repertoire size and reliability of self-non-self discrimination. , 1979, Journal of theoretical biology.

[4]  B. Derrida Random-energy model: An exactly solvable model of disordered systems , 1981 .

[5]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[6]  David Sankoff,et al.  RNA secondary structures and their prediction , 1984 .

[7]  Paulien Hogeweg,et al.  Energy directed folding of RNA sequences , 1984, Nucleic Acids Res..

[8]  P. Anderson,et al.  Application of statistical mechanics to NP-complete problems in combinatorial optimisation , 1986 .

[9]  D. Turner,et al.  Improved free-energy parameters for predictions of RNA duplex stability. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[10]  N. Biggs THE TRAVELING SALESMAN PROBLEM A Guided Tour of Combinatorial Optimization , 1986 .

[11]  Giorgio Parisi,et al.  Mean-Field Equations for the Matching and the Travelling Salesman Problems , 1986 .

[12]  J. Bernasconi Low autocorrelation binary sequences : statistical mechanics and configuration space analysis , 1987 .

[13]  S. Kauffman,et al.  Towards a general theory of adaptive walks on rugged landscapes. , 1987, Journal of theoretical biology.

[14]  Bruce A. Shapiro,et al.  An algorithm for comparing multiple RNA secondary structures , 1988, Comput. Appl. Biosci..

[15]  E. D. Weinberger,et al.  A more rigorous derivation of some properties of uncorrelated fitness landscapes , 1988 .

[16]  D. Turner,et al.  Improved predictions of secondary structures for RNA. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[17]  C. A. Macken,et al.  Protein evolution on rugged landscapes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Schuster,et al.  Physical aspects of evolutionary optimization and adaptation. , 1989, Physical review. A, General physics.

[19]  E. D. Weinberger,et al.  The NK model of rugged fitness landscapes and its application to maturation of the immune response. , 1989, Journal of theoretical biology.

[20]  P. Schuster,et al.  Statistics of landscapes based on free energies, replication and degradation rate constants of RNA secondary structures , 1991 .

[21]  A. Perelson,et al.  Evolutionary walks on rugged landscapes , 1991 .

[22]  E D Weinberger,et al.  Why some fitness landscapes are fractal. , 1993, Journal of theoretical biology.

[23]  P. Schuster,et al.  Statistics of RNA secondary structures , 1993, Biopolymers.