Efficient Universal Computation by Greedy Molecular Folding

We introduce and study the computational power of Oritatami, a theoretical model to explore greedy molecular folding, by which the molecule begins to fold before waiting the end of its production. This model is inspired by our recent experimental work demonstrating the construction of shapes at the nanoscale by folding an RNA molecule during its transcription from an engineered sequence of synthetic DNA. While predicting the most likely conformation is known to be NP-complete in other models, Oritatami sequences fold optimally in linear time. Although our model uses only a small subset of the mechanisms known to be involved in molecular folding, we show that it is capable of efficient universal computation, implying that any extension of this model will have this property as well. We develop several general design techniques for programming these molecules. Our main result in this direction is an algorithm in time linear in the sequence length, that finds a rule for folding the sequence deterministically into a prescribed set of shapes depending of its environment. This shows the corresponding problem is fixed-parameter tractable although we proved it is NP-complete in the number of possible environments. This algorithm was used effectively to design several key steps of our constructions.

[1]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[2]  Ron Unger,et al.  Finding the lowest free energy conformation of a protein is an NP-hard problem: Proof and implications , 1993 .

[3]  William E. Hart,et al.  On the Intractability of Protein Folding with a Finite Alphabet of Amino Acids , 1999, Algorithmica.

[4]  Alantha Newman A new algorithm for protein folding in the HP model , 2002, SODA '02.

[5]  P. Rothemund Folding DNA to create nanoscale shapes and patterns , 2006, Nature.

[6]  Cody W. Geary,et al.  A single-stranded architecture for cotranscriptional folding of RNA nanostructures , 2014, Science.

[7]  Mike Paterson,et al.  On the Complexity of String Folding , 1996, Discret. Appl. Math..

[8]  Turlough Neary,et al.  P-completeness of Cellular Automaton Rule 110 , 2006, ICALP.

[9]  Kirsten L. Frieda,et al.  Direct Observation of Cotranscriptional Folding in an Adenine Riboswitch , 2012, Science.

[10]  Laurent Vuillon,et al.  Intermolecular β-Strand Networks Avoid Hub Residues and Favor Low Interconnectedness: A Potential Protection Mechanism against Chain Dissociation upon Mutation , 2014, PloS one.

[11]  S. Kim,et al.  Sequential folding of transfer RNA. A nuclear magnetic resonance study of successively longer tRNA fragments with a common 5' end. , 1980, Journal of molecular biology.

[12]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[13]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[14]  Steven Skiena,et al.  Local Rules for Protein Folding on a Triangular Lattice and Generalized Hydrophobicity in the HP Model , 1997, J. Comput. Biol..