A novel state space representation for the solution of 2D-HP protein folding problem using reinforcement learning methods

A new state space representation of the protein folding problem in 2D-HP model is proposed for the use of reinforcement learning methods.The proposed state space representation reduces the dependency of the size of the state-action space to the amino acid sequence length.The proposed state space representation also provides an actual learning for an agent. Thus, at the end of a learning process an agent could find the optimum fold of any sequence of a certain length, which is not the case in the existing reinforcement learning methods.By using the Ant-Q algorithm (an ant based reinforcement learning method), optimum fold of a protein sequence is found rapidly when compared to the standard Q-learning algorithm. In this study, a new state space representation of the protein folding problem for the use of reinforcement learning methods is proposed. In the existing studies, the way of defining the state-action space prevents the agent to learn the state space for any amino-acid sequence, but rather, the defined state-action space is valid for only a particular amino-acid sequence. Moreover, in the existing methods, the size of the state space is strictly depends on the amino-acid sequence length. The newly proposed state-action space reduces this dependency and allows the agent to find the optimal fold of any sequence of a certain length. Additionally, by utilizing an ant based reinforcement learning algorithm, the Ant-Q algorithm, optimum fold of a protein is found rapidly when compared to the standard Q-learning algorithm. Experiments showed that, the new state-action space with the ant based reinforcement learning method is much more suited for the protein folding problem in two dimensional lattice model.

[1]  Yasuaki Kuroe,et al.  Swarm Reinforcement Learning Method for a Multi-robot Formation Problem , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[2]  Fred W. Glover,et al.  A filter-and-fan approach to the 2D HP model of the protein folding problem , 2011, Ann. Oper. Res..

[3]  Yasuaki Kuroe,et al.  Swarm reinforcement learning algorithms based on particle swarm optimization , 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics.

[4]  Cheng-Jian Lin,et al.  An efficient hybrid Taguchi-genetic algorithm for protein folding simulation , 2009, Expert Syst. Appl..

[5]  Yibin Li,et al.  State-chain sequential feedback reinforcement learning for path planning of autonomous mobile robots , 2013, Journal of Zhejiang University SCIENCE C.

[6]  Gabriela Czibula,et al.  A distributed reinforcement learning approach for solving optimization problems , 2011 .

[7]  Marco Dorigo,et al.  Ant colony optimization , 2006, IEEE Computational Intelligence Magazine.

[8]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[9]  Yasuaki Kuroe,et al.  Swarm reinforcement learning method based on ant colony optimization , 2010, 2010 IEEE International Conference on Systems, Man and Cybernetics.

[10]  Yasuaki Kuroe,et al.  Swarm Reinforcement Learning Method Based on an Actor-Critic Method , 2010, SEAL.

[11]  Marco Dorigo,et al.  Optimization, Learning and Natural Algorithms , 1992 .

[12]  Yasuaki Kuroe,et al.  Swarm reinforcement learning methods for problems with continuous state-action space , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[13]  Zhihong He,et al.  Protein folding simulations of 2D HP model by the genetic algorithm based on optimal secondary structures , 2010, Comput. Biol. Chem..

[14]  Henri Luchian,et al.  Protein Structure Prediction in Lattice Models with Particle Swarm Optimization , 2010, ANTS Conference.

[15]  Shingo Mabu,et al.  Fuzzy genetic Network Programming with Reinforcement Learning for mobile robot navigation , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[16]  Gang Li,et al.  Protein-folding simulations of the hydrophobic-hydrophilic model by combining pull moves with energy landscape paving. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Luca Maria Gambardella,et al.  Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem , 1995, ICML.

[18]  Gabriela Czibula,et al.  AN EXPERIMENT ON PROTEIN STRUCTURE PREDICTION USING REINFORCEMENT LEARNING , 2011 .

[19]  Qiang Lv,et al.  Solving 2D HP Protein Folding Problem by Parallel Ant Colonies , 2009, 2009 2nd International Conference on Biomedical Engineering and Informatics.

[20]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[21]  Y. Kuroe,et al.  Swarm reinforcement learning algorithms based on Sarsa method , 2008, 2008 SICE Annual Conference.

[22]  Martha Grabowski,et al.  Human and organizational error in large scale systems , 1996, IEEE Trans. Syst. Man Cybern. Part A.