Model-based reinforcement learning for biological sequence design

The ability to design biological structures such as DNA or proteins would have considerable medical and industrial impact. Doing so presents a challenging black-box optimization problem characterized by a large-batch, low-round setting due to the need for labor-intensive wet-lab evaluations. In response, we propose using reinforcement learning (RL) based on proximal policy optimization (PPO) for biological sequence design. RL provides a flexible framework for optimizing generative sequence models to achieve specific criteria, such as diversity among the high-quality sequences discovered. We propose a model-based variant of PPO, DyNA-PPO, to improve sample efficiency, where the policy for a new round is trained offline using a simulator fit on functional measurements from prior rounds. To accommodate the growing number of observations across rounds, the simulator model is automatically selected at each round from a pool of diverse models of varying capacity. On the tasks of designing DNA transcription factor binding sites, designing antimicrobial proteins, and optimizing the energy of Ising models based on protein structure, we find that DyNA-PPO performs significantly better than existing methods in settings in which modeling is feasible, while still not performing worse in settings in which a reliable model cannot be learned.
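
The round-based procedure summarized above can be made concrete with a short sketch. The snippet below is a minimal, illustrative outline of the DyNA-PPO outer loop, not the authors' implementation: the helpers propose_batch and wet_lab_measure, the toy reward, and the scikit-learn regressor pool are hypothetical placeholders (the paper's policy is a PPO-trained sequence generator and the true reward comes from wet-lab assays), and the offline PPO update against the surrogate is elided. It is meant only to show how candidate simulators are cross-validated each round and how the method falls back to model-free optimization when no reliable model can be fit.

# Minimal sketch of the DyNA-PPO outer loop (illustrative assumptions only).
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import Ridge
from sklearn.neighbors import KNeighborsRegressor
from sklearn.model_selection import cross_val_score

ALPHABET = "ACGT"   # DNA alphabet; protein design would use 20 amino acids
SEQ_LEN = 8         # toy sequence length

def one_hot(seqs):
    """Encode a list of sequences as flat one-hot vectors."""
    idx = {c: i for i, c in enumerate(ALPHABET)}
    X = np.zeros((len(seqs), SEQ_LEN * len(ALPHABET)))
    for n, s in enumerate(seqs):
        for p, c in enumerate(s):
            X[n, p * len(ALPHABET) + idx[c]] = 1.0
    return X

def propose_batch(rng, batch_size):
    """Stand-in for sampling sequences from the PPO policy."""
    return ["".join(rng.choice(list(ALPHABET), SEQ_LEN)) for _ in range(batch_size)]

def wet_lab_measure(seqs):
    """Stand-in for the expensive experimental assay (toy reward here)."""
    return np.array([s.count("A") / SEQ_LEN for s in seqs])

def fit_and_select_surrogate(X, y, threshold=0.5):
    """Fit a pool of models of varying capacity and keep the best one if its
    cross-validated R^2 clears the threshold; otherwise return None."""
    pool = [Ridge(), KNeighborsRegressor(n_neighbors=3),
            RandomForestRegressor(n_estimators=100)]
    scored = [(cross_val_score(m, X, y, cv=5, scoring="r2").mean(), m) for m in pool]
    best_score, best_model = max(scored, key=lambda t: t[0])
    if best_score < threshold:
        return None                      # fall back to model-free PPO this round
    return best_model.fit(X, y)

rng = np.random.default_rng(0)
all_seqs, all_y = [], np.array([])
for round_idx in range(4):               # small number of experimental rounds
    batch = propose_batch(rng, batch_size=64)
    y = wet_lab_measure(batch)           # one large, expensive batch per round
    all_seqs += batch
    all_y = np.concatenate([all_y, y])

    surrogate = fit_and_select_surrogate(one_hot(all_seqs), all_y)
    if surrogate is not None:
        # Offline phase: many cheap policy updates against the surrogate
        # reward before the next wet-lab round (PPO update itself omitted).
        virtual = propose_batch(rng, batch_size=256)
        virtual_rewards = surrogate.predict(one_hot(virtual))
    print(f"round {round_idx}: best measured reward {all_y.max():.2f}")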
