Controlling an organic synthesis robot with machine learning to search for new reactivity

The discovery of chemical reactions is an inherently unpredictable and time-consuming process1. An attractive alternative is to predict reactivity, although relevant approaches, such as computer-aided reaction design, are still in their infancy2. Reaction prediction based on high-level quantum chemical methods is complex3, even for simple molecules. Although machine learning is powerful for data analysis4,5, its applications in chemistry are still being developed6. Inspired by strategies based on chemists’ intuition7, we propose that a reaction system controlled by a machine learning algorithm may be able to explore the space of chemical reactions quickly, especially if trained by an expert8. Here we present an organic synthesis robot that can perform chemical reactions and analysis faster than they can be performed manually, as well as predict the reactivity of possible reagent combinations after conducting a small number of experiments, thus effectively navigating chemical reaction space. By using machine learning for decision making, enabled by binary encoding of the chemical inputs, the reactions can be assessed in real time using nuclear magnetic resonance and infrared spectroscopy. The machine learning system was able to predict the reactivity of about 1,000 reaction combinations with accuracy greater than 80 per cent after considering the outcomes of slightly over 10 per cent of the dataset. This approach was also used to calculate the reactivity of published datasets. Further, by using real-time data from our robot, these predictions were followed up manually by a chemist, leading to the discovery of four reactions.A robot instructed by a machine learning algorithm and coupled with real-time spectroscopic systems provides fast and accurate reaction outcome predictions and reactivity assessments, leading to the discovery of new reactions.

[1]  Peter R. Schreiner,et al.  Heuristic Thinking Makes a Chemist Smart , 2010 .

[2]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[3]  D. Coomans,et al.  The application of linear discriminant analysis in the diagnosis of thyroid diseases , 1978 .

[4]  Michael I. Jordan,et al.  Machine learning: Trends, perspectives, and prospects , 2015, Science.

[5]  J. Hendler,et al.  Amplify scientific discovery with artificial intelligence , 2014, Science.

[6]  Wendy A Warr,et al.  A Short Review of Chemical Reaction Database Systems, Computer‐Aided Synthesis Design, Reaction Prediction and Synthetic Feasibility , 2014, Molecular informatics.

[7]  Derek T. Ahneman,et al.  Predicting reaction performance in C–N cross-coupling using machine learning , 2018, Science.

[8]  Paul Richardson,et al.  A platform for automated nanomole-scale reaction screening and micromole-scale synthesis in flow , 2018, Science.

[9]  Alán Aspuru-Guzik,et al.  Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules , 2016, ACS central science.

[10]  Alexei Lapkin,et al.  Automatic discovery and optimization of chemical processes , 2015 .

[11]  Károly Héberger,et al.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? , 2015, Journal of Cheminformatics.

[12]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[13]  Martin D Burke,et al.  The Molecular Industrial Revolution: Automated Synthesis of Small Molecules. , 2018, Angewandte Chemie.

[14]  Kevin Warwick,et al.  March of the Machines , 1997 .

[15]  Frank Glorius,et al.  Contemporary Screening Approaches to Reaction Discovery and Development , 2014 .

[16]  Leroy Cronin,et al.  Towards dial-a-molecule by integrating continuous flow, analytics and self-optimisation. , 2016, Chemical Society reviews.

[17]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  D. Singleton,et al.  A Case Study of the Mechanism of Alcohol-Mediated Morita Baylis–Hillman Reactions. The Importance of Experimental Observations , 2015, Journal of the American Chemical Society.

[19]  Leroy Cronin,et al.  An autonomous organic reaction search engine for chemical reactivity , 2017, Nature Communications.

[20]  Paul Raccuglia,et al.  Machine-learning-assisted materials discovery using failed experiments , 2016, Nature.

[21]  Christopher. Simons,et al.  Machine learning with Python , 2017 .

[22]  Derek T. Ahneman,et al.  Deoxyfluorination with Sulfonyl Fluorides: Navigating Reaction Space with Machine Learning. , 2018, Journal of the American Chemical Society.

[23]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[24]  Andrea M E Palazzolo,et al.  The natural productome , 2017, Proceedings of the National Academy of Sciences.

[25]  Richard J Ingham,et al.  Organic synthesis: march of the machines. , 2015, Angewandte Chemie.