Catalyst Acquisition by Data Science (CADS): a web-based catalyst informatics platform for discovering catalysts

An innovative web-based integrated catalyst informatics platform, Catalyst Acquisition by Data Science (CADS), is developed for use towards the discovery and design of catalysts. The platform provides three main functionalities: a repository for data sharing and publishing, an analytic workspace for exploratory visual analysis, and catalyst property prediction tools with pretrained machine learning models. Access to such a platform helps decrease barriers to entry faced by researchers in catalytic chemistry when attempting to apply catalyst informatics towards data by providing analytical and visualization tools that can be simultaneously applied and easily accessed within a central space, thereby helping the advancement of catalyst informatics. The developed platform allows researchers to upload and collect data onto the platform and conduct data analysis using a system of linked workspaces consisting of interactive visualization tools and machine learning tools that simultaneously update according to the researchers' actions in real time. The platform also provides a space for collaboration where researchers can choose to publish their uploaded data and resulting analyses to the platform for collaborations with other users and groups. As an example, CADS is applied towards oxidative coupling of methane (OCM) data where use of the platform tools reveals underlying patterns and trends that are otherwise hidden within the original data. Thus, the proposed platform contributes towards the advancement of catalyst informatics for both specialists and non-specialists.

[1]  Andrew J. Medford,et al.  Extracting Knowledge from Data through Catalysis Informatics , 2018, ACS Catalysis.

[2]  Zachary W. Ulissi,et al.  To address surface reaction network complexity using scaling relations machine learning and DFT calculations , 2017, Nature Communications.

[3]  Yuzuru Tanaka,et al.  The Rise of Catalyst Informatics: Towards Catalyst Genomics , 2019, ChemCatChem.

[4]  S. Curtarolo,et al.  AFLOW: An automatic framework for high-throughput materials discovery , 2012, 1308.5715.

[5]  K. Jacobsen,et al.  The Computational 2D Materials Database: high-throughput modeling and discovery of atomically thin crystals , 2018, 2D Materials.

[6]  T. Bligaard,et al.  Machine Learning for Computational Heterogeneous Catalysis , 2019, ChemCatChem.

[7]  Itsuki Miyazato,et al.  High-Throughput Experimentation and Catalyst Informatics for Oxidative Coupling of Methane , 2020, ACS Catalysis.

[8]  Kristin A. Persson,et al.  Commentary: The Materials Project: A materials genome approach to accelerating materials innovation , 2013 .

[9]  Forest Rohwer,et al.  FastGroupII: A web-based bioinformatics platform for analyses of large 16S rDNA libraries , 2006, BMC Bioinformatics.

[10]  Muratahan Aykol,et al.  The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies , 2015 .

[11]  Thomas F. Jaramillo,et al.  New cubic perovskites for one- and two-photon water splitting using the computational materials repository , 2012 .

[12]  Jens K Nørskov,et al.  The catalyst genome. , 2013, Angewandte Chemie.

[13]  J. Nørskov,et al.  CatApp: a web application for surface chemistry and heterogeneous catalysis. , 2012, Angewandte Chemie.

[14]  Claudia Draxl,et al.  NOMAD: The FAIR concept for big data-driven materials science , 2018, MRS Bulletin.

[15]  Martin Holena,et al.  Statistical Analysis of Past Catalytic Data on Oxidative Methane Coupling for New Insights into the Composition of High‐Performance Catalysts , 2011 .

[16]  Jacob R. Boes,et al.  Catalysis-Hub.org, an open electronic structure database for surface reactions , 2019, Scientific Data.