Computational Prediction of Critical Temperatures of Superconductors Based on Convolutional Gradient Boosting Decision Trees

Superconductors have been one of the most intriguing materials since they were discovered more than a century ago. However, superconductors at room temperature have yet to be discovered. On the other hand, machine learning and especially deep learning has been increasingly used in material properties prediction and discovery in recent years. In this paper, we propose to combine the deep convolutional neural network (CNN) model with fully convolutional layers for feature extraction with gradient boosting decision tree (GBDT) for superconductors critical temperature ( $T\text{c}$ ) prediction. Our prediction model only uses the elemental property statistics of the materials as original input and learns a hierarchical representation of superconductors using convolutional layers. Computational experiments showed that our convolutional gradient boosting decision tree (ConvGBDT) model achieved the state-of-the-art results on three superconductor data sets: DataS, DataH, and DataK. By visually comparing the raw elemental feature distribution and the learned feature distribution, it is found that the convolutional layers of our ConvGBDT can learn features that can more effectively distinguish cuprate and iron-based superconductors. On the other hand, the GBDT part of our ConvGBDT model can learn the sophisticated mapping relationship between extracted features and the critical temperatures to obtain good prediction performance.

[1]  Tara N. Sainath,et al.  State-of-the-Art Speech Recognition with Sequence-to-Sequence Models , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  H. Koinuma,et al.  Combinatorial solid-state chemistry of inorganic materials , 2004, Nature materials.

[3]  Vijay S. Pande,et al.  MoleculeNet: a benchmark for molecular machine learning , 2017, Chemical science.

[4]  C. Castleton,et al.  Managing the supercell approximation for charged defects in semiconductors: Finite-size scaling, charge correction factors, the band-gap problem, and the ab initio dielectric constant , 2006 .

[5]  Muratahan Aykol,et al.  Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) , 2013 .

[6]  M. Anne Britt,et al.  Deep and Shallow Natural Language Understanding for Identifying Explanation Structure , 2018 .

[7]  Mark Granovetter,et al.  The Making of an Industry: Electricity in the United States , 1998 .

[8]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[10]  Svetha Venkatesh,et al.  Face Recognition Using Kernel Ridge Regression , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Chiho Kim,et al.  Machine learning in materials informatics: recent applications and prospects , 2017, npj Computational Materials.

[12]  Liao Rongtao,et al.  Improvement of Face Recognition Algorithm Based on Neural Network , 2018 .

[13]  V. Blatov Voronoi–dirichlet polyhedra in crystal chemistry: theory and applications , 2004 .

[14]  Herbert C. Freyhardt,et al.  European project on a self-limiting superconducting power link , 2000 .

[15]  R. Asahi,et al.  Microstructure recognition using convolutional neural networks for prediction of ionic conductivity in ceramics , 2017 .

[16]  To use or not to use cool superconductors? , 2011, Nature materials.

[17]  Kristin A. Persson,et al.  Commentary: The Materials Project: A materials genome approach to accelerating materials innovation , 2013 .

[18]  Klaus-Robert Müller,et al.  Optimizing transition states via kernel-based machine learning. , 2012, The Journal of chemical physics.

[19]  Wenquan Feng,et al.  Multiactivation Pooling Method in Convolutional Neural Networks for Image Recognition , 2018, Wirel. Commun. Mob. Comput..

[20]  Wei-keng Liao,et al.  ElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition , 2018, Scientific Reports.

[21]  T. Shun,et al.  Nanostructured High‐Entropy Alloys with Multiple Principal Elements: Novel Alloy Design Concepts and Outcomes , 2004 .

[22]  H. Kubota,et al.  Research & development of superconducting fault current limiter in Japan , 2005, IEEE Transactions on Applied Superconductivity.

[23]  Alexandre Tkatchenko,et al.  Quantum-chemical insights from deep tensor neural networks , 2016, Nature Communications.

[24]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[25]  Alok Choudhary,et al.  A General-Purpose Machine Learning Framework for Predicting Properties of Inorganic Materials , 2016 .

[26]  Jeffrey C Grossman,et al.  Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties. , 2017, Physical review letters.

[27]  Juri Ganitkevitch Large-Scale Paraphrasing for Natural Language Understanding , 2013, HLT-NAACL.

[28]  Yuksel C. Yabansu,et al.  Material structure-property linkages using three-dimensional convolutional neural networks , 2018 .

[29]  Iwao Hosako,et al.  Deep Learning Model for Finding New Superconductors , 2018, ArXiv.

[30]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[31]  Krishna Rajan,et al.  Materials Informatics: The Materials ``Gene'' and Big Data , 2015 .

[32]  Marco Buongiorno Nardelli,et al.  AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations , 2012 .

[33]  Parijat Deshpande,et al.  Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters , 2014, Integrating Materials and Manufacturing Innovation.

[34]  Corey Oses,et al.  Machine learning modeling of superconducting critical temperature , 2017, npj Computational Materials.

[35]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[36]  Kyle Chard,et al.  Matminer: An open source toolkit for materials data mining , 2018, Computational Materials Science.

[37]  Muratahan Aykol,et al.  The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies , 2015 .

[38]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[39]  K. Ryu,et al.  1st Phase Results and Future Plan of DAPAS Program , 2006, IEEE Transactions on Applied Superconductivity.

[40]  Jian Yang,et al.  Discriminative Block-Diagonal Representation Learning for Image Recognition , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[41]  Christopher M Wolverton,et al.  Atomistic calculations and materials informatics: A review , 2017 .

[42]  Rongtao Liao,et al.  Improvement of Face Recognition Algorithm Based on Neural Network , 2018, 2018 10th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA).

[43]  Jianjun Hu,et al.  An Ensemble Deep Convolutional Neural Network Model with Improved D-S Evidence Fusion for Bearing Fault Diagnosis , 2017, Sensors.

[44]  P. Liaw,et al.  Atomistic clustering-ordering and high-strain deformation of an Al0.1CrCoFeNi high-entropy alloy , 2016, Scientific Reports.

[45]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[46]  Gerbrand Ceder,et al.  Toward Computational Materials Design: The Impact of Density Functional Theory on Materials Research , 2006 .

[47]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[48]  Erin Antono,et al.  Building Data-driven Models with Microstructural Images: Generalization and Interpretability , 2017, ArXiv.

[49]  P. Luksch,et al.  New developments in the Inorganic Crystal Structure Database (ICSD): accessibility in support of materials research and design. , 2002, Acta crystallographica. Section B, Structural science.

[50]  Sanket A. Deshmukh,et al.  Crystallization kinetics in AlxCrCoFeNi (0 ≤ x ≤ 40) high-entropy alloys , 2017 .

[51]  A. Choudhary,et al.  Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science , 2016 .

[52]  Richard G. Hennig,et al.  Machine learning of ab-initio energy landscapes for crystal structure predictions , 2019, Computational Materials Science.

[53]  B. Meredig,et al.  Materials science with large-scale data and informatics: Unlocking new opportunities , 2016 .

[54]  K. Hamidieh A data-driven statistical model for predicting the critical temperature of a superconductor , 2018, Computational Materials Science.

[55]  K-R Müller,et al.  SchNet - A deep learning architecture for molecules and materials. , 2017, The Journal of chemical physics.

[56]  Jianjun Hu,et al.  End-To-End Convolutional Neural Network Model for Gear Fault Diagnosis Based on Sound Signals , 2018, Applied Sciences.

[57]  M. Rupp,et al.  Machine learning of molecular electronic properties in chemical compound space , 2013, 1305.7074.

[58]  G. Balasubramanian,et al.  Dislocation dynamics in Al0.1CoCrFeNi high-entropy alloy under tensile loading , 2017 .