Inferring phase diagrams from X-ray data with background signals using graph segmentation

ABSTRACT Automated composition-structure-processing phase diagram creation is critical for high-throughput experimental material studies. In particular, diffractogram datasets with large background signals are especially difficult to identify the phase regions. In this work, we proposed a novel graph segmentation algorithm from computer vision to solve the phase diagram prediction problem from X-ray diffraction data with large background signals. We introduced a novel background subtraction algorithm with graph-based clustering/segmentation to build the BGPhase algorithm. Experiments on three datasets with the Al–Cu–Mo material family showed that our phase attribution algorithm can achieve high prediction accuracy ranging from 88.6 to 94.8% or with MCC scores ranging from 0.715 to 0.890. The algorithm can be accessed online at http://mleg.cse.sc.edu/bgphase.

[1]  Estefania Argente,et al.  DoE framework for catalyst development based on soft computing techniques , 2009, Comput. Chem. Eng..

[2]  P Cignoni,et al.  DeWall: A fast divide and conquer Delaunay triangulation algorithm in Ed , 1998, Comput. Aided Des..

[3]  Warren H. Hunt Materials informatics: Growing from the Bio World , 2006 .

[4]  D. Farrusseng,et al.  Diversity management for efficient combinatorial optimization of materials , 2007 .

[5]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[6]  Manfred Baerns,et al.  An evolutionary approach in the combinatorial selection and optimization of catalytic materials , 2000 .

[7]  M. Baerns,et al.  Application of a genetic algorithm and a neural network for the discovery and optimization of new solid catalytic materials , 2004 .

[8]  Martin Holena,et al.  Optimization of Catalysts Using Specific, Description-Based Genetic Algorithms , 2008, J. Chem. Inf. Model..

[9]  Pierre Collet,et al.  Examination of genetic programming paradigm for high-throughput experimentation and heterogeneous catalysis , 2009 .

[10]  Wei Dong,et al.  PolySNAP: a computer program for analysing high-throughput powder diffraction data , 2004 .

[11]  S. Suram,et al.  Generating information-rich high-throughput experimental materials genomes using functional clustering via multitree genetic programming and information theory. , 2015, ACS combinatorial science.

[12]  N. Ashcroft,et al.  Vegard's law. , 1991, Physical review. A, Atomic, molecular, and optical physics.

[13]  Jianjun Hu,et al.  Semi-Supervised Approach to Phase Identification from Combinatorial Sample Diffraction Patterns , 2016 .

[14]  I. Takeuchi,et al.  Rapid structural mapping of ternary metallic alloy systems using the combinatorial approach and cluster analysis. , 2007, The Review of scientific instruments.

[15]  Yan Zhang,et al.  Generalized machine learning technique for automatic phase attribution in time variant high-throughput experimental studies , 2015 .

[16]  Stefano Ermon,et al.  Pattern Decomposition with Complex Combinatorial Constraints: Application to Materials Discovery , 2014, AAAI.

[17]  John D. Perkins,et al.  Strategy for the maximum extraction of information generated from combinatorial experimentation of Co-doped ZnO thin films , 2011 .

[18]  Ronan Le Bras,et al.  Automated Phase Mapping with AgileFD and its Application to Light Absorber Discovery in the V-Mn-Nb Oxide System. , 2016, ACS combinatorial science.

[19]  I Takeuchi,et al.  Rapid identification of structural phases in combinatorial thin-film libraries using x-ray diffraction and non-negative matrix factorization. , 2009, The Review of scientific instruments.

[20]  Manh Cuong Nguyen,et al.  On-the-fly machine-learning for high-throughput experiments: search for rare-earth-free permanent magnets , 2014, Scientific Reports.

[21]  Wei Dong,et al.  High-throughput powder diffraction. II. Applications of clustering methods and multivariate data analysis , 2004 .

[22]  G. Raynor Phase Equilibria in Iron Ternary Alloys , 1988 .

[23]  Ronan Le Bras,et al.  Constraint Reasoning and Kernel Clustering for Pattern Decomposition with Scaling , 2011, CP.

[24]  Charles H. Ward Materials Genome Initiative for Global Competitiveness , 2012 .

[25]  I Takeuchi,et al.  High-throughput determination of structural phase diagram and constituent phases using GRENDEL , 2015, Nanotechnology.

[26]  Paul F. Whelan,et al.  Image segmentation based on the integration of colour-texture descriptors - A review , 2011, Pattern Recognit..

[27]  Kim F. Ferris,et al.  The materials informatics workshop: Theory and application , 2007 .

[28]  Jianjun Hu,et al.  Automated Phase Segmentation for Large-Scale X-ray Diffraction Data Using a Graph-Based Phase Segmentation (GPhase) Algorithm. , 2017, ACS combinatorial science.

[29]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.