Prediction for Rational Synthesis Based on Weighted Feature Selection Method

In this paper, a novel integrated feature selection model is proposed to analyze the relationship between the synthesis factors and the specific resulting structure on the database of AlPO synthesis. Concretely, the proposed model can select the most significant synthesis factors affecting the formation of a (6,12)‐ring‐containing structure by combining multiple feature selection methods. Firstly, eight feature selection methods are employed to prerank the synthesis factors based on the predictive performance of support vector machine. Then, a weighted fusion mechanism is presented to rerank the results. Finally, sequential forward floating search method is utilized to select the most significant synthesis factors in view of the highest predictive performance. A large number of experimental results show that the proposed model is efficient and feasible. The predictive accuracy can reach 86.47 % with the selected 10 factors among 21 synthesis factors. The selection and ranking results also give a rational understanding for AlPO synthesis. More specifically, a proportional relationship among gel composition parameters is recommended based on the result of feature selection, which has important guiding significance for the rational design and synthesis.

[1]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[2]  B. Weckhuysen,et al.  On the synthesis of vanadium containing molecular sieves by experimental design from a VOSO4·5H2O·Al(iPrO)3·Pr2NH·H2O gel: occurrence of VAPO-41 as a secondary structure in the synthesis of VAPO-11 , 2000 .

[3]  Jun Kong,et al.  Missing value estimation for database of aluminophosphate (AlPO) syntheses , 2013 .

[4]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Jigui Sun,et al.  Introduction and application of zeobank : synthesis and structure databases of zeolites and related materials , 2007 .

[6]  Huaiqing Wang,et al.  A discretization algorithm based on a heterogeneity criterion , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Jihong Yu,et al.  Insight into the construction of open-framework aluminophosphates. , 2006, Chemical Society reviews.

[8]  Kilian Stoffel,et al.  Theoretical Comparison between the Gini Index and Information Gain Criteria , 2004, Annals of Mathematics and Artificial Intelligence.

[9]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Gavin C. Cawley,et al.  Gene Selection in Cancer Classification using Sparse Logistic Regression with Bayesian Regularisation , 2006 .

[11]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[12]  Yan Yan,et al.  Database of open-framework aluminophosphate syntheses: introduction and application (I) , 2009 .

[13]  J. Preston Ξ-filters , 1983 .

[14]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[15]  B. Weckhuysen,et al.  On the synthesis of CoAPO-46, -11 and -44 molecular sieves from a Co(Ac)2. 4H2O.Al(iPrO)3. H3PO4. Pr2NH. H2O gel via experimental design , 1999 .

[16]  R. Lathe Phd by thesis , 1988, Nature.

[17]  Jun Kong,et al.  Computational prediction of the formation of microporous aluminophosphates with desired structural features , 2010 .

[18]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[19]  S. Sathiya Keerthi,et al.  A simple and efficient algorithm for gene selection using sparse logistic regression , 2003, Bioinform..

[20]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[21]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[22]  Jianzhong Wang,et al.  Prediction of Microporous Aluminophosphate AlPO4‐5 Based on Resampling Using Partial Least Squares and Logistic Discrimination , 2010, Molecular informatics.

[23]  高娜,et al.  基于特征选择的决策树方法在磷酸铝AlPO 4 -5定向合成中的应用 , 2011 .

[24]  Chi Hau Chen,et al.  Pattern recognition and signal processing , 1978 .

[25]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[26]  Peng Chen,et al.  Template-Designed Syntheses of Open-Framework Zinc Phosphites with Extra-Large 24-Ring Channels , 2008 .

[27]  Jie-Hui Yu,et al.  Toward the rational design and synthesis of inorganic microporous and related materials , 2004 .

[28]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[29]  Stacey I. Zones,et al.  A combustion-free methodology for synthesizing zeolites and zeolite-like materials , 2003, Nature.

[30]  Perica Strbac,et al.  Toward optimal feature selection using ranking methods and classification algorithms , 2011 .

[31]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .