QSAR Study for Carcinogenic Potency of Aromatic Amines Based on GEP and MLPs

A new analysis strategy was used to classify the carcinogenicity of aromatic amines. The physical-chemical parameters are closely related to the carcinogenicity of compounds. Quantitative structure activity relationship (QSAR) is a method of predicting the carcinogenicity of aromatic amine, which can reveal the relationship between carcinogenicity and physical-chemical parameters. This study accessed gene expression programming by APS software, the multilayer perceptrons by Weka software to predict the carcinogenicity of aromatic amines, respectively. All these methods relied on molecular descriptors calculated by CODESSA software and eight molecular descriptors were selected to build function equations. As a remarkable result, the accuracy of gene expression programming in training and test sets are 0.92 and 0.82, the accuracy of multilayer perceptrons in training and test sets are 0.84 and 0.74 respectively. The precision of the gene expression programming is obviously superior to multilayer perceptrons both in training set and test set. The QSAR application in the identification of carcinogenic compounds is a high efficiency method.

[1]  Norrozila Sulaiman,et al.  A novel intrusion detection system by using intelligent data mining in weka environment , 2011, WCIT.

[2]  M. Arivazhagan,et al.  Electronic structure investigations of 4-aminophthal hydrazide by UV-visible, NMR spectral studies and HOMO-LUMO analysis by ab initio and DFT calculations. , 2015, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[3]  Maykel Pérez González,et al.  A topological substructural approach applied to the computational prediction of rodent carcinogenicity. , 2005, Bioorganic & medicinal chemistry.

[4]  P. Olsvik,et al.  Effects of oil pollution and persistent organic pollutants (POPs) on glycerophospholipids in liver and brain of male Atlantic cod (Gadus morhua). , 2013, Chemosphere.

[5]  M. Mochizuki,et al.  Mutagenicity of aromatic amines and amides with chemical models for cytochrome P450 in Ames assay. , 2009, Toxicology in vitro : an international journal published in association with BIBRA.

[6]  Kunal Roy,et al.  First report on development of quantitative interspecies structure-carcinogenicity relationship models and exploring discriminatory features for rodent carcinogenicity of diverse organic chemicals using OECD guidelines. , 2012, Chemosphere.

[7]  Cândida Ferreira,et al.  Gene Expression Programming: A New Adaptive Algorithm for Solving Problems , 2001, Complex Syst..

[8]  Harun Tanyildizi,et al.  Estimation of compressive strength of self compacting concrete containing polypropylene fiber and mineral additives exposed to high temperature using artificial neural network , 2012 .

[9]  Prasenjit Dey,et al.  A utilization of GEP (gene expression programming) metamodel and PSO (particle swarm optimization) tool to predict and optimize the forced convection around a cylinder , 2016 .

[10]  Shikha Gupta,et al.  Identifying pollution sources and predicting urban air quality using ensemble learning methods , 2013 .

[11]  Joanna Jedrzejowicz,et al.  Experimental evaluation of two new GEP-based ensemble classifiers , 2011, Expert Syst. Appl..

[12]  Changjie Tang,et al.  Distance Guided Classification with Gene Expression Programming , 2006, ADMA.

[13]  T. Shih,et al.  Rapid and intermediate N-acetylators are less susceptible to oxidative damage among 4,4'-methylenebis(2-chloroaniline) (MBOCA)-exposed workers. , 2013, International journal of hygiene and environmental health.

[14]  J. Angerer,et al.  Percutaneous absorption of aromatic amines - a contribution for human health risk assessment. , 2008, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[15]  Qingzhu Zhang,et al.  Computational evidence for the detoxifying mechanism of epsilon class glutathione transferase toward the insecticide DDT. , 2014, Environmental science & technology.

[16]  Weimin Xiao,et al.  Evolving accurate and compact classification rules with gene expression programming , 2003, IEEE Trans. Evol. Comput..

[17]  Paul L A Popelier,et al.  Evaluation of aromatic amines with different purities and different solvent vehicles in the Ames test. , 2015, Regulatory toxicology and pharmacology : RTP.

[18]  M. Bahadir,et al.  Removal efficiency of a calix[4]arene-based polymer for water-soluble carcinogenic direct azo dyes and aromatic amines. , 2009, Journal of hazardous materials.

[19]  Bernard De Baets,et al.  Supervised ranking in the weka environment , 2010, Inf. Sci..

[20]  Erik Johansson,et al.  Megavariate analysis of environmental QSAR data. Part I – A basic framework founded on principal component analysis (PCA), partial least squares (PLS), and statistical molecular design (SMD) , 2006, Molecular Diversity.

[21]  C. Detweiler,et al.  The Biomechanisms of Metal and Metal-Oxide Nanoparticles’ Interactions with Cells , 2015, International journal of environmental research and public health.

[22]  E. Roemer,et al.  Heterocyclic aromatic amines and their contribution to the bacterial mutagenicity of the particulate phase of cigarette smoke. , 2016, Toxicology letters.

[23]  Jingtian Hu,et al.  Predicting carcinogenicity of organic compounds based on CPDB. , 2015, Chemosphere.

[24]  Shing Yip Lee,et al.  Using blood samples to estimate persistent organic pollutants and metals in green sea turtles (Chelonia mydas). , 2010, Marine pollution bulletin.

[25]  A. Jiménez,et al.  Optimization of the extraction of azo colorants used in toy products. , 2002, Journal of chromatography. A.

[26]  Roseane M. Misságia,et al.  Artificial neural networks to support petrographic classification of carbonate-siliciclastic rocks using well logs and textural information , 2015 .

[27]  Feng Luan,et al.  Prediction of retention times for a large set of pesticides or toxicants based on support vector machine and the heuristic method. , 2007, Toxicology letters.

[28]  Tao Wang,et al.  QSAR study of 1,4-dihydropyridine calcium channel antagonists based on gene expression programming. , 2006, Bioorganic & medicinal chemistry.

[29]  S. Chandel,et al.  Selection of most relevant input parameters using WEKA for artificial neural network based solar radiation prediction models , 2014 .

[30]  Liliana Teodorescu,et al.  High Energy Physics event selection with Gene Expression Programming , 2008, Comput. Phys. Commun..

[31]  Sama Azadi,et al.  Verifying the performance of artificial neural network and multiple linear regression in predicting the mean seasonal municipal solid waste generation rate: A case study of Fars province, Iran. , 2016, Waste management.

[32]  Marta Roca,et al.  Target analysis of primary aromatic amines combined with a comprehensive screening of migrating substances in kitchen utensils by liquid chromatography-high resolution mass spectrometry. , 2015, Talanta.

[33]  Swanirbhar Majumder,et al.  A Novel EMD based Watermarking of Fingerprint Biometric Using GEP , 2013 .

[34]  É. Latrille,et al.  TyPol - a new methodology for organic compounds clustering based on their molecular characteristics and environmental behavior. , 2014, Chemosphere.

[35]  Dominik Slezak,et al.  Feedforward neural networks for compound signals , 2011, Theor. Comput. Sci..

[36]  S. Inamdar,et al.  Solvents effect on the absorption and fluorescence spectra of 7-diethylamino-3-thenoylcoumarin: evaluation and correlation between solvatochromism and solvent polarity parameters. , 2015, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.