Neutrosophic rule-based prediction system for toxicity effects assessment of biotransformed hepatic drugs

Abstract Measuring toxicity is an important step in drug development. However, the current experimental methods which are used to estimate the drug toxicity are expensive and need high computational efforts. Therefore, these methods are not suitable for large-scale evaluation of drug toxicity. As a consequence, there is a high demand to implement computational models that can predict drug toxicity risks. In this paper, we used a dataset that consists of 553 drugs that biotransformed in the liver. In this data, there are four toxic effects, namely, mutagenic, tumorigenic, irritant and reproductive effects. Each drug is represented by 31 chemical descriptors. This paper proposes two models for predicting drug toxicity risks. The proposed models consist of three phases. In the first phase, the most discriminative features are selected using rough set-based methods to reduce the classification time and improve the classification performance. In the second phase, three different sampling algorithms, namely, Random Under-Sampling, Random Over-Sampling, and Synthetic Minority Oversampling Technique (SMOTE) are used to obtain balanced data. In the third phase, the first proposed model employs the Neutrosophic Rule-based Classification System (NRCS), and the second model uses Genetic NRCS (GNRCS) to classify an unknown drug into toxic or non-toxic. The experimental results proved that the proposed models obtained high sensitivity (89–93%), specificity (91–97%), and GM (90–94%) for all toxic effects. Overall, the results of the proposed models indicate that it could be utilized for the prediction of drug toxicity in the early stages of drug development.

[1]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[2]  I. Turksen Interval valued fuzzy sets based on normal forms , 1986 .

[3]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[4]  Swati Aggarwal,et al.  Neutrosophic classifier: An extension of fuzzy classifer , 2014, Appl. Soft Comput..

[5]  Aboul Ella Hassanien,et al.  A Predictive Model for Seminal Quality Using Neutrosophic Rule-Based Classification System , 2018, AISI.

[6]  Y T Woo,et al.  Development of structure-activity relationship rules for predicting carcinogenic potential of chemicals. , 1995, Toxicology letters.

[7]  Oscar Cordón,et al.  On Designing Fuzzy Rule-Based Multiclassification Systems by Combining Furia with Bagging and Feature Selection , 2011, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[8]  Mohamed Elhoseny,et al.  Recognizing human activity in mobile crowdsensing environment using optimized k-NN algorithm , 2018, Expert Syst. Appl..

[9]  Duoqian Miao,et al.  Discernibility Matrix Based Algorithm for Reduction of Attributes , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[10]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[11]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[12]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[13]  Florentin Smarandache,et al.  A unifying field in logics : neutrosophic logic : neutrosophy, neutrosophic set, neutrosophic probability , 2020 .

[14]  G. Klopman MULTICASE 1. A Hierarchical Computer Automated Structure Evaluation Program , 1992 .

[15]  Ranjit Biswas,et al.  Neutrosophic Relational Database Decomposition , 2011 .

[16]  Ruili Huang,et al.  Weighted feature significance: a simple, interpretable model of compound toxicity based on the statistical enrichment of structural features. , 2009, Toxicological sciences : an official journal of the Society of Toxicology.

[17]  W. Luxemburg Non-Standard Analysis , 1977 .

[18]  Mitchell N. Cayen,et al.  A guide to drug discovery: Making Better Drugs: Decision Gates in Non-Clinical Drug Development , 2003, Nature Reviews Drug Discovery.

[19]  Dariusz Plewczynski,et al.  TVscreen: Trend Vector Virtual SCREENing of Large Commercial Compounds Collections , 2008, 2008 International Conference on Biocomputation, Bioinformatics, and Biomedical Technologies.

[20]  Stephen H. Friend,et al.  Toxicogenomics and drug discovery: will new technologies help us produce better drugs? , 2002, Nature Reviews Drug Discovery.

[21]  Alaa Tharwat,et al.  Linear vs. quadratic discriminant analysis classifier: a tutorial , 2016, Int. J. Appl. Pattern Recognit..

[22]  Aboul Ella Hassanien,et al.  Generalization of Fuzzy C-Means based on Neutrosophic Logic , 2018 .

[23]  Bogdan E. Popescu,et al.  PREDICTIVE LEARNING VIA RULE ENSEMBLES , 2008, 0811.1679.

[24]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[25]  Md. Ridwan Al Iqbal Rule Extraction from Ensemble Methods Using Aggregated Decision Trees , 2012, ICONIP.

[26]  Alaa Tharwat,et al.  Classification assessment methods , 2020, Applied Computing and Informatics.

[27]  K. Atanassov More on intuitionistic fuzzy sets , 1989 .

[28]  George J. Klir,et al.  Fuzzy Sets, Fuzzy Logic, and Fuzzy Systems - Selected Papers by Lotfi A Zadeh , 1996, Advances in Fuzzy Systems - Applications and Theory.

[29]  Thomas Sander,et al.  DataWarrior: An Open-Source Program For Chemistry Aware Data Visualization And Analysis , 2015, J. Chem. Inf. Model..

[30]  Aboul Ella Hassanien,et al.  Linear discriminant analysis: A detailed tutorial , 2017, AI Commun..

[31]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[32]  Qiang Shen,et al.  Finding Rough Set Reducts with Ant Colony Optimization , 2003 .

[33]  C. Ashbacher Introduction to neutrosophic logic , 2002 .

[34]  Seetha Hari,et al.  Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.

[35]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[36]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[37]  Qingsong Xu,et al.  Computer‐aided prediction of toxicity with substructure pattern and random forest , 2012 .

[38]  M J Prival,et al.  Evaluation of the TOPKAT system for predicting the carcinogenicity of chemicals , 2001, Environmental and molecular mutagenesis.

[39]  A. A. Salama,et al.  NEW CONCEPTS OF NEUTROSOPHIC SETS , 2013 .

[40]  Mohamed Elhoseny,et al.  Genetic Algorithm Based Model For Optimizing Bank Lending Decisions , 2017, Expert Syst. Appl..

[41]  Aboul Ella Hassanien,et al.  Classification of toxicity effects of biotransformed hepatic drugs using whale optimized support vector machines , 2017, J. Biomed. Informatics.

[42]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[43]  Francisco Herrera,et al.  An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics , 2013, Inf. Sci..

[44]  Aboul Ella Hassanien,et al.  Neutrosophic rule-based prediction system for assessment of pollution on Benthic Foraminifera in Burullus Lagoon in Egypt , 2017, 2017 12th International Conference on Computer Engineering and Systems (ICCES).

[45]  Duoqian Miao,et al.  A rough set approach to feature selection based on ant colony optimization , 2010, Pattern Recognit. Lett..

[46]  Aboul Ella Hassanien,et al.  NRCS: Neutrosophic Rule-Based Classification System , 2016 .

[47]  Yoav Freund,et al.  Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[48]  Aboul Ella Hassanien,et al.  Towards an Automated Zebrafish-based Toxicity Test Model Using Machine Learning , 2015 .

[49]  Vincenzo Cutello,et al.  Fuzzy classification systems , 2004, Eur. J. Oper. Res..

[50]  Yumin Chen,et al.  Finding rough set reducts with fish swarm algorithm , 2015, Knowl. Based Syst..

[51]  Aboul Ella Hassanien,et al.  GNRCS: Hybrid Classification System based on Neutrosophic Logic and Genetic Algorithm , 2016, 2016 12th International Computer Engineering Conference (ICENCO).

[52]  Gary M. Weiss,et al.  Cost-Sensitive Learning vs. Sampling: Which is Best for Handling Unbalanced Classes with Unequal Error Costs? , 2007, DMIN.

[53]  Thomas Sander,et al.  Toxicity-Indicating Structural Patterns , 2006, J. Chem. Inf. Model..

[54]  Andrew K. C. Wong,et al.  Classification of Imbalanced Data: a Review , 2009, Int. J. Pattern Recognit. Artif. Intell..

[55]  A. Robinson Non-standard analysis , 1966 .

[56]  Mohamed Elhoseny,et al.  Bezier Curve Based Path Planning in a Dynamic Field using Modified Genetic Algorithm , 2017, J. Comput. Sci..

[57]  Yanqing Zhang,et al.  Interval Neutrosophic Sets and Logic: Theory and Applications in Computing , 2005, ArXiv.

[58]  Nuno A. Fonseca,et al.  Comparative Study of Classification Algorithms Using Molecular Descriptors in Toxicological DataBases , 2009, BSB.

[59]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[60]  Ahmad Taher Azar,et al.  Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis , 2014, Comput. Methods Programs Biomed..

[61]  G. Klopman Artificial intelligence approach to structure-activity studies. Computer automated structure evaluation of biological activity of organic molecules , 1985 .

[62]  Aboul Ella Hassanien,et al.  Classification of Toxicity Effects of Biotransformed Hepatic Drugs Using Optimized Support Vector Machine , 2017, AISI.

[63]  Aboul Ella Hassanien,et al.  A New Multi-layer Perceptrons Trainer Based on Ant Lion Optimization Algorithm , 2015, 2015 Fourth International Conference on Information Science and Industrial Applications (ISI).

[64]  Theresa Beaubouef,et al.  Rough Sets , 2019, Lecture Notes in Computer Science.

[65]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..