Data Integration with Self-organising Neural Network Reveals Chemical Structure and Therapeutic Effects of Drug ATC Codes

Anatomical Therapeutic Chemical (ATC) codes form a drug classification system that is used extensively in drug development research. Many drugs and medical compounds do not yet have ATC codes, so it would be useful to assign codes to them automatically by computational methods. Our initial work involved building feedforward multi-layer perceptron (MLP) models, but their classification accuracy was poor. To gain insight into the problem, we used the Kohonen self-organizing neural network to visualize the relationship between the class labels and the independent variables. The learned internal clusters gave a deeper insight into the mapping process. Predictive accuracy was uneven across classes because some ATC classes are over-represented and others under-represented. Further difficulties arise because many drugs have several therapeutic uses and therefore several quite different ATC codes. We used chemical fingerprint data representing a drug's chemical structure and chemical activity variables. Evaluation metrics were computed to analyse the predictive performance of various self-organizing models.
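
The visualization step described above can be illustrated with a minimal sketch. It is not the implementation used in this work: the fingerprints are synthetic random bit vectors, the two class labels ("A" and "N") are placeholders, and the helper names `train_som`, `best_matching_units` and `unit_label_counts` are hypothetical. It trains a small Kohonen map with NumPy and then counts which class labels land on each map unit, which is the kind of label-versus-cluster view that exposes overlapping or under-represented classes.

```python
import numpy as np
from collections import Counter

def train_som(data, rows=4, cols=4, epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Train a small rectangular Kohonen map; returns weights (rows*cols, n_features)."""
    rng = np.random.default_rng(seed)
    n, d = data.shape
    weights = rng.random((rows * cols, d))
    # Grid coordinates of every map unit, used for neighbourhood distances.
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    total_steps = epochs * n
    step = 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)                # learning rate decays linearly
            sigma = sigma0 * (1.0 - frac) + 0.5    # neighbourhood radius shrinks, stays > 0
            x = data[i]
            bmu = np.argmin(((weights - x) ** 2).sum(axis=1))  # best matching unit
            grid_dist2 = ((grid - grid[bmu]) ** 2).sum(axis=1)
            h = np.exp(-grid_dist2 / (2.0 * sigma ** 2))       # Gaussian neighbourhood
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights

def best_matching_units(data, weights):
    """Index of the closest map unit for every row of data."""
    d2 = ((data[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def unit_label_counts(bmus, labels):
    """Map each occupied unit to a Counter of the class labels assigned to it."""
    counts = {}
    for unit, label in zip(bmus, labels):
        counts.setdefault(int(unit), Counter())[label] += 1
    return counts

# Synthetic stand-in for fingerprint data (assumption): 32-bit vectors, two toy
# classes whose "on" bits are concentrated in different halves of the vector.
rng = np.random.default_rng(42)
p_a = np.r_[np.full(16, 0.8), np.full(16, 0.1)]
p_n = p_a[::-1]
fingerprints = np.vstack([
    (rng.random((30, 32)) < p_a).astype(float),
    (rng.random((30, 32)) < p_n).astype(float),
])
labels = ["A"] * 30 + ["N"] * 30   # placeholder class labels

weights = train_som(fingerprints)
bmus = best_matching_units(fingerprints, weights)
counts = unit_label_counts(bmus, labels)

for unit in sorted(counts):
    print(unit, dict(counts[unit]))
```

Units whose counters mix several labels mark regions of the map where the classes overlap in fingerprint space. A drug with several ATC codes would appear once per code in `labels`, so it would contribute to more than one label count on the same unit.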
