Prediction of selective estrogen receptor beta agonist using open data and machine learning approach

Background Estrogen receptors (ERs) are nuclear transcription factors that are involved in the regulation of many complex physiological processes in humans. ERs have been validated as important drug targets for the treatment of various diseases, including breast cancer, ovarian cancer, osteoporosis, and cardiovascular disease. ERs have two subtypes, ER-α and ER-β. Emerging data suggest that the development of subtype-selective ligands that specifically target ER-β could be a more optimal approach to elicit beneficial estrogen-like activities and reduce side effects. Methods Herein, we focused on ER-β and developed its in silico quantitative structure-activity relationship models using machine learning (ML) methods. Results The chemical structures and ER-β bioactivity data were extracted from public chemogenomics databases. Four types of popular fingerprint generation methods including MACCS fingerprint, PubChem fingerprint, 2D atom pairs, and Chemistry Development Kit extended fingerprint were used as descriptors. Four ML methods including Naïve Bayesian classifier, k-nearest neighbor, random forest, and support vector machine were used to train the models. The range of classification accuracies was 77.10% to 88.34%, and the range of area under the ROC (receiver operating characteristic) curve values was 0.8151 to 0.9475, evaluated by the 5-fold cross-validation. Comparison analysis suggests that both the random forest and the support vector machine are superior for the classification of selective ER-β agonists. Chemistry Development Kit extended fingerprints and MACCS fingerprint performed better in structural representation between active and inactive agonists. Conclusion These results demonstrate that combining the fingerprint and ML approaches leads to robust ER-β agonist prediction models, which are potentially applicable to the identification of selective ER-β agonists.

[1]  Margaret Warner,et al.  Estrogen receptors: how do they signal and what are their targets. , 2007, Physiological reviews.

[2]  B. Katzenellenbogen,et al.  Molecular mechanisms of estrogen action: selective ligands and receptor pharmacology , 2000, The Journal of Steroid Biochemistry and Molecular Biology.

[3]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[4]  R. Girgert,et al.  Estrogen receptor β selective agonists reduce invasiveness of triple-negative breast cancer cells. , 2015, International journal of oncology.

[5]  Hongmao Sun A naive bayes classifier for prediction of multidrug resistance reversal activity on the basis of atom typing. , 2005, Journal of medicinal chemistry.

[6]  L. Hartmann,et al.  Selective estrogen-receptor modulators -- mechanisms of action and application to clinical practice. , 2003, The New England journal of medicine.

[7]  Zhaohui J. Cai,et al.  Pretreatment data is highly predictive of liver chemistry signals in clinical trials , 2012, Drug design, development and therapy.

[8]  A. Brown,et al.  Sulfonamides as selective oestrogen receptor β agonists. , 2011, Bioorganic & medicinal chemistry letters.

[9]  Weida Tong,et al.  Development and Validation of Decision Forest Model for Estrogen Receptor Binding Prediction of Chemicals Using Large Data Sets. , 2015, Chemical research in toxicology.

[10]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[11]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.

[12]  R. Houtman,et al.  Cell proliferation and modulation of interaction of estrogen receptors with coregulators induced by ERα and ERβ agonists , 2014, The Journal of Steroid Biochemistry and Molecular Biology.

[13]  Feixiong Cheng,et al.  In silico Prediction of Chemical Ames Mutagenicity , 2012, J. Chem. Inf. Model..

[14]  A. Franchitto,et al.  An oestrogen receptor β-selective agonist exerts anti-neoplastic effects in experimental intrahepatic cholangiocarcinoma. , 2012, Digestive and liver disease : official journal of the Italian Society of Gastroenterology and the Italian Association for the Study of the Liver.

[15]  Richard S. Judson,et al.  Binary Classification of a Large Collection of Environmental Chemicals from Estrogen Receptor Assays by Quantitative Structure-Activity Relationship and Machine Learning Methods , 2013, J. Chem. Inf. Model..

[16]  Lei Chen,et al.  ADME evaluation in drug discovery. 10. Predictions of P-glycoprotein inhibitors using recursive partitioning and naive Bayesian classification techniques. , 2011, Molecular pharmaceutics.

[17]  V. Jordan,et al.  Send Orders of Reprints at Reprints@benthamscience.net the Discovery and Development of Selective Estrogen Receptor Modulators (serms) for Clinical Practice the Biological Basis of Serm Action: Target Tissue Specific Actions , 2022 .

[18]  J. Katzenellenbogen,et al.  Estrogen receptors alpha (ERα) and beta (ERβ): Subtype-selective ligands and clinical potential , 2014, Steroids.

[19]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[20]  K. Dhingra Selective Estrogen Receptor Modulation: The Search for an Ideal Hormonal Therapy for Breast Cancer , 2001, Cancer investigation.

[21]  B. Katzenellenbogen,et al.  Highly Selective Salicylketoxime-Based Estrogen Receptor β Agonists Display Antiproliferative Activities in a Glioma Model , 2015, Journal of medicinal chemistry.

[22]  Lei Chen,et al.  Selective ligands of estrogen receptor β discovered using pharmacophore mapping and structure-based virtual screening , 2014, Acta Pharmacologica Sinica.

[23]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[24]  Kerenaftali Klein,et al.  A Bayesian Modelling Approach with Balancing Informative Prior for Analysing Imbalanced Data , 2016, PloS one.

[25]  CHUN WEI YAP,et al.  PaDEL‐descriptor: An open source software to calculate molecular descriptors and fingerprints , 2011, J. Comput. Chem..

[26]  Lei Yang,et al.  Classification of Cytochrome P450 Inhibitors and Noninhibitors Using Combined Classifiers , 2011, J. Chem. Inf. Model..

[27]  M. W. B. Trotter,et al.  Support vector machines for drug discovery , 2007 .

[28]  Konrad F. Koehler,et al.  Development of subtype-selective oestrogen receptor-based therapeutics , 2011, Nature Reviews Drug Discovery.

[29]  K. Dahlman-Wright,et al.  Estrogen receptor alpha and beta in health and disease. , 2015, Best practice & research. Clinical endocrinology & metabolism.

[30]  Swagatam Das,et al.  Near-Bayesian Support Vector Machines for imbalanced data classification with equal or unequal misclassification costs , 2015, Neural Networks.

[31]  P. Fuller,et al.  The importance of ERbeta signalling in the ovary. , 2010, The Journal of endocrinology.

[32]  S. Gapstur,et al.  Selective estrogen receptor modulation and reduction in risk of breast cancer, osteoporosis, and coronary heart disease. , 2001, Journal of the National Cancer Institute.

[33]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[34]  Marco Macchia,et al.  Estrogen receptor β ligands: Recent advances and biomedical applications , 2011, Medicinal research reviews.

[35]  A. Taylor,et al.  Immunolocalisation of oestrogen receptor beta in human tissues. , 2000, Journal of molecular endocrinology.