Computational analysis and predictive modeling of small molecule modulators of microRNA

BackgroundMicroRNAs (miRNA) are small endogenously transcribed regulatory RNA which modulates gene expression at a post transcriptional level. These small RNAs have now been shown to be critical regulators in a number of biological processes in the cell including pathophysiology of diseases like cancers. The increasingly evident roles of microRNA in disease processes have also motivated attempts to target them therapeutically. Recently there has been immense interest in understanding small molecule mediated regulation of RNA, including microRNA.ResultsWe have used publicly available datasets of high throughput screens on small molecules with potential to inhibit microRNA. We employed computational methods based on chemical descriptors and machine learning to create predictive computational models for biological activity of small molecules. We further used a substructure based approach to understand common substructures potentially contributing to the activity.ConclusionWe generated computational models based on Naïve Bayes and Random Forest towards mining small RNA binding molecules from large molecular datasets. We complement this with substructure based approach to identify and understand potentially enriched substructures in the active dataset. We use this approach to identify miRNA binding potential of a set of approved drugs, suggesting a probable novel mechanism of off-target activity of these drugs. To the best of our knowledge, this is the first and most comprehensive computational analysis towards understanding RNA binding activities of small molecules and predictive modeling of these activities.

[1]  W. Filipowicz,et al.  Mechanisms of post-transcriptional regulation by microRNAs: are the answers in sight? , 2008, Nature Reviews Genetics.

[2]  George A Calin,et al.  Small molecule enoxacin is a cancer-specific growth inhibitor that acts by enhancing TAR RNA-binding protein 2-mediated microRNA processing , 2011, Proceedings of the National Academy of Sciences.

[3]  Tao Jiang,et al.  A maximum common substructure-based algorithm for searching and predicting drug-like compounds , 2008, ISMB.

[4]  Rok Blagus,et al.  Class prediction for high-dimensional class-imbalanced data , 2010, BMC Bioinformatics.

[5]  Friedrich Miescher,et al.  Mechanisms of miRNA-mediated post-transcriptional regulation in animal cells , 2009 .

[6]  Todd A. Anderson,et al.  Computational identification of microRNAs and their targets , 2006, Comput. Biol. Chem..

[7]  Douglas D Young,et al.  Small molecule modifiers of microRNA miR-122 function for the treatment of hepatitis C virus infection and hepatocellular carcinoma. , 2010, Journal of the American Chemical Society.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Kuan-Teh Jeang,et al.  Identification of Small Molecules That Suppress MicroRNA Function and Reverse Tumorigenesis* , 2010, The Journal of Biological Chemistry.

[10]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[11]  Qihong Huang,et al.  Small-molecule inhibitors of microrna miR-21 function. , 2008, Angewandte Chemie.

[12]  T. Graeber,et al.  MicroRNA-21 targets the vitamin D-dependent antimicrobial pathway in leprosy , 2011, Nature Medicine.

[13]  Yanli Wang,et al.  PubChem: a public information system for analyzing bioactivities of small molecules , 2009, Nucleic Acids Res..

[14]  Guangxian Xu,et al.  Cloning and identification of microRNAs in bovine alveolar macrophages , 2009, Molecular and Cellular Biochemistry.

[15]  S. Tyagi,et al.  MicroRNAs as a therapeutic target for cardiovascular diseases , 2009, Journal of cellular and molecular medicine.

[16]  Decheng Yang,et al.  MicroRNA: an Emerging Therapeutic Target and Intervention Tool , 2008, International journal of molecular sciences.

[17]  James Inglese,et al.  A specific mechanism for nonspecific activation in reporter-gene assays. , 2008, ACS chemical biology.

[18]  V. Ambros microRNAs Tiny Regulators with Great Potential , 2001, Cell.

[19]  Johnf . Thompson,et al.  Modulation of firefly luciferase stability and impact on studies of gene regulation. , 1991, Gene.

[20]  Qian Gao,et al.  Comparative miRNA Expression Profiles in Individuals with Latent and Active Tuberculosis , 2011, PloS one.

[21]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[22]  Vinod Scaria,et al.  Predictive models for anti-tubercular molecules using machine learning on high-throughput biological screening datasets , 2011, BMC Research Notes.

[23]  V. Scaria,et al.  MicroRNAs: novel therapeutic targets in neurodegenerative diseases. , 2009, Drug discovery today.

[24]  Harald Mauser,et al.  Database Clustering with a Combination of Fingerprint and Maximum Common Substructure Methods , 2005, J. Chem. Inf. Model..

[25]  Vinod Scaria,et al.  Computational models for in-vitro anti-tubercular activity of molecules based on high-throughput chemical biology screening datasets , 2012, BMC pharmacology.

[26]  Z. Paroo,et al.  A small molecule enhances RNA interference and promotes microRNA processing , 2008, Nature Biotechnology.

[27]  V. Scaria,et al.  microRNA: an Emerging Therapeutic , 2007, ChemMedChem.

[28]  Ian H. Witten,et al.  WEKA - Experiences with a Java Open-Source Project , 2010, J. Mach. Learn. Res..

[29]  Amanda C. Schierz Virtual screening of bioassay data , 2009, J. Cheminformatics.

[30]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[31]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[32]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[33]  S. Diamond,et al.  Small Molecule Inhibition of RISC Loading , 2011, ACS chemical biology.

[34]  A. T. Freitas,et al.  Current tools for the identification of miRNA genes and their targets , 2009, Nucleic acids research.

[35]  W. Cho OncomiRs: the discovery and progress of microRNAs in cancers , 2007, Molecular Cancer.

[36]  K. Chaudhuri,et al.  MicroRNA detection and target prediction: integration of computational and experimental approaches. , 2007, DNA and cell biology.

[37]  Jun Feng,et al.  PowerMV: A Software Environment for Molecular Viewing, Descriptor Generation, Data Analysis and Hit Evaluation , 2005, J. Chem. Inf. Model..

[38]  G. Parkinson,et al.  Structural basis of telomeric RNA quadruplex--acridine ligand recognition. , 2011, Journal of the American Chemical Society.

[39]  Jonathan D Hirst,et al.  Machine learning in virtual screening. , 2009, Combinatorial chemistry & high throughput screening.