Significance estimation for sequence-based chemical similarity searching (PhAST) and application to AuroraA kinase inhibitors.

BACKGROUND Chemical similarity searching allows the retrieval of preferred screening molecules from a compound database. Candidates are ranked according to their similarity to a reference compound (query). Assessing the statistical significance of chemical similarity scores helps prioritizing significant hits, and identifying cases where the database does not contain any promising compounds. METHOD Our text-based similarity measure, Pharmacophore Alignment Search Tool (PhAST), employs pair-wise sequence alignment. We adapted the concept of E-values as significance estimates and employed a sampling technique that incorporates the principle of importance sampling in a Markov chain Monte Carlo simulation to generate distributions of random alignment scores. These distributions were used to compute significance estimates for similarity scores in a preliminary prospective virtual screen for inhibitors of Aurora A kinase. CONCLUSION Assessing the significance of compound similarity computed with PhAST allows for a statistically motivated identification of candidate screening compounds. Inhibitors of Aurora A kinase were retrieved from a large compound library.

[1]  Markus Hartenfeller,et al.  A Collection of Robust Organic Synthesis Reactions for In Silico Molecule Design , 2011, J. Chem. Inf. Model..

[2]  Wei Chen,et al.  Antitumor activity of MLN8054, an orally active small-molecule inhibitor of Aurora A kinase , 2007, Proceedings of the National Academy of Sciences.

[3]  Richard A Prayson,et al.  Aurora B expression correlates with aggressive behaviour in glioblastoma multiforme , 2006, Journal of Clinical Pathology.

[4]  Wang Shen,et al.  Discovery of a potent and selective aurora kinase inhibitor. , 2008, Bioorganic & medicinal chemistry letters.

[5]  A. Hopkins,et al.  Ligand efficiency: a useful metric for lead selection. , 2004, Drug discovery today.

[6]  Lorenz C. Blum,et al.  970 million druglike small molecules for virtual screening in the chemical universe database GDB-13. , 2009, Journal of the American Chemical Society.

[7]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[8]  Markus Hartenfeller,et al.  Probing the Bioactivity-Relevant Chemical Space of Robust Reactions and Common Molecular Building Blocks , 2012, J. Chem. Inf. Model..

[9]  Pierre Baldi,et al.  When is Chemical Similarity Significant? The Statistical Distribution of Chemical Similarity Scores and Its Extreme Values , 2010, J. Chem. Inf. Model..

[10]  Oliver Gautschi,et al.  Aurora Kinases as Anticancer Drug Targets , 2008, Clinical Cancer Research.

[11]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[12]  Yuval Kluger,et al.  Expression of Aurora A (but Not Aurora B) Is Predictive of Survival in Breast Cancer , 2008, Clinical Cancer Research.

[13]  Markus Hartenfeller,et al.  DOGS: Reaction-Driven de novo Design of Bioactive Compounds , 2012, PLoS Comput. Biol..

[14]  Markus Hartenfeller,et al.  Multistep virtual screening for rapid and efficient identification of non-nucleoside bacterial thymidine kinase inhibitors. , 2010, Chemistry.

[15]  Theonie Anastassiadis,et al.  Comprehensive assay of kinase catalytic activity reveals features of kinase inhibitor selectivity , 2011, Nature biotechnology.

[16]  Risi Kondor,et al.  Diffusion kernels on graphs and other discrete structures , 2002, ICML 2002.

[17]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[18]  Haiyong Han,et al.  Comparing Aurora A and Aurora B as molecular targets for growth inhibition of pancreatic cancer cells , 2006, Molecular Cancer Therapeutics.

[19]  Christoph Steinbeck,et al.  Chemical Entities of Biological Interest: an update , 2009, Nucleic Acids Res..

[20]  Erich A. Nigg,et al.  Cell division: Mitotic kinases as regulators of cell division and its checkpoints , 2001, Nature Reviews Molecular Cell Biology.

[21]  Tudor I. Oprea,et al.  Virtual and biomolecular screening converge on a selective agonist for GPR30 , 2006, Nature chemical biology.

[22]  Jürgen Bajorath,et al.  Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches. , 2007, Drug discovery today.

[23]  Herbert Köppen Virtual screening - what does it give us? , 2009, Current opinion in drug discovery & development.

[24]  Peter Gais,et al.  Aurora Kinase A Messenger RNA Overexpression Is Correlated with Tumor Progression and Shortened Survival in Head and Neck Squamous Cell Carcinoma , 2006, Clinical Cancer Research.

[25]  Pierre Baldi,et al.  BLASTing small molecules—statistics and extreme statistics of chemical similarity scores , 2008, ISMB.

[26]  W. Guida,et al.  The art and practice of structure‐based drug design: A molecular modeling perspective , 1996, Medicinal research reviews.

[27]  Jun Yoshida,et al.  Inhibition of Aurora-B function increases formation of multinucleated cells in p53 gene deficient cells and enhances anti-tumor effect of temozolomide in human glioma cells , 2007, Journal of Neuro-Oncology.

[28]  G. Schneider,et al.  Identification of natural-product-derived inhibitors of 5-lipoxygenase activity by ligand-based virtual screening. , 2007, Journal of medicinal chemistry.

[29]  Stephen S. Taylor,et al.  Aurora-kinase inhibitors as anticancer agents , 2004, Nature Reviews Cancer.

[30]  Mutasem O. Taha,et al.  Elaborate Ligand-Based Modeling Reveals New Nanomolar Heat Shock Protein 90α Inhibitors , 2010, J. Chem. Inf. Model..

[31]  H. Stark,et al.  Lead identification and optimization of diaminopyrimidines as histamine H4 receptor ligands , 2009, Inflammation Research.

[32]  Darren Fayne,et al.  Rational ligand-based virtual screening and structure-activity relationship studies in the ligand-binding domain of the glucocorticoid receptor-α. , 2009, Future medicinal chemistry.

[33]  S. Deacon,et al.  The challenge of selecting protein kinase assays for lead discovery optimization , 2008, Expert opinion on drug discovery.

[34]  Gisbert Schneider,et al.  PhAST: pharmacophore alignment search tool , 2009, J. Comput. Chem..

[35]  Richard R. Adams,et al.  Essential Roles of Drosophila Inner Centromere Protein (Incenp) and Aurora B in Histone H3 Phosphorylation, Metaphase Chromosome Alignment, Kinetochore Disjunction, and Chromosome Segregation , 2001, The Journal of cell biology.

[36]  Eiji Kohmura,et al.  Aurora-A Kinase Maintains the Fidelity of Early and Late Mitotic Events in HeLa Cells* , 2003, Journal of Biological Chemistry.

[37]  Friedrich Rippmann,et al.  Pharmacophore alignment search tool: Influence of the third dimension on text‐based similarity searching , 2011, J. Comput. Chem..

[38]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[39]  Lawrence C. Brody,et al.  Functional Characterization of BRCA1 Sequence Variants using a Yeast Small Colony Phenotype Assay , 2004, Cancer biology & therapy.

[40]  Gisbert Schneider,et al.  Discovery of γ-secretase modulators with a novel activity profile by text-based virtual screening. , 2012, ACS chemical biology.

[41]  Stephen S. Taylor,et al.  Aurora B couples chromosome alignment with anaphase by targeting BubR1, Mad2, and Cenp-E to kinetochores , 2003, The Journal of cell biology.

[42]  Alexander K Hartmann,et al.  Sampling rare events: statistics of local sequence alignments. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  Mladen Vinković,et al.  Fragment-based discovery of the pyrazol-4-yl urea (AT9283), a multitargeted kinase inhibitor with potent aurora kinase activity. , 2009, Journal of medicinal chemistry.

[44]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[45]  Thierry Langer,et al.  Virtual screening for the discovery of bioactive natural products , 2008, Progress in drug research. Fortschritte der Arzneimittelforschung. Progres des recherches pharmaceutiques.

[46]  Hanna Geppert,et al.  Targeting multifunctional proteins by virtual screening: structurally diverse cytohesin inhibitors with differentiated biological functions. , 2010, ACS chemical biology.

[47]  Haiyong Han,et al.  The mitotic serine threonine kinase, Aurora-2, is a potential target for drug development in human pancreatic cancer. , 2004, Molecular cancer therapeutics.

[48]  Anna Frolov,et al.  Targeting Aurora kinases for the treatment of prostate cancer. , 2006, Cancer research.

[49]  G. Giaccone,et al.  Frequent overexpression of aurora B kinase, a novel drug target, in non–small cell lung carcinoma patients , 2006, Molecular Cancer Therapeutics.

[50]  I. J. Good,et al.  C140. Standardized tail-area prosabilities , 1982 .

[51]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[52]  Sven Rahmann,et al.  Accurate statistics for local sequence alignment with position-dependent scoring by rare-event sampling , 2011, BMC Bioinformatics.

[53]  Gisbert Schneider,et al.  Virtual screening: an endless staircase? , 2010, Nature Reviews Drug Discovery.

[54]  Jun Sano,et al.  Centrosomal Kinases, HsAIRK1 and HsAIRK3, are Overexpressed in Primary Colorectal Cancers , 2000, Japanese journal of cancer research : Gann.

[55]  K. Kinzler,et al.  Genetic instabilities in human cancers , 1998, Nature.

[56]  Tony Jebara,et al.  Minimum Volume Embedding , 2007, AISTATS.

[57]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[58]  Gisbert Schneider,et al.  Collection of bioactive reference compounds for focused library design , 2003 .

[59]  Jürgen Bajorath,et al.  Introduction of the Conditional Correlated Bernoulli Model of Similarity Value Distributions and its Application to the Prospective Prediction of Fingerprint Search Performance , 2011, J. Chem. Inf. Model..

[60]  Michael J. Keiser,et al.  Off-target networks derived from ligand set similarity. , 2009, Methods in molecular biology.

[61]  Evan Bolton,et al.  PubChem3D: Biologically relevant 3-D similarity , 2011, J. Cheminformatics.

[62]  Tudor I. Oprea,et al.  Optimization of CAMD techniques 3. Virtual screening enrichment studies: a help or hindrance in tool selection? , 2008, J. Comput. Aided Mol. Des..

[63]  Paola Storici,et al.  PHA-739358, a potent inhibitor of Aurora kinases with a selective target inhibition profile relevant to cancer , 2007, Molecular Cancer Therapeutics.

[64]  Ulrich Rester,et al.  From virtuality to reality - Virtual screening in lead discovery and lead optimization: a medicinal chemistry perspective. , 2008, Current opinion in drug discovery & development.

[65]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[66]  John J. Irwin,et al.  Community benchmarks for virtual screening , 2008, J. Comput. Aided Mol. Des..

[67]  Gisbert Schneider,et al.  Potent Inhibitors of 5‐Lipoxygenase Identified using Pseudoreceptors , 2011, ChemMedChem.

[68]  Lee Aaron Newberg Significance of Gapped Sequence Alignments , 2008, J. Comput. Biol..

[69]  Friedrich Rippmann,et al.  Pharmacophore alignment search tool: Influence of canonical atom labeling on similarity searching , 2010, J. Comput. Chem..

[70]  David Bebbington,et al.  VX-680, a potent and selective small-molecule inhibitor of the Aurora kinases, suppresses tumor growth in vivo , 2004, Nature Medicine.

[71]  Ruben Abagyan,et al.  Structure-based discovery of novel chemotypes for adenosine A(2A) receptor antagonists. , 2010, Journal of medicinal chemistry.

[72]  Peter Willett,et al.  Similarity-based virtual screening using 2D fingerprints. , 2006, Drug discovery today.