An informatic pipeline for managing high-throughput screening experiments and analyzing data from stereochemically diverse libraries

Integration of flexible data-analysis tools with cheminformatics methods is a prerequisite for successful identification and validation of “hits” in high-throughput screening (HTS) campaigns. We have designed, developed, and implemented a suite of robust yet flexible cheminformatics tools to support HTS activities at the Broad Institute, three of which are described herein. The “hit-calling” tool allows a researcher to set a hit threshold that can be varied during downstream analysis. The results from the hit-calling exercise are reported to a database for record keeping and further data analysis. The “cherry-picking” tool enables creation of an optimized list of hits for confirmatory and follow-up assays from an HTS hit list. This tool allows filtering by computed chemical property and by substructure. In addition, similarity searches can be performed on hits of interest and sets of related compounds can be selected. The third tool, an “S/SAR viewer,” has been designed specifically for the Broad Institute’s diversity-oriented synthesis (DOS) collection. The compounds in this collection are rich in chiral centers and the full complement of all possible stereoisomers of a given compound are present in the collection. The S/SAR viewer allows rapid identification of both structure/activity relationships and stereo-structure/activity relationships present in HTS data from the DOS collection. Together, these tools enable the prioritization and analysis of hits from diverse compound collections, and enable informed decisions for follow-up biology and chemistry efforts.

[1]  Nathan T. Ross,et al.  An aldol-based build/couple/pair strategy for the synthesis of medium- and large-sized rings: discovery of macrocyclic histone deacetylase inhibitors. , 2010, Journal of the American Chemical Society.

[2]  Paul A Clemons,et al.  Relationship of stereochemical and skeletal diversity of small molecules to cellular measurement space. , 2004, Journal of the American Chemical Society.

[3]  Jason T. Lowe,et al.  Synthesis of a 35-member stereoisomer library of bistramide A: evaluation of effects on actin state, cell cycle and tumor cell growth. , 2009, The Journal of organic chemistry.

[4]  Patrick W. Faloon,et al.  Synthesis of a novel suppressor of beta-cell apoptosis via diversity-oriented synthesis. , 2011, ACS medicinal chemistry letters.

[5]  Sanjay Joshua Swamidass,et al.  Utility-Aware Screening with Clique-Oriented Prioritization , 2012, J. Chem. Inf. Model..

[6]  J. Reynisson,et al.  Bond stability of the "undesirable" heteroatom-heteroatom molecular moieties for high-throughput screening libraries. , 2011, European journal of medicinal chemistry.

[7]  N. Meanwell Synopsis of some recent tactical application of bioisosteres in drug design. , 2011, Journal of medicinal chemistry.

[8]  C. Richard,et al.  Fluorous mixture synthesis of stereoisomer libraries: total syntheses of (+)-murisolin and fifteen diastereoisomers. , 2004, Journal of the American Chemical Society.

[9]  Andreas Sewing,et al.  Evaluating Real-Life High-Throughput Screening Data , 2005, Journal of biomolecular screening.

[10]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[11]  S Joshua Swamidass,et al.  An Economic Framework to Prioritize Confirmatory Tests after a High-Throughput Screen , 2010, Journal of biomolecular screening.

[12]  Martin Serrano,et al.  Nucleic Acids Research Advance Access published October 18, 2007 ChemBank: a small-molecule screening and , 2007 .

[13]  Jeremy R. Duvall,et al.  Synthesis of a stereochemically diverse library of medium-sized lactams and sultams via S(N)Ar cycloetherification. , 2011, ACS combinatorial science.

[14]  Sanjay Joshua Swamidass,et al.  Enhancing the rate of scaffold discovery with diversity-oriented prioritization , 2011, Bioinform..

[15]  P. Verhoest,et al.  Moving beyond rules: the development of a central nervous system multiparameter optimization (CNS MPO) approach to enable alignment of druglike properties. , 2010, ACS chemical neuroscience.

[16]  C. Wilcox,et al.  Total synthesis of a 28-member stereoisomer library of murisolins. , 2006, Journal of the American Chemical Society.

[17]  Paul A. Clemons,et al.  Using biological performance similarity to inform disaccharide library design. , 2009, Journal of the American Chemical Society.

[18]  Nathalie Meurice,et al.  Softening the Rule of Five--where to draw the line? , 2012, Bioorganic & medicinal chemistry.

[19]  Gavin Harper,et al.  Methods for mining HTS data. , 2006, Drug discovery today.

[20]  D. Curran,et al.  Synthesis of all 16 stereoisomers of pinesaw fly sex pheromones--tools and tactics for solving problems in fluorous mixture synthesis. , 2005, The Journal of organic chemistry.

[21]  C. Humblet,et al.  Escape from flatland: increasing saturation as an approach to improving clinical success. , 2009, Journal of medicinal chemistry.

[22]  Stuart L. Schreiber,et al.  Quantifying structure and performance diversity for sets of small molecules comprising small-molecule screening collections , 2011, Proceedings of the National Academy of Sciences.

[23]  Lorenz M Mayr,et al.  The Future of High-Throughput Screening , 2008, Journal of biomolecular screening.

[24]  Ernesto Callegari,et al.  A comprehensive listing of bioactivation pathways of organic functional groups. , 2005, Current drug metabolism.

[25]  Mathias Wawer,et al.  Extraction of structure-activity relationship information from high-throughput screening data. , 2009, Current medicinal chemistry.

[26]  G. V. Paolini,et al.  Quantifying the chemical beauty of drugs. , 2012, Nature chemistry.

[27]  Xiang Yao,et al.  Advanced Biological and Chemical Discovery (ABCD): Centralizing Discovery Knowledge in an Inherently Decentralized World , 2007, J. Chem. Inf. Model..

[28]  Alexander Chuprina,et al.  Drug- and Lead-likeness, Target Class, and Molecular Diversity Analysis of 7.9 Million Commercially Available Organic Compounds Provided by 29 Suppliers , 2010, J. Chem. Inf. Model..

[29]  Jörg Rademann,et al.  Design of chemical libraries with potentially bioactive molecules applying a maximum common substructure concept , 2009, Molecular Diversity.

[30]  Carol A. Mulrooney,et al.  Build/couple/pair strategy for the synthesis of stereochemically diverse macrolactams via head-to-tail cyclization. , 2012, ACS combinatorial science.

[31]  Peter S. Kutchukian,et al.  Rethinking molecular similarity: comparing compounds on the basis of biological activity. , 2012, ACS chemical biology.

[32]  Stuart L. Schreiber,et al.  Small molecules of different origins have distinct distributions of structural complexity that correlate with protein-binding profiles , 2010, Proceedings of the National Academy of Sciences.

[33]  Thomas Sander,et al.  OSIRIS, an Entirely in-House Developed Drug Discovery Informatics System , 2009, J. Chem. Inf. Model..