A global resource for computational chemistry

A modular distributable system has been built for high-throughput computation of molecular structures and properties. It has been used to process 250,000 compounds from the NCI database and to make the results searchable by structures and properties. The IUPAC/NIST InChI specification and algorithm has been used to index the structures and enforce integrity during computation. A number of novel features of the PM5 Hamiltonian were identified as a result of the high-throughput approach. The system and the data can be redistributed and reused and promote the value of computed data as a primary chemical resource Figure Workflow schematic for conversion of an SDF file to a Mopac input file.

[1]  Jürgen Bajorath,et al.  Combinatorial Preferences Affect Molecular Similarity/Diversity Calculations Using Binary Fingerprints and Tanimoto Coefficients , 2000, J. Chem. Inf. Comput. Sci..

[2]  Henry S. Rzepa,et al.  Chemical Markup, XML, and the Worldwide Web. 1. Basic Principles , 1999, J. Chem. Inf. Comput. Sci..

[3]  MUSTAFA R. HELAL,et al.  Ab Initio calculations of the stabilization energies of the conformational and the structural isomers of C3H7X where X = F, Cl, and Br , 2002, J. Comput. Chem..

[4]  Jürgen Bajorath,et al.  Variability of Molecular Descriptors in Compound Databases Revealed by Shannon Entropy Calculations , 2000, J. Chem. Inf. Comput. Sci..

[5]  Miron Livny,et al.  Condor: a distributed job scheduler , 2001 .

[6]  W. Graham Richards,et al.  Virtual screening using grid computing: the screensaver project , 2002, Nature Reviews Drug Discovery.

[7]  J. Gasteiger,et al.  FROM ATOMS AND BONDS TO THREE-DIMENSIONAL ATOMIC COORDINATES : AUTOMATIC MODEL BUILDERS , 1993 .

[8]  Egon L. Willighagen,et al.  Chemical Markup, XML, and the World Wide Web. 5. Applications of Chemical Metadata in RSS Aggregators , 2004, J. Chem. Inf. Model..

[9]  Henry S. Rzepa,et al.  Chemical Markup, XML, and the World Wide Web. 4. CML Schema , 2003, J. Chem. Inf. Comput. Sci..

[10]  Sean Martin,et al.  Globally distributed object identification for biological knowledgebases , 2004, Briefings Bioinform..

[11]  Bernd Beck,et al.  Enhanced 3D-Databases: A Fully Electrostatic Database of AM1-Optimized Structures , 1998, J. Chem. Inf. Comput. Sci..

[12]  A Kibble American Chemical Society--220th National Meeting. Division of medicinal chemistry--selected symposia. 20-24 August 2000, Washington DC, USA. , 2000, IDrugs : the investigational drugs journal.

[13]  Simon M. Tyrrell,et al.  Representation and use of chemistry in the global electronic age. , 2004, Organic & biomolecular chemistry.

[14]  Luc Moreau,et al.  The semantic smart laboratory: a system for supporting the chemical eScientist. , 2004, Organic & biomolecular chemistry.

[15]  Miron Livny,et al.  Condor and the Grid , 2003 .

[16]  Tim Berners-Lee,et al.  Publishing on the semantic web , 2001, Nature.

[17]  Francine Berman,et al.  Grid Computing: Making the Global Infrastructure a Reality , 2003 .