MolDIA: XML based system of molecular diversity analysis towards virtual screening and QSPR

In this paper a new chemoinformatics tool for Molecular Diversity Analysis (MolDIA) is introduced. The objective of this system is the analysis of molecular similarity and diversity through the treatment of structural and physicochemical information. Current needs for chemical databases include the analysis, the management and the retrieval of chemical information. The implementation of eXtended Markup Languages (XML) is proposed as a basis for representing and structuring the chemical information contained in data structures and databases. The adequate descriptor vector and related physicochemical properties have been defined and constructed. The benefits of XML in chemoinformatics are discussed, as well as, the applications of this system in a virtual screening environment. †Presented at CMTPI 2005: Computational Methods in Toxicology and Pharmacology Integrating Internet Resources (Shanghai, China, October 29–November 1, 2005).

[1]  K. Shadan,et al.  Available online: , 2012 .

[2]  Henry S. Rzepa,et al.  Chemical Markup, XML, and the World-Wide Web. 3. Toward a Signed Semantic Chemical Web of Trust , 2001, J. Chem. Inf. Comput. Sci..

[3]  Robert E. Stobaugh Chemical Abstracts Service Chemical Registry System. 11. Substance-related statistics: update and additions , 1988, J. Chem. Inf. Comput. Sci..

[4]  I. V. Shevera Synchronized multimedia integration language , 2004 .

[5]  Jun Xu,et al.  Drug-like Index: A New Approach To Measure Drug-like Compounds and Their Diversity , 2000, J. Chem. Inf. Comput. Sci..

[6]  Peter Willett,et al.  Effect of standardization on fragment‐based measures of structural similarity , 1993 .

[7]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[8]  Peter Ertl,et al.  Cheminformatics Analysis of Organic Substituents: Identification of the Most Common Substituents, Calculation of Substituent Properties, and Automatic Identification of Drug-like Bioisosteric Groups , 2003, J. Chem. Inf. Comput. Sci..

[9]  Michael J.A. Williams,et al.  Combinatorial chemistry and molecular diversity in drug discovery , 1998 .

[10]  C. Moallemi,et al.  Quantized surface complementarity diversity (QSCD): a model based on small molecule-target complementarity. , 2000, Journal of medicinal chemistry.

[11]  Elliotte Rusty Harold XML Bible , 1999 .

[12]  Henry S. Rzepa,et al.  Chemical Markup, XML, and the World Wide Web. 4. CML Schema , 2003, J. Chem. Inf. Comput. Sci..

[13]  B. Fan,et al.  Molecular similarity and diversity in chemoinformatics: From theory to applications , 2006, Molecular Diversity.

[14]  Henry S. Rzepa,et al.  Chemical Markup, XML, and the Worldwide Web. 1. Basic Principles , 1999, J. Chem. Inf. Comput. Sci..

[15]  G. Makara,et al.  Measuring molecular similarity and diversity: total pharmacophore diversity. , 2001, Journal of medicinal chemistry.

[16]  G. P. Moss,et al.  Glossary of class names of organic compounds and reactivity intermediates based on structure (IUPAC Recommendations 1995) , 1995 .

[17]  E. Gordon,et al.  Combinatorial chemistry and molecular diversity in drug discovery , 1998 .

[18]  Eric J. Martin,et al.  Sensitivity Analysis and Other Improvements to Tailored Combinatorial Library Design , 2000, J. Chem. Inf. Comput. Sci..

[19]  Alexander M. Moore,et al.  A Line-Formula Chemical Notation. , 1955 .

[20]  K. M. Smith,et al.  Novel software tools for chemical diversity , 1998 .

[21]  Michel Petitjean,et al.  Using XML for structuring the chemical information: Towards a chemical knowledge representation , 2005 .

[22]  Jianhua Yao,et al.  SIRS-SS: A System for Simulating IR/Raman Spectra. 1. Substructure/Subspectrum Correlation , 2001, J. Chem. Inf. Comput. Sci..

[23]  Henry S. Rzepa,et al.  Chemical Markup, XML and the World-Wide Web. 2. Information Objects and the CMLDOM , 2001, J. Chem. Inf. Comput. Sci..

[24]  Naomie Salim,et al.  Analysis and Display of the Size Dependence of Chemical Similarity Coefficients , 2003, J. Chem. Inf. Comput. Sci..