New models for the clustering of large databases through a hierarchical paradigm

The recovery of information from large databases based on similarity approach supposes a high computational cost -when the process is carried out comparing each one of the records with the search pattern. If the database records store some data structure representing the information of the problem domain by means of a graph it is possible to classify these records using a hierarchical model which considers the structural basic elements of the graphs and diminishes the computational cost of the recovery process considerably. In this paper we propose a classification model based on structural elements (cycles and chains) for large and medium databases.

[1]  J. Gross,et al.  Graph Theory and Its Applications , 1998 .

[2]  David Riley The Object of Data Abstraction and Structures (Using Java) , 2002 .

[3]  Thomas R. Cundari,et al.  Database Mining Using Soft Computing Techniques. An Integrated Neural Network-Fuzzy Logic-Genetic Algorithm Approach , 2001, J. Chem. Inf. Comput. Sci..

[4]  Mehmed Kantardzic,et al.  Data Mining: Concepts, Models, Methods, and Algorithms , 2002 .

[5]  Irene Luque Ruiz,et al.  Cyclical Conjunction: An Efficient Operator for the Extraction of Cycles from a Graph , 2002, J. Chem. Inf. Comput. Sci..

[6]  Michael F. Lynch,et al.  Computer storage and retrieval of generic chemical structures in patents, 1. Introduction and general strategy , 1981, J. Chem. Inf. Comput. Sci..

[7]  Peter Willett,et al.  Rapid Quantification of Molecular Diversity for Selective Database Acquisition , 1997, J. Chem. Inf. Comput. Sci..

[8]  Irene Luque Ruiz,et al.  Step-by-Step Calculation of All Maximum Common Substructures through a Constraint Satisfaction Based Algorithm , 2004, J. Chem. Inf. Model..

[9]  Irene Luque Ruiz,et al.  Representation of the Molecular Topology of Cyclical Structures by Means of Cycle Graphs. 1. Extraction of Topological Properties , 2004, J. Chem. Inf. Model..

[10]  時實 象一 Computer storage and retrieval of generic chemical structures , 1987 .

[11]  Gerta Rücker,et al.  Computer perception of constitutional (topological) symmetry: TOPSYM, a fast algorithm for partitioning atoms and pairwise relations among atoms into equivalence classes , 1990, J. Chem. Inf. Comput. Sci..

[12]  John M. Barnard,et al.  Chemical Similarity Searching , 1998, J. Chem. Inf. Comput. Sci..

[13]  John M. Barnard,et al.  Clustering Methods and Their Uses in Computational Chemistry , 2003 .

[14]  Irene Luque Ruiz,et al.  Representation of the Molecular Topology of Cyclical Structures by Means of Cycle Graphs. 3. Hierarchical Model of Screening of Chemical Databases , 2004, J. Chem. Inf. Model..

[15]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.