Implementation of the objective clustering inductive technology based on DBSCAN clustering algorithm

The paper presents the results of research on the practical implementation of the DBSCAN clustering algorithm within the framework of the objective clustering inductive technology. The experimental data were the Aggregation and Compound datasets of the School of Computing of the University of Eastern Finland and the gene expression sequences of lung cancer patients from the ArrayExpress database. The architecture of the objective clustering model has been developed. The model is implemented by clustering the data in parallel on two equal-power subsets that contain the same number of pairwise similar objects. The parameters of the algorithm are chosen by the minimum value of the external clustering quality criterion, which is calculated as the normalized difference of the internal clustering quality criteria for the two subsets.
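The selection scheme described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation, and several pieces are assumptions made only for illustration: the greedy nearest-pair split into subsets A and B, the internal criterion (mean distance from clustered points to their cluster centroid), the exact normalization |QA - QB| / (QA + QB), and all function names (split_into_similar_halves, internal_criterion, external_criterion, select_eps). Only the neighbourhood radius eps is searched here; min_pts is held fixed.

```python
import math
from itertools import combinations

def split_into_similar_halves(points):
    """Assumed split: greedily pair nearest objects, one of each pair goes to A, the other to B."""
    pairs = sorted(combinations(range(len(points)), 2),
                   key=lambda ij: math.dist(points[ij[0]], points[ij[1]]))
    used, a, b = set(), [], []
    for i, j in pairs:
        if i not in used and j not in used:
            used.update((i, j))
            a.append(points[i])
            b.append(points[j])
    return a, b

def dbscan(points, eps, min_pts):
    """Plain DBSCAN; returns one label per point, -1 marks noise."""
    labels = [None] * len(points)

    def neighbours(i):
        # The neighbourhood includes the point itself.
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbours(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise, may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbours(j)
            if len(reachable) >= min_pts:  # j is a core point, keep expanding
                queue.extend(reachable)
    return labels

def internal_criterion(points, labels):
    """Assumed internal criterion: mean distance of clustered points to their centroid."""
    clusters = {}
    for p, lab in zip(points, labels):
        if lab != -1:
            clusters.setdefault(lab, []).append(p)
    if not clusters:
        return None  # everything is noise, the criterion is undefined
    total, count = 0.0, 0
    for members in clusters.values():
        centroid = [sum(c) / len(members) for c in zip(*members)]
        total += sum(math.dist(p, centroid) for p in members)
        count += len(members)
    return total / count

def external_criterion(qa, qb):
    """Normalized difference of the internal criteria of the two subsets (assumed form)."""
    return 0.0 if qa + qb == 0 else abs(qa - qb) / (qa + qb)

def select_eps(points, eps_grid, min_pts):
    """Return (eps, score) with the smallest external criterion, or None if none is valid."""
    a, b = split_into_similar_halves(points)
    best = None
    for eps in eps_grid:
        qa = internal_criterion(a, dbscan(a, eps, min_pts))
        qb = internal_criterion(b, dbscan(b, eps, min_pts))
        if qa is None or qb is None:
            continue
        score = external_criterion(qa, qb)
        if best is None or score < best[1]:
            best = (eps, score)
    return best
```

A call such as select_eps(points, [0.5, 1.0, 1.5], min_pts=2) on a list of coordinate tuples returns the candidate radius whose clusterings of the two subsets agree most closely under the assumed criterion. The two clusterings are independent, so on larger data they could be run in parallel, as the abstract describes.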
