An evaluation of sampling methods for data mining with fuzzy C-means

Using fuzzy c-means as the data-mining tool, this study evaluates how effectively sampling methods produce the knowledge of interest. Effectiveness is measured by the representativeness of the sampled data and by the accuracy and errors obtained when the sampled data sets are subjected to the fuzzy clustering algorithm. Two population data sets from the weld inspection domain were used for the evaluation. Based on the results obtained, a number of observations are made.
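The kind of comparison described above can be illustrated with a minimal sketch. It is not the study's procedure: the abstract does not name the sampling methods or the error measures, so simple random sampling, the `center_error` measure, the parameter defaults, and the synthetic two-cluster data are all assumptions made for illustration. The clustering routine follows the standard fuzzy c-means alternating updates of memberships and centers.

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Standard fuzzy c-means; returns (centers, membership matrix U)."""
    rng = np.random.default_rng(seed)
    # Random initial memberships, one row per point, each row sums to 1.
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** m
        # Centers are membership-weighted means of the data.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)  # floor to avoid division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U

def simple_random_sample(X, fraction, seed=0):
    """Assumed sampling method: simple random sampling without replacement."""
    rng = np.random.default_rng(seed)
    n = max(1, int(round(fraction * X.shape[0])))
    idx = rng.choice(X.shape[0], size=n, replace=False)
    return X[idx]

def center_error(pop_centers, sample_centers):
    """Assumed error measure: mean distance from each population
    center to the nearest center found on the sample."""
    d = np.linalg.norm(
        pop_centers[:, None, :] - sample_centers[None, :, :], axis=2
    )
    return d.min(axis=1).mean()

# Synthetic stand-in for a population data set (not weld inspection data).
rng = np.random.default_rng(42)
population = np.vstack([
    rng.normal(0.0, 1.0, size=(1000, 2)),
    rng.normal(10.0, 1.0, size=(1000, 2)),
])
pop_centers, _ = fuzzy_c_means(population, c=2)
sample = simple_random_sample(population, fraction=0.1, seed=1)
sample_centers, _ = fuzzy_c_means(sample, c=2)
print("center error:", center_error(pop_centers, sample_centers))
```

A smaller center error suggests that clustering the sample recovers roughly the same structure as clustering the full population; repeating the run over several fractions and seeds would give a rough picture of how sample size affects the result.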
