Paper information - Machine Learning and Data Mining in Pattern Recognition

Clustering is an extensive research area in data science. Its aim is to discover groups and to identify interesting patterns in datasets. Crisp (hard) clustering assumes that each data point belongs to one and only one cluster. This assumption is inadequate when data points may belong to several clusters, as is the case in text categorization, so more flexible clustering is needed. Fuzzy clustering methods, in which each data point can belong to several clusters, are an interesting alternative. However, seeding iterative fuzzy algorithms so that they achieve high-quality clustering remains an issue. In this paper, we propose a new linear and efficient initialization algorithm, MaxMin Linear, to deal with this problem. We then validate our theoretical results through extensive experiments on a variety of numerical real-world and artificial datasets. We also test several validity indices, including a new validity index that we propose, the Transformed Standardized Fuzzy Difference (TSFD).
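
To make the contrast between crisp and fuzzy assignment concrete, the following minimal sketch computes the standard fuzzy c-means membership degrees of one point for a set of fixed cluster centers. It is not the MaxMin Linear initialization or the TSFD index, neither of which is specified in the abstract. The function names, the example centers, and the fuzzifier value m = 2 are assumptions chosen for illustration.

```python
import math


def fuzzy_memberships(point, centers, m=2.0):
    """Standard fuzzy c-means membership of one point in each cluster.

    u_j = 1 / sum_k (d_j / d_k) ** (2 / (m - 1)), where d_j is the
    Euclidean distance to center j and m > 1 is the fuzzifier (assumed 2).
    """
    dists = [math.dist(point, c) for c in centers]
    # A point that coincides with one or more centers is shared equally among them.
    zeros = [d == 0.0 for d in dists]
    if any(zeros):
        n = sum(zeros)
        return [1.0 / n if z else 0.0 for z in zeros]
    exponent = 2.0 / (m - 1.0)
    return [1.0 / sum((dj / dk) ** exponent for dk in dists) for dj in dists]


def crisp_assignment(point, centers):
    """Hard clustering: index of the single nearest center."""
    return min(range(len(centers)), key=lambda j: math.dist(point, centers[j]))


# Hypothetical example: two centers and a point equidistant from both.
centers = [(0.0, 0.0), (4.0, 0.0)]
boundary_point = (2.0, 0.0)
print(crisp_assignment(boundary_point, centers))   # forced into a single cluster
print(fuzzy_memberships(boundary_point, centers))  # membership split across both
```

The memberships of a point always sum to one, so a point lying between two centers keeps a graded degree of belonging to each instead of being forced into a single cluster.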

Maria Petrou | Petra Perner
