Density estimation using KNN and a potential model

Density-based clustering methods are usually more adaptive than other classical methods in that they can identify clusters of various shapes and can handle noisy data. A novel density estimation method is proposed using both the knearest neighbor (KNN) graph and a hypothetical potential field of the data points to capture the local and global data distribution information respectively. An initial density score computed using KNN is used as the mass of the data point in computing the potential values. Then the computed potential is used as the new density estimation, from which the final clustering result is derived. All the parameters used in the proposed method are determined from the input data automatically. The new clustering method is evaluated by comparing with K-means++, DBSCAN, and CSPV. The experimental results show that the proposed method can determine the number of clusters automatically while producing competitive clustering results compared to the other three methods.