Research on optimized R-Tree high-dimensional indexing method based on video features

In order to solve the dimension disaster problem of Video high dimensional feature, a new indexing method is proposed: PKSR-Tree index. PKSR-Tree index first uses the principal component analysis to reduce the dimensionality of the high-dimensional feature data, reducing the dimension of the disaster impact and making the distribution of data homogeneous. The feature data after dimensionality reduction are divided by k-means clustering. It reduces the error of K-means algorithm and solves the shortcomings of the K-means algorithm which is susceptible to noise and unable to find data other than spherical shape. The SR-Tree index is established for each cluster, which reduces the problem of multi-path query. The searching process makes use of clustering partition to reduce the scope of candidate results, so that the searching range can be quickly filtered. Experiments show that PKSR-Tree index has obvious advantages in the efficiency and accuracy of video high-dimensional feature data.

[1]  Miao Duo An Improved KNN Algorithm Based on Variable Precision Rough Sets , 2012 .

[2]  A. Guttman,et al.  A Dynamic Index Structure for Spatial Searching , 1984, SIGMOD 1984.

[3]  Zhang Chun-ping,et al.  Research on K-means Clustering Algorithm , 2011 .

[4]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[5]  Hanan Samet,et al.  Index-driven similarity search in metric spaces (Survey Article) , 2003, TODS.

[6]  Omar Elbeqqali,et al.  Normed principal components analysis: A new approach to data warehouse fragmentation , 2013, 2013 ACS International Conference on Computer Systems and Applications (AICCSA).

[7]  David Wai-Lok Cheung,et al.  Clustering Uncertain Data Using Voronoi Diagrams and R-Tree Index , 2010, IEEE Transactions on Knowledge and Data Engineering.

[8]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[9]  Shaojie Tang,et al.  Efficient R-Tree Based Indexing Scheme for Server-Centric Cloud Storage System , 2016, IEEE Transactions on Knowledge and Data Engineering.

[10]  Dai Jia MapReduce Based Fast kNN Join , 2015 .