New clustering algorithms for the support vector machine based hierarchical classification

This study presents two new clustering algorithms for partition of data samples for the support vector machine (SVM) based hierarchical classification. A divisive (top-down) approach is considered in which a set of classes is automatically separated into two smaller groups at each node of the hierarchy. The first algorithm splits the data samples based on a variation of the normalized cuts (NCuts) clustering algorithm wherein the weights of adjacency matrix are modified to utilize class membership in the process. The second algorithm also uses the NCuts clustering; however, it considers the involved classes rather than the individual data samples. It uses the minimum distances between the convex hulls of classes as a distance measure for determining the weights of the graph. Splits are determined for both algorithms based on the eigenvector corresponding to the second smallest eigenvalue of a Laplacian matrix, and it is observed that the proposed algorithms generate well-separated and well-balanced clusters. Unlike other clustering methods used for this purpose, the methods in the present study are found to be more suitable when SVMs are used as base classifiers. As demonstrated in the experiments, the proposed clustering algorithms are integrated into the hierarchical SVM classifiers, which results in significantly improved testing times with a negligible decrease in classification accuracies as compared to the traditional multi-class SVMs.

[1]  Liang-Tien Chia,et al.  Adaptive hierarchical multi-class SVM classifier for texture-based image classification , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[2]  Cordelia Schmid,et al.  Constructing Category Hierarchies for Visual Recognition , 2008, ECCV.

[3]  David Casasent,et al.  A hierarchical classifier using new support vector machine , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[4]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[5]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[6]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[7]  Kristin P. Bennett,et al.  Duality and Geometry in SVM Classifiers , 2000, ICML.

[8]  Joydeep Ghosh,et al.  Integrating support vector machines in a hierarchical output space decomposition framework , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[9]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[10]  Jennifer G. Dy,et al.  A hierarchical method for multi-class support vector machines , 2004, ICML.

[11]  A. Rahimi,et al.  Clustering with Normalized Cuts is Clustering with a Hyperplane , 2004 .

[12]  David Casasent,et al.  A hierarchical classifier using new support vector machines for automatic target recognition , 2005, Neural Networks.

[13]  Pietro Perona,et al.  Learning and using taxonomies for fast visual categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[15]  Jay L. Devore,et al.  Probability and statistics for engineering and the sciences , 1982 .

[16]  John G. Taylor,et al.  Vector Machines , 2002, Neural Networks and the Financial Markets.

[17]  Zhigang Liu,et al.  Hierarchical support vector machines , 2005, IGARSS.

[18]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[20]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[21]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[22]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[23]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.