Organ analysis and classification using principal component and linear discriminant analysis

Texture analysis and classification of soft tissues in Computed Tomography (CT) images recently advanced with a new approach that disambiguates the checkboard problem where two distinctly different patterns produce identical co-occurrence matrices, but this method quadruples the size of the feature space. The feature space size problem is exacerbated by the use of varying sized texture operators for improving boundary segmentation. Dimensionality reduction motivates this investigation into systematic analysis of the power of feature categories (Haralick descriptors, distance, and direction) to differentiate between soft tissues. The within-organ variance explained by the individual components of feature categories offers a ranking of their potential power for between-organ discrimination. This paper introduces a technique for combining the Principal Component Analysis (PCA) results to compare and visualize the explanatory power of features with varying window sizes. We found that 1) the two Haralick features Cluster Tendency and Contrast contribute the most; 2) as distance increases, its contribution to overall variance decreases; and 3) direction is unimportant. We also evaluated the proposed technique with respect to its classification power. Linear Discriminant Analysis (LDA) and Decision Tree (DT) were used to produce two classification models based on the reduced data set. We found that using PCA either fails to improve or markedly degrades the classification performance of LDA as well as of the DT model. Though feature extraction for classification shows no promise, the proposed technique offers a systematic mechanism to compare feature reduction strategies for varying window sizes as well as other measurement techniques.