Detecting Clinically Meaningful Shape Clusters in Medical Image Data: Metrics Analysis for Hierarchical Clustering Applied to Healthy and Pathological Aortic Arches

Objective: Today's growing medical image databases call for novel processing tools to structure the bulk of data and extract clinically relevant information. Unsupervised hierarchical clustering may reveal clusters within anatomical shape data of patient populations as required for modern precision medicine strategies. Few studies have applied hierarchical clustering techniques to three-dimensional patient shape data and results depend heavily on the chosen clustering distance metrics and linkage functions. In this study, we sought to assess clustering classification performance of various distance/linkage combinations and of different types of input data to obtain clinically meaningful shape clusters. Methods: We present a processing pipeline combining automatic segmentation, statistical shape modeling, and agglomerative hierarchical clustering to automatically subdivide a set of 60 aortic arch anatomical models into healthy controls, two groups affected by congenital heart disease, and their respective subgroups as defined by clinical diagnosis. Results were compared with traditional morphometrics and principal component analysis of shape features. Results: Our pipeline achieved automatic division of input shape data according to primary clinical diagnosis with high F-score (0.902 ± 0.042) and Matthews correlation coefficient (0.851 ± 0.064) using the correlation/weighted distance/linkage combination. Meaningful subgroups within the three patient groups were obtained and benchmark scores for automatic segmentation and classification performance are reported. Conclusion: Clustering results vary depending on the distance/linkage combination used to divide the data. Yet, clinically relevant shape clusters and subgroups could be found with high specificity and low misclassification rates. Significance: Detecting disease-specific clusters within medical image data could improve image-based risk assessment, treatment planning, and medical device development in complex disease.

[1]  Alejandro F Frangi,et al.  Precision Imaging: more descriptive, predictive and integrative imaging , 2016, Medical Image Anal..

[2]  Maxime Sermesant,et al.  A statistical shape modelling framework to extract 3D shape biomarkers from medical imaging data: assessing arch morphology of repaired coarctation of the aorta , 2016, BMC Medical Imaging.

[3]  Bilwaj Gaonkar,et al.  CHIMERA: Clustering of Heterogeneous Disease Effects via Distribution Matching of Imaging Patterns , 2016, IEEE Transactions on Medical Imaging.

[4]  Ninon Burgos,et al.  Voxelwise atlas rating for computer assisted diagnosis: Application to congenital heart diseases of the great arteries , 2015, Medical Image Anal..

[5]  Jan L. Bruse,et al.  A Non-parametric Statistical Shape Model for Assessment of the Surgically Repaired Aortic Arch in Coarctation of the Aorta: How Normal is Abnormal? , 2015, STACOM@MICCAI.

[6]  Juan J. Cerrolaza,et al.  Automatic multi-resolution shape modeling of multi-organ structures , 2015, Medical Image Anal..

[7]  Avan Suinesiaputra,et al.  Challenges of Cardiac Image Analysis in Large-Scale Population-Based Studies , 2015, Current Cardiology Reports.

[8]  Ingrid Daubechies,et al.  A New Fully Automated Approach for Aligning and Comparing Shapes , 2015, Anatomical record.

[9]  Guido Gerig,et al.  Morphometry of anatomical shape complexes with dense deformations and sparse parameters , 2014, NeuroImage.

[10]  Giovanni Biglino,et al.  3D morphometric analysis of the arterial switch operation using in vivo MRI data , 2014, Clinical anatomy.

[11]  Alexey Tsymbal,et al.  Towards cloud-based image-integrated similarity search in big data , 2014, IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI).

[12]  N. Pierrat,et al.  Comparison of organs' shapes with geometric and Zernike 3D moments , 2013, Comput. Methods Programs Biomed..

[13]  Xiahai Zhuang,et al.  Challenges and methodologies of fully automatic whole heart segmentation: a review. , 2013, Journal of healthcare engineering.

[14]  Sébastien Ourselin,et al.  Multi-atlas Propagation Whole Heart Segmentation from MRI and CTA Using a Local Normalised Correlation Coefficient Criterion , 2013, FIMH.

[15]  Berthold Langguth,et al.  Cluster analysis for identifying sub-types of tinnitus: A positron emission tomography and voxel-based morphometry study , 2012, Brain Research.

[16]  N Hosten,et al.  Population Imaging as Valuable Tool for Personalized Medicine , 2012, Clinical pharmacology and therapeutics.

[17]  Helko Lehmann,et al.  Automatic Multi-model-Based Segmentation of the Left Atrium in Cardiac MRI Scans , 2012, MICCAI.

[18]  Hervé Delingette,et al.  A Statistical Model for Quantification and Prediction of Cardiac Remodelling: Application to Tetralogy of Fallot , 2011, IEEE Transactions on Medical Imaging.

[19]  Alain Trouvé,et al.  Statistical models of sets of curves and surfaces based on currents , 2009, Medical Image Anal..

[20]  Marcel Brun,et al.  Clustering Algorithms: On Learning, Validation, Performance, and Applications to Genomics , 2009, Current genomics.

[21]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[22]  Milan Sonka,et al.  Congenital aortic disease: 4D magnetic resonance segmentation and quantitative analysis , 2009, Medical Image Anal..

[23]  David A. Steinman,et al.  A Framework for Geometric Analysis of Vascular Structures: Application to Cerebral Aneurysms , 2009, IEEE Transactions on Medical Imaging.

[24]  David A. Steinman,et al.  An image-based modeling framework for patient-specific computational hemodynamics , 2008, Medical & Biological Engineering & Computing.

[25]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[26]  Edward R. Dougherty,et al.  Model-based evaluation of clustering validation measures , 2007, Pattern Recognit..

[27]  Guido Gerig,et al.  User-guided 3D active contour segmentation of anatomical structures: Significantly improved efficiency and reliability , 2006, NeuroImage.

[28]  Joan Alexis Glaunès,et al.  Surface Matching via Currents , 2005, IPMI.

[29]  Shantanu H. Joshi,et al.  Statistical shape analysis: clustering, learning, and testing , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Alain Trouvé,et al.  Computing Large Deformation Metric Mappings via Geodesic Flows of Diffeomorphisms , 2005, International Journal of Computer Vision.

[31]  Phalla Ou,et al.  Late systemic hypertension and aortic arch geometry after successful repair of coarctation of the aorta. , 2004, European heart journal.

[32]  P. Gemperline,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[33]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[34]  Ka Yee Yeung,et al.  Principal component analysis for clustering gene expression data , 2001, Bioinform..

[35]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[36]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[37]  William Schroeder,et al.  The Visualization Toolkit: An Object-Oriented Approach to 3-D Graphics , 1997 .

[38]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Fionn Murtagh,et al.  A Survey of Recent Advances in Hierarchical Clustering Algorithms , 1983, Comput. J..

[40]  A. Jatene,et al.  Anatomic correction of transposition of the great arteries , 1982 .

[41]  Anil K. Jain,et al.  Clustering techniques: The user's dilemma , 1976, Pattern Recognit..

[42]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[43]  Daniel Rueckert,et al.  Right ventricle segmentation from cardiac MRI: A collation study , 2015, Medical Image Anal..

[44]  Fionn Murtagh,et al.  Algorithms for hierarchical clustering: an overview , 2012, WIREs Data Mining Knowl. Discov..

[45]  Divakar Singh,et al.  Performance Evaluation of K-Means and Heirarichal Clustering in Terms of Accuracy and Running Time , 2012 .

[46]  C. C. Law,et al.  ParaView: An End-User Tool for Large-Data Visualization , 2005, The Visualization Handbook.

[47]  Roberto Marcondes Cesar Junior,et al.  Inference from Clustering with Application to Gene-Expression Microarrays , 2002, J. Comput. Biol..

[48]  Adrienne Lutovsky,et al.  For more information: , 2010 .