Fiber Clustering Acceleration With a Modified Kmeans++ Algorithm Using Data Parallelism

Fiber clustering methods are typically used in brain research to study the organization of white matter bundles from large diffusion MRI tractography datasets. These methods enable exploratory bundle inspection using visualization and other methods that require identifying brain white matter structures in individuals or a population. Some applications, such as real-time visualization and inter-subject clustering, need fast and high-quality intra-subject clustering algorithms. This work proposes a parallel algorithm using a General Purpose Graphics Processing Unit (GPGPU) for fiber clustering based on the FFClust algorithm. The proposed GPGPU implementation exploits data parallelism using both multicore and GPU fine-grained parallelism present in commodity architectures, including current laptops and desktop computers. Our approach implements all FFClust steps in parallel, improving execution times in all of them. In addition, our parallel approach includes a parallel Kmeans++ algorithm implementation and defines a new variant of Kmeans++ to reduce the impact of choosing outliers as initial centroids. The results show that our approach provides clustering quality results very similar to FFClust, and it requires an execution time of 3.5 s for processing about a million fibers, achieving a speedup of 11.5 times compared to FFClust.

[1]  Pamela Guevara,et al.  Inter-Subject Clustering of Brain Fibers from Whole-Brain Tractography , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[2]  Rafael Sachetto Oliveira,et al.  G-DBSCAN: A GPU Accelerated Algorithm for Density-based Clustering , 2013, ICCS.

[3]  Bahriye Akay,et al.  A survey and systematic categorization of parallel K-means and fuzzy-c-means algorithms , 2019, Comput. Syst. Sci. Eng..

[4]  Bart M. ter Haar Romeny,et al.  Fused DTI/HARDI Visualization , 2011, IEEE Transactions on Visualization and Computer Graphics.

[5]  C. Poupon,et al.  iFiber: A brain tract visualizer for Android devices , 2015, 2015 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON).

[6]  Maxime Descoteaux,et al.  Recognition of white matter bundles using local and global streamline-based registration and clustering , 2017, NeuroImage.

[7]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[8]  Yogesh Rathi,et al.  An anatomically curated fiber clustering white matter atlas for consistent white matter tract parcellation across the lifespan , 2018, NeuroImage.

[9]  R. Deriche,et al.  Regularized, fast, and robust analytical Q‐ball imaging , 2007, Magnetic resonance in medicine.

[10]  Carl-Fredrik Westin,et al.  Automatic Tractography Segmentation Using a High-Dimensional White Matter Atlas , 2007, IEEE Transactions on Medical Imaging.

[11]  Jean-Francois Mangin,et al.  Automatic fiber bundle segmentation in massive tractography datasets using a multi-subject bundle atlas , 2012, NeuroImage.

[12]  Jean-Francois Mangin,et al.  Fiber Tracking in q-Ball Fields Using Regularized Particle Trajectories , 2005, IPMI.

[13]  Akira Tanaka,et al.  The worst-case time complexity for generating all maximal cliques and computational experiments , 2006, Theor. Comput. Sci..

[14]  Partha Pratim Talukdar,et al.  ReAl-LiFE: Accelerating the Discovery of Individualized Brain Connectomes on GPUs , 2019, AAAI.

[15]  Jean-Francois Mangin,et al.  An Example-Based Multi-Atlas Approach to Automatic Labeling of White Matter Tracts , 2015, PloS one.

[16]  J. R. Reichenbach,et al.  GPGPU-Computing for the cluster analysis of fiber tracts : Replacing a $ 15000 high end PC with a $ 500 graphics card , 2010 .

[17]  Anna Vilanova,et al.  CUDA-Accelerated Geodesic Ray-Tracing for Fiber Tracking , 2011, Int. J. Biomed. Imaging.

[18]  Guy B. Williams,et al.  QuickBundles, a Method for Tractography Simplification , 2012, Front. Neurosci..

[19]  D. Sculley,et al.  Web-scale k-means clustering , 2010, WWW '10.

[20]  Frédéric Cazals,et al.  A note on the problem of reporting maximal cliques , 2008, Theor. Comput. Sci..

[21]  Jean-Francois Mangin,et al.  Clustering of Whole-Brain White Matter Short Association Bundles Using HARDI Data , 2017, Front. Neuroinform..

[22]  Jean-Francois Mangin,et al.  FFClust: Fast fiber clustering for large tractography datasets for a detailed study of brain connectivity , 2020, NeuroImage.

[23]  C. Westin,et al.  Automated white matter fiber tract identification in patients with brain tumors , 2016, NeuroImage: Clinical.

[24]  Yasmine Lamari,et al.  A survey on parallel clustering algorithms for Big Data , 2020, Artificial Intelligence Review.

[25]  Sanjay Ghemawat,et al.  MapReduce: simplified data processing on large clusters , 2008, CACM.

[26]  Jean-Francois Mangin,et al.  Parallel Optimization of Fiber Bundle Segmentation for Massive Tractography Datasets , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[27]  Omar Bouattane,et al.  New optimized GPU version of the k-means algorithm for large-sized image segmentation , 2017, 2017 Intelligent Systems and Computer Vision (ISCV).

[28]  Zhuo Tang,et al.  GPU-Accelerated Parallel Hierarchical Extreme Learning Machine on Flink for Big Data , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[29]  Maxime Descoteaux,et al.  Dipy, a library for the analysis of diffusion MRI data , 2014, Front. Neuroinform..

[30]  Salvatore Cuomo,et al.  A GPU-accelerated parallel K-means algorithm , 2017, Comput. Electr. Eng..

[31]  Maxime Descoteaux,et al.  Robust clustering of massive tractography datasets , 2011, NeuroImage.

[32]  Nathan Bell,et al.  Thrust: A Productivity-Oriented Library for CUDA , 2012 .

[33]  Alfred Anwander,et al.  A hierarchical method for whole‐brain connectivity‐based parcellation , 2014, Human brain mapping.

[34]  Angelo Bifone,et al.  Automated multi-subject fiber clustering of mouse brain using dominant sets , 2015, Front. Neuroinform..

[35]  Stephen M. Smith,et al.  Using GPUs to accelerate computational diffusion MRI: From microstructure estimation to tractography and connectomes , 2018, NeuroImage.

[36]  Yingjie Tian,et al.  A Comprehensive Survey of Clustering Algorithms , 2015, Annals of Data Science.

[37]  Christian O'Reilly,et al.  Visbrain: A Multi-Purpose GPU-Accelerated Open-Source Suite for Multimodal Brain Data Visualization , 2019, Front. Neuroinform..

[38]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[39]  Jean-Francois Mangin,et al.  Reproducibility of superficial white matter tracts using diffusion-weighted imaging tractography , 2017, NeuroImage.

[40]  Fan Zhang,et al.  TRAKO: Efficient Transmission of Tractography Data for Visualization , 2020, MICCAI.

[41]  Jean-Francois Mangin,et al.  Fast Automatic Segmentation of White Matter Streamlines Based on a Multi-Subject Bundle Atlas , 2016, Neuroinformatics.

[42]  Stephen T. C. Wong,et al.  A hybrid approach to automatic clustering of white matter fibers , 2010, NeuroImage.

[43]  Y. Cointepas,et al.  Accurate tractography propagation mask using T 1-weighted data rather than FA , 2010 .

[44]  Bruce Fischl,et al.  AnatomiCuts: Hierarchical clustering of tractography streamlines based on anatomical similarity , 2016, NeuroImage.

[45]  Anan Banharnsakun,et al.  A MapReduce-based artificial bee colony for large-scale data clustering , 2017, Pattern Recognit. Lett..

[46]  Paul M. Thompson,et al.  Automatic clustering of white matter fibers in brain diffusion MRI with an application to genetics , 2014, NeuroImage.