Discovery of Glaucoma Progressive Patterns Using Hierarchical MDL-Based Clustering

In this paper, we propose a method to cluster the spacial patterns of the visual field in glaucoma patients to analyze the progression patterns of glaucoma. The degree of progression in the visual field of glaucoma patients can be divided into several regions by straight line boundaries, we call this specific structure Direct Product Structure in this paper. Since we can observe the direct product structure in the visual fields, we propose a bottom-up hierarchical clustering method to embed this structure into the clustering structure. In our method, according to the minimum description length (MDL) principle, we select the best cluster division so that the total code length required for encoding the data as well as the clustering structure is minimum. We can thereby select the clusters that are robust to the noise in the position of the direct product structure for clustering. We demonstrate the effectiveness of our method using an artificial dataset and a real glaucoma dataset. Our proposed method performed better than existing methods for both datasets. For the real glaucoma dataset in particular, our method discovered the characteristic progressive patterns of glaucoma as specific features of clusters. These patterns agree with clinical knowledge. Furthermore, we show that our clusters can be applied to improve the accuracy of predicting glaucoma progression. Thus, our clusters contain rich information of glaucoma, and hence can contribute to further development in glaucoma research.

[1]  Benjamin C. M. Fung,et al.  Hierarchical Document Clustering using Frequent Itemsets , 2003, SDM.

[2]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[3]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[5]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[6]  Ryo Asaoka,et al.  Clustering visual field test points based on rates of progression to improve the prediction of future damage. , 2014, Investigative ophthalmology & visual science.

[7]  D. Garway-Heath,et al.  Mapping the visual field to the optic disc in normal tension glaucoma eyes. , 2000, Ophthalmology.

[8]  Kenji Yamanishi,et al.  Efficient computation of normalized maximum likelihood coding for Gaussian mixtures with its applications to optimal clustering , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[9]  Ryo Asaoka,et al.  Mapping Glaucoma Patients' 30-2 and 10-2 Visual Fields Reveals Clusters of Test Points Damaged in the 10-2 Grid That Are Not Sampled in the Sparse 30-2 Grid , 2014, PloS one.

[10]  Ryo Asaoka,et al.  A new approach to measure visual field progression in glaucoma patients using variational bayes linear regression. , 2014, Investigative ophthalmology & visual science.

[11]  Kai Morino,et al.  Predicting glaucoma progression using multi-task learning with heterogeneous features , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[12]  Jorma Rissanen,et al.  Information and Complexity in Statistical Modeling , 2006, ITW.

[13]  Jorma Rissanen,et al.  Fisher information and stochastic complexity , 1996, IEEE Trans. Inf. Theory.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Stephen J. Roberts,et al.  Variational Mixture of Bayesian Independent Component Analyzers , 2003, Neural Computation.

[16]  Philip S. Yu,et al.  Hierarchical, Parameter-Free Community Discovery , 2008, ECML/PKDD.

[17]  Hujun Bao,et al.  Laplacian Regularized Gaussian Mixture Model for Data Clustering , 2011, IEEE Transactions on Knowledge and Data Engineering.

[18]  Jorma Rissanen,et al.  Stochastic Complexity in Learning , 1995, J. Comput. Syst. Sci..

[19]  Ryota Tomioka,et al.  Quantitative Prediction of Glaucomatous Visual Field Loss from Few Measurements , 2013, 2013 IEEE 13th International Conference on Data Mining.

[20]  Julio Gonzalo,et al.  The SemEval-2007 WePS Evaluation: Establishing a benchmark for the Web People Search Task , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[21]  Christopher Bowd,et al.  Learning From Data: Recognizing Glaucomatous Defect Patterns and Detecting Progression From Visual Field Measurements , 2014, IEEE Transactions on Biomedical Engineering.