Incorporating Association Patterns into Manifold Clustering for Enabling Predictive Analytics

The goal of this research is to develop a predictive analytics technique based on manifold clustering of mixed data type. In this research, we explore the concept of statistically significant association patterns to induce an initial partition on data for deriving manifolds. Manifolds are hyperplanes embedded in low dimensions. The advantage of this novel technique is a bootstrap on data clusters that reveals statistical associations from the information-theoretic perspective. As an illustration, the proposed technique is applied to a real data set of diabetes patients. An assessment on the proposed technique is performed to investigate the effect of bootstrap based on association patterns. Results of the preliminary study demonstrate the feasibility of applying the proposed technique to real-world data.

[1]  Bon Sy SEM Approach for TPB: Application to Digital Health Software and Self-Health Management , 2017, 2017 International Conference on Computational Science and Computational Intelligence (CSCI).

[2]  Brandon M. Greenwell,et al.  Hands-On Machine Learning with R , 2019 .

[3]  Jacob Goldberger,et al.  An unsupervised data projection that preserves the cluster structure , 2012, Pattern Recognit. Lett..

[4]  Bon K. Sy,et al.  Information-Statistical Data Mining: Warehouse Integration With Examples of Oracle Basics (The Kluwer International Series in Engineering and Computer Science, 757) , 2003 .

[5]  M. Little,et al.  DIABETES SELF-EFFICACY QUESTIONNAIRE (DSEQ) OUTCOME MEASUREMENT FOR DIABETES EDUCATION , 2004 .

[6]  Howard B. Lee,et al.  A K-Means Cluster Analysis Computer Program With Cross-Tabulations and Next-Nearest-Neighbor Analysis , 1980 .

[7]  Stefan Grünewald,et al.  Structured sparse K-means clustering via Laplacian smoothing , 2018, Pattern Recognit. Lett..

[8]  Evgueni A. Haroutunian,et al.  Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[9]  James T. Kwok,et al.  Clustered Nyström Method for Large Scale Manifold Learning and Dimension Reduction , 2010, IEEE Transactions on Neural Networks.

[10]  Fei Yan,et al.  Robust Dimension Reduction for Clustering With Local Adaptive Learning , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Shuzhi Sam Ge,et al.  Geometrically local embedding in manifolds for dimension reduction , 2012, Pattern Recognit..

[12]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.