Modified Graph-theoretic Clustering Algorithm for Mining International Linkages of Philippine Higher Education Institutions

Graph-theoretic clustering either uses limited neighborhood or construction of a minimum spanning tree to aid the clustering process. The latter is challenged by the need to identify and consequently eliminate inconsistent edges to achieve final clusters, detect outliers and partition substantially. This work focused on mining the data of the International Linkages of Philippine Higher Education Institutions by employing a modified graph-theoretic clustering algorithm with which the Prim’s Minimum Spanning Tree algorithm was used to construct a minimum spanning tree for the internationalization dataset infusing the properties of a small world network. Such properties are invoked by the computation of local clustering coefficient for the data elements in the limited neighborhood of data points established using the von Neumann Neighborhood. The overall result of the cluster validation using the Silhouette Index with a score of .69 indicates that there is an acceptable structure found in the clustering result – hence, a potential of the modified MST-based clustering algorithm. The Silhouette per cluster with .75 being the least score means that each cluster derived for r=5 by the von Neumann Neighborhood has a strong clustering structure.

[1]  Christian Hennig,et al.  Recovering the number of clusters in data sets with noise features using feature rescaling factors , 2015, Inf. Sci..

[2]  Duoqian Miao,et al.  A graph-theoretical clustering method based on two rounds of minimum spanning trees , 2010, Pattern Recognit..

[3]  Mohamed Benmohammed,et al.  A Partitional Approach for Genomic-Data Clustering Combined with K-Means Algorithm , 2016, 2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES).

[4]  Nei Y. Soma,et al.  On irreversibility of von Neumann additive cellular automata on grids , 2006, Discret. Appl. Math..

[5]  Paul J. Laurienti,et al.  The Ubiquity of Small-World Networks , 2011, Brain Connect..

[6]  Amarjot Singh,et al.  Variance Based Moving K-Means Algorithm , 2017, 2017 IEEE 7th International Advance Computing Conference (IACC).

[7]  Chun-Xia Zhang,et al.  Clustering with Prim's sequential representation of minimum spanning tree , 2014, Appl. Math. Comput..

[8]  J. Knight Internationalization Remodeled: Definition, Approaches, and Rationales , 2004 .

[9]  John D. Kelleher,et al.  An Analysis of the Application of Simplified Silhouette to the Evaluation of k-means Clustering Validity , 2017, MLDM.

[10]  Xiaofei Xu,et al.  Combining Von Neumann Neighborhood Topology with Approximate-Mapping Local Search for ABC-Based Service Composition , 2014, 2014 IEEE International Conference on Services Computing.

[11]  Li-Hsing Yen,et al.  Clustering coefficient of wireless ad hoc networks and the quantity of hidden terminals , 2005, IEEE Communications Letters.

[12]  Niina Päivinen Clustering with a minimum spanning tree of scale-free-like structure , 2005, Pattern Recognit. Lett..

[13]  C. B. Abhilash,et al.  A comparative analysis of data sets using Machine Learning techniques , 2014, 2014 IEEE International Advance Computing Conference (IACC).

[14]  Damla Turgut,et al.  Is the clustering coefficient a measure for fault tolerance in wireless sensor networks? , 2012, 2012 IEEE International Conference on Communications (ICC).

[15]  Joshua Zhexue Huang,et al.  Minimum Spanning Tree Based Classification Model for Massive Data with MapReduce Implementation , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[16]  Xia Li Wang,et al.  A New Fast Minimum Spanning Tree-Based Clustering Technique , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[17]  Kwang-Cheng Chen,et al.  Clustering coefficient analysis in large wireless ad hoc network , 2017, 2017 IEEE/CIC International Conference on Communications in China (ICCC).

[18]  Priyojit Das,et al.  A novel clustering method to identify cell types from single cell transcriptional profiles , 2018 .

[19]  Anbupalam Thalamuthu,et al.  Gene expression Evaluation and comparison of gene clustering methods in microarray analysis , 2006 .

[20]  D. Elsayad,et al.  An improved parallel minimum spanning tree based clustering algorithm for microarrays data analysis , 2012, 2012 8th International Conference on Informatics and Systems (INFOS).

[21]  Pasi Fränti,et al.  Minimum spanning tree based split-and-merge: A hierarchical clustering method , 2011, Inf. Sci..

[22]  Shing I. Chang,et al.  Determination of cluster number in clustering microarray data , 2005, Appl. Math. Comput..

[23]  Diego H. Milone,et al.  A new index for clustering validation with overlapped clusters , 2016, Expert Syst. Appl..

[24]  Hasan Akin,et al.  Three-state von Neumann cellular automata and pattern generation , 2015 .

[25]  Hans de Wit,et al.  Assessing the internationalisation of degree programmes: experiences from a Dutch-Flemish pilot certifying internationalisation , 2013 .

[26]  D. R. Edla,et al.  An improved MST-based clustering for biological data , 2012, 2012 International Conference on Data Science & Engineering (ICDSE).

[27]  Mahesh Motwani,et al.  Survey of clustering algorithms for MANET , 2009, ArXiv.

[28]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[29]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[30]  Ajay Rana,et al.  K-means with Three different Distance Metrics , 2013 .

[31]  Ulrich Teichler,et al.  Internationalisation of higher education: European experiences , 2009 .

[32]  Alfred O. Hero,et al.  Clustering with a new distance measure based on a dual-rooted tree , 2013, Inf. Sci..

[33]  D. Dutschke Campus Internationalization Initiatives and Study Abroad , 2009 .

[34]  Dmitry A. Zaitsev A generalized neighborhood for cellular automata , 2017, Theor. Comput. Sci..