Hierarchical clustering of asymmetric networks

This paper considers networks where relationships between nodes are represented by directed dissimilarities. The goal is to study methods that, based on the dissimilarity structure, output hierarchical clusters, i.e., a family of nested partitions indexed by a connectivity parameter. Our construction of hierarchical clustering methods is built around the concept of admissible methods, which are those that abide by the axioms of value—nodes in a network with two nodes are clustered together at the maximum of the two dissimilarities between them—and transformation—when dissimilarities are reduced, the network may become more clustered but not less. Two particular methods, termed reciprocal and nonreciprocal clustering, are shown to provide upper and lower bounds in the space of admissible methods. Furthermore, alternative clustering methodologies and axioms are considered. In particular, modifying the axiom of value such that clustering in two-node networks occurs at the minimum of the two dissimilarities entails the existence of a unique admissible clustering method. Finally, the developed clustering methods are implemented to analyze the internal migration in the United States.

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  Daniel Müllner,et al.  Modern hierarchical, agglomerative clustering algorithms , 2011, ArXiv.

[3]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Samir Chowdhury,et al.  Hierarchical representations of network data with optimal distortion bounds , 2016, 2016 50th Asilomar Conference on Signals, Systems and Computers.

[5]  Donatella Vicari,et al.  Classification of Asymmetric Proximity Data , 2014, Journal of Classification.

[6]  Shai Ben-David,et al.  A Sober Look at Clustering Stability , 2006, COLT.

[7]  Marina Meila,et al.  Spectral Clustering of Biological Sequence Data , 2005, AAAI.

[8]  David J. Marchette Data Analysis of Asymmetric Structures: Advanced Approaches in Computational Statistics , 2006, Technometrics.

[9]  Naohito Chino,et al.  A BRIEF SURVEY OF ASYMMETRIC MDS AND SOME OPEN PROBLEMS , 2012 .

[10]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[11]  Santiago Segarra,et al.  Axiomatic construction of hierarchical clustering in asymmetric networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12]  L. Hubert Min and max hierarchical clustering using asymmetric similarity measures , 1973 .

[13]  Akinori Okada,et al.  UNIVERSITY ENROLLMENT FLOW AMONG THE JAPANESE PREFECTURES: A Comparison before and after the Joint First Stage Achievement Test by Asymmetric Cluster Analysis , 1996 .

[14]  Shai Ben-David,et al.  Measures of Clustering Quality: A Working Set of Axioms for Clustering , 2008, NIPS.

[15]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[16]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[17]  Robert E. Tarjan An Improved Algorithm for Hierarchical Clustering Using Strong Components , 1983, Inf. Process. Lett..

[18]  D. Burago,et al.  A Course in Metric Geometry , 2001 .

[19]  F. Mémoli,et al.  Multiparameter Hierarchical Clustering Methods , 2010 .

[20]  Facundo Mémoli,et al.  Department of Mathematics , 1894 .

[21]  Michael I. Jordan,et al.  Learning Spectral Clustering , 2003, NIPS.

[22]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Santiago Segarra,et al.  Alternative axiomatic constructions for hierarchical clustering of asymmetric networks , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[25]  Marina Meila,et al.  Clustering by weighted cuts in directed graphs , 2007, SDM.

[26]  Donatella Vicari CLUSKEXT: CLUstering model for SKew-symmetric data including EXTernal information , 2018, Adv. Data Anal. Classif..

[27]  Jon M. Kleinberg,et al.  An Impossibility Theorem for Clustering , 2002, NIPS.

[28]  T. C. Hu Letter to the Editor---The Maximum Capacity Route Problem , 1961 .

[29]  Fionn Murtagh,et al.  Multidimensional clustering algorithms , 1985 .

[30]  Thomas Hofmann,et al.  Semi-supervised Learning on Directed Graphs , 2004, NIPS.

[31]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[32]  Isabelle Guyon,et al.  Clustering: Science or Art? , 2009, ICML Unsupervised and Transfer Learning.

[33]  Santiago Segarra,et al.  Hierarchical Quasi-Clustering Methods for Asymmetric Networks , 2014, ICML.

[34]  Facundo Mémoli,et al.  Classifying Clustering Schemes , 2010, Foundations of Computational Mathematics.

[35]  U. V. Luxburg,et al.  Towards a Statistical Theory of Clustering , 2005 .

[36]  George Karypis,et al.  Hierarchical Clustering Algorithms for Document Datasets , 2005, Data Mining and Knowledge Discovery.

[37]  Naohito Chino,et al.  GEOMETRICAL STRUCTURES OF SOME NON-DISTANCE MODELS FOR ASYMMETRIC MDS , 1993 .

[38]  Reza Bosagh Zadeh,et al.  A Uniqueness Theorem for Clustering , 2009, UAI.

[39]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[40]  Philip Levis,et al.  Achieving single channel, full duplex wireless communication , 2010, MobiCom.

[41]  P B Slater,et al.  A Partial Hierarchical Regionalization of 3140 US Counties on the Basis of 1965–1970 Intercounty Migration , 1984, Environment & planning A.

[42]  Boyd Jp,et al.  Asymmetric clusters of internal migration regions of France , 1980 .

[43]  Paul B. Slater Hierarchical Internal Migration Regions of France , 1976, IEEE Transactions on Systems, Man, and Cybernetics.