fastcluster: Fast Hierarchical, Agglomerative Clustering Routines for R and Python

The fastcluster package is a C++ library for hierarchical, agglomerative clustering. It provides a fast implementation of the most efficient, current algorithms when the input is a dissimilarity index. Moreover, it features memory-saving routines for hierarchical clustering of vector data. It improves both asymptotic time complexity (in most cases) and practical performance (in all cases) compared to the existing implementations in standard software: several R packages, MATLAB, Mathematica, Python with SciPy.

[1]  Jean Cardinal,et al.  Lazy Algorithms for Dynamic Closest Pair with Arbitary Distance Measures , 2004, ALENEX/ANALC.

[2]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[3]  H. Edelsbrunner,et al.  Efficient algorithms for agglomerative hierarchical clustering methods , 1984 .

[4]  Brian Everitt,et al.  Cluster analysis , 1974 .

[5]  J. Gower,et al.  Minimum Spanning Trees and Single Linkage Cluster Analysis , 1969 .

[6]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[7]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[8]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[9]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[10]  Robin Sibson,et al.  SLINK: An Optimally Efficient Algorithm for the Single-Link Cluster Method , 1973, Comput. J..

[11]  B. Jaumard,et al.  Cluster Analysis and Mathematical Programming , 2003 .

[12]  Ronald L. Rivest,et al.  Introduction to Algorithms, 3rd Edition , 2009 .

[13]  Daniel Müllner,et al.  Modern hierarchical, agglomerative clustering algorithms , 2011, ArXiv.

[14]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[15]  Guido van Rossum,et al.  Python Programming Language , 2007, USENIX Annual Technical Conference.

[16]  David Eppstein,et al.  Fast hierarchical clustering and other applications of dynamic closest pairs , 1999, SODA '98.

[17]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[18]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .