movMF: An R Package for Fitting Mixtures of von Mises-Fisher Distributions

Finite mixtures of von Mises-Fisher distributions allow to apply model-based clustering methods to data which is of standardized length, i.e., all data points lie on the unit sphere. The R package movMF contains functionality to draw samples from finite mixtures of von Mises-Fisher distributions and to fit these models using the expectation-maximization algorithm for maximum likelihood estimation. Special features are the possibility to use sparse matrix representations for the input data, different variants of the expectation-maximization algorithm, different methods for determining the concentration parameters in the M-step and to impose constraints on the concentration parameters over the components. In this paper we describe the main fitting function of the package and illustrate its application. In addition we compare the clustering performance of finite mixtures of von Mises-Fisher distributions to spherical k-means. We also discuss the resolution of several numerical issues which occur for estimating the concentration parameters and for determining the normalizing constant of the von Mises-Fisher distribution.

[1]  Sadaaki Miyamoto,et al.  Spherical k-Means++ Clustering , 2015, MDAI.

[2]  K. Hornik,et al.  Mixtures of von Mises-Fisher Distributions , 2014 .

[3]  Kurt Hornik,et al.  On maximum likelihood estimation of the concentration parameter of von Mises–Fisher distributions , 2013, Comput. Stat..

[4]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[5]  Amos-type bounds for modified Bessel function ratios☆ , 2013, Journal of mathematical analysis and applications.

[6]  Kurt Hornik,et al.  Spherical k-Means Clustering , 2012 .

[7]  Jun Liu,et al.  High-order parameter approximation for von Mises-Fisher distributions , 2012, Appl. Math. Comput..

[8]  Suvrit Sra,et al.  A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of Is(x) , 2012, Comput. Stat..

[9]  S. R. Jammalamadaka,et al.  Directional Statistics, I , 2011 .

[10]  Philipp Hennig,et al.  Using an Infinite Von Mises-Fisher Mixture Model to Cluster Treatment Beam Directions in External Radiation Therapy , 2010, 2010 Ninth International Conference on Machine Learning and Applications.

[11]  Thomas S. Huang,et al.  Generative model-based speaker clustering via mixture of von Mises-Fisher distributions , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12]  F. Leisch,et al.  FlexMix Version 2: Finite Mixtures with Concomitant Variables and Varying and Constant Parameters , 2008 .

[13]  Kurt Hornik,et al.  Text Mining Infrastructure in R , 2008 .

[14]  N. Wicker,et al.  Model-based clustering on the unit sphere with an illustration using gene expression profiles. , 2008, Biostatistics.

[15]  Peter D. Hoff,et al.  Simulation of the Matrix Bingham–von Mises–Fisher Distribution, With Applications to Multivariate and Relational Data , 2007, 0712.4166.

[16]  Shin Ishii,et al.  Parameter estimation for von Mises–Fisher distributions , 2007, Comput. Stat..

[17]  Sylvia Frühwirth-Schnatter,et al.  Finite Mixture and Markov Switching Models , 2006 .

[18]  Baba C. Vemuri,et al.  Segmentation of High Angular Resolution Diffusion MRI Modeled as a Field of von Mises-Fisher Mixtures , 2006, ECCV.

[19]  B. Everitt,et al.  A Handbook of Statistical Analyses using R , 2006 .

[20]  Inderjit S. Dhillon,et al.  Clustering on the Unit Hypersphere using von Mises-Fisher Distributions , 2005, J. Mach. Learn. Res..

[21]  Kurt Hornik,et al.  A CLUE for CLUster Ensembles , 2005 .

[22]  F. Leisch FlexMix: A general framework for finite mixture models and latent class regression in R , 2004 .

[23]  George Karypis,et al.  Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering , 2004, Machine Learning.

[24]  Inderjit S. Dhillon,et al.  Concept Decompositions for Large Sparse Text Data Using Clustering , 2004, Machine Learning.

[25]  I. Dhillon,et al.  Modeling Data using Directional Distributions , 2003 .

[26]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[27]  George Karypis,et al.  CLUTO - A Clustering Toolkit , 2002 .

[28]  William H. Press,et al.  Numerical recipes in C , 2002 .

[29]  Arnold Neumaier,et al.  Introduction to Numerical Analysis , 2001 .

[30]  W. J. Whiten,et al.  Fitting Mixtures of Kent Distributions to Aid in Joint Set Identification , 2001 .

[31]  Inderjit S. Dhillon,et al.  Efficient Clustering of Very Large Document Collections , 2001 .

[32]  G. Karypis,et al.  Criterion Functions for Document Clustering ∗ Experiments and Analysis , 2001 .

[33]  A. Wood Simulation of the von mises fisher distribution , 1994 .

[34]  William J. Cody,et al.  Algorithm 715: SPECFUN–a portable FORTRAN package of special function routines and test drivers , 1993, TOMS.

[35]  G. Celeux,et al.  A Classification EM algorithm for clustering and two stochastic versions , 1992 .

[36]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[37]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[38]  Begnaud Francis Hildebrand,et al.  Introduction to numerical analysis: 2nd edition , 1987 .

[39]  Gary Ulrich,et al.  Computer Generation of Distributions on the M‐Sphere , 1984 .

[40]  H. Simpson,et al.  Some monotonicity results for ratios of modified Bessel functions , 1984 .

[41]  J. Kent The Fisher‐Bingham Distribution on the Sphere , 1982 .

[42]  G. W. Walster,et al.  Further comments on the computation of modified Bessel function ratios , 1980 .

[43]  Walter Gautschi,et al.  On the computation of modified Bessel function ratios , 1978 .

[44]  G. Schou Estimation of the concentration parameter in von Mises–Fisher distributions , 1978 .

[45]  M. Degroot,et al.  Probability and Statistics , 1977 .

[46]  D. E. Amos,et al.  Computation of modified Bessel functions and their ratios , 1974 .

[47]  F. W. J. Olver,et al.  The asymptotic expansion of bessel functions of large order , 1954, Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences.

[48]  L. Milne‐Thomson A Treatise on the Theory of Bessel Functions , 1945, Nature.

[49]  G. Watson Bessel Functions. (Scientific Books: A Treatise on the Theory of Bessel Functions) , 1923 .