New proposals for clustering based on trimming and restrictions

TCLUST is a model-based clustering methodology, which employs trimming and restrictions for getting robust estimators. It is available in the tclust package at the CRAN website and in the FSDA Matlab library. Extensions of TCLUST modelling include clustering around linear subspaces, factor analyzers approaches and fuzzy proposals. Further research has been focused in allowing more flexible models for the components, based on the skew normal distribution. An important issue that may appear within TCLUST is the dependence of the obtained solutions from the input parameters. Therefore, a variety of tools have been developed to assist to the users in choosing these parameters. Theoretical and robustness properties for the TCLUST estimators have been proven, and many empirical evidences show the efficacy of the proposed methodology, in a wide variety of situations.

[1]  Luis Angel García-Escudero,et al.  A reweighting approach to robust clustering , 2017, Statistics and Computing.

[2]  Luis Angel García-Escudero,et al.  The influence function of the TCLUST robust clustering procedure , 2012, Adv. Data Anal. Classif..

[3]  Luis Angel García-Escudero,et al.  The joint role of trimming and constraints in robust estimation for mixtures of Gaussian factor analyzers , 2016, Comput. Stat. Data Anal..

[4]  C. Matr'an,et al.  A general trimming approach to robust Cluster Analysis , 2008, 0806.2976.

[5]  Luis Angel García-Escudero,et al.  A constrained robust proposal for mixture modeling avoiding spurious solutions , 2014, Adv. Data Anal. Classif..

[6]  Luis Angel García-Escudero,et al.  tclust: An R Package for a Trimming Approach to Cluster Analysis , 2012 .

[7]  Luis Angel García-Escudero,et al.  Finding the Number of Normal Groups in Model-Based Clustering via Constrained Likelihoods , 2018 .

[8]  Gérard Govaert,et al.  Gaussian parsimonious clustering models , 1995, Pattern Recognit..

[9]  Luis Angel García-Escudero,et al.  Robust estimation of mixtures of regressions with random covariates, via trimming and constraints , 2017, Stat. Comput..

[10]  Luis Angel García-Escudero,et al.  Exploring the number of groups in robust model-based clustering , 2011, Stat. Comput..

[11]  Luis Angel García-Escudero,et al.  Robust Clustering for Time Series Using Spectral Densities and Functional Data Analysis , 2017, IWANN.

[12]  Ryan P. Browne,et al.  Estimating common principal components in high dimensions , 2013, Advances in Data Analysis and Classification.

[13]  Luis Angel García-Escudero,et al.  Robust estimation of mixture models with skew components via trimming and constraints , 2016 .

[14]  A. Gordaliza,et al.  On the breakdown behavior of the TCLUST clustering procedure , 2013 .

[15]  Luis Angel García-Escudero,et al.  Robust Fuzzy Clustering via Trimming and Constraints , 2016, SMPS.

[16]  Geoffrey J. McLachlan,et al.  Finite mixtures of multivariate skew t-distributions: some recent and new results , 2014, Stat. Comput..

[17]  B. Flury Common Principal Components in k Groups , 1984 .

[18]  Francesca Torti,et al.  FSDA: A MATLAB toolbox for robust analysis and interactive data exploration , 2012 .

[19]  Luis Angel García-Escudero,et al.  Avoiding spurious local maximizers in mixture modeling , 2014, Statistics and Computing.

[20]  Luis Angel García-Escudero,et al.  Robust constrained fuzzy clustering , 2013, Inf. Sci..

[21]  Luis Angel García-Escudero,et al.  Computational Statistics and Data Analysis Robust Clusterwise Linear Regression through Trimming , 2022 .