Mixture model-based functional discriminant analysis for curve classification

Statistical approaches for Functional Data Analysis concern the paradigm for which the individuals are functions or curves rather than finite dimensional vectors. In this paper, we particularly focus on the modeling and the classification of functional data which are temporal curves presenting regime changes over time. More specifically, we propose a new mixture model-based discriminant analysis approach for functional data using a specific hidden process regression model. Our approach is particularly adapted to both handle the problem of complex-shaped classes of curves, where each class is composed of several sub-classes, and to deal with the regime changes within each homogeneous sub-class. The model explicitly integrates the heterogeneity of each class of curves via a mixture model formulation, and the regime changes within each sub-class through a hidden logistic process. The approach allows therefore for fitting flexible curve-models to each class of complex-shaped curves presenting regime changes through an unsupervised learning scheme, to automatically summarize it into a finite number of homogeneous clusters, each of them is decomposed into several regimes. The model parameters are learned by maximizing the observed-data log-likelihood for each class by using a dedicated expectation-maximization (EM) algorithm. Comparisons on simulated data and real data with alternative approaches, including functional linear discriminant analysis and functional mixture discriminant analysis with polynomial regression mixtures and spline regression mixtures, show that the proposed approach provides better results regarding the discrimination results and significantly improves the curves approximation.

[1]  J. O. Ramsay,et al.  Functional Data Analysis (Springer Series in Statistics) , 1997 .

[2]  Sophie Dabo-Niang,et al.  On the using of modal curves for radar waveforms classification , 2007, Comput. Stat. Data Anal..

[3]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[4]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[5]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[6]  Padhraic Smyth,et al.  Probabilistic curve-aligned clustering and prediction with regression mixture models , 2004 .

[7]  Aurore Delaigle,et al.  Componentwise classification and clustering of functional data , 2012 .

[8]  Faicel Chamroukhi,et al.  Hidden process regression for curve modeling, classification and tracking , 2010 .

[9]  Frédéric Ferraty,et al.  Curves discrimination: a nonparametric functional approach , 2003, Comput. Stat. Data Anal..

[10]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[11]  Gareth M. James,et al.  Functional linear discriminant analysis for irregularly sampled curves , 2001 .

[12]  Allou Samé,et al.  Time series modeling by a regression approach based on a latent process , 2009, Neural Networks.

[13]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[14]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[15]  Allou Samé,et al.  A hidden process regression model for functional data description. Application to curve discrimination , 2010, Neurocomputing.

[16]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[17]  Allou Samé,et al.  Model-based clustering and segmentation of time series with changes in regime , 2011, Adv. Data Anal. Classif..

[18]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[19]  Yves Lechevallier,et al.  Exploratory analysis of functional data via clustering and optimal segmentation , 2010, Neurocomputing.

[20]  R. Tibshirani,et al.  Discriminant Analysis by Gaussian Mixtures , 1996 .

[21]  Nizar Bouguila,et al.  High-Dimensional Unsupervised Selection and Estimation of a Finite Generalized Dirichlet Mixture Model Based on Minimum Message Length , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  P. Deb Finite Mixture Models , 2008 .