Fuzzy Change-Point Algorithms for Regression Models

Change-point (CP) regression models are widely applied in many fields, and detecting the CPs is a central problem. Locating the CPs in a regression model can be viewed as partitioning the data points into clusters of similar individuals. Fuzzy clustering has itself been widely applied, but it has seldom been used to locate CPs in CP regression models. In this paper, we propose a new method, the fuzzy CP (FCP) algorithm, which detects the CPs and simultaneously estimates the parameters of the regression models. The fuzzy c-partition concept is first embedded into the CP regression model: each possible collection of CPs is treated as a partition of the data and is assigned a fuzzy membership. We then transform these memberships into pseudomemberships of the data points in each individual cluster, so that the model parameters can be estimated by the fuzzy c-regressions method. Subsequently, we use fuzzy c-means clustering to obtain new iterates of the CP collection memberships by minimizing an objective function based on the deviations between the predicted response values and the observed data values. We illustrate the new approach with several numerical examples and real datasets. The experimental results show that the proposed FCP algorithm is an effective and useful CP detection method for CP regression models and can be applied in fields such as econometrics, medicine, quality control, and signal processing.
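The alternating scheme described in the abstract can be illustrated with a minimal sketch, assuming a single CP and two straight-line segments. It is a simplified reading of the abstract, not the paper's exact algorithm: the function name `fcp_single_cp`, the fuzzifier `m`, the minimum segment length `min_seg`, and the specific update formulas (weighted least squares for the fuzzy c-regressions step, an inverse-cost rule for the fuzzy c-means step) are all assumptions made for illustration.

```python
import numpy as np

def fcp_single_cp(x, y, m=2.0, n_iter=100, min_seg=2, tol=1e-8, eps=1e-12):
    """Hypothetical FCP-style iteration for ONE change point, two linear segments."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    X = np.column_stack([np.ones(n), x])           # design matrix [1, x]
    # Candidate CPs: k is the last (0-based) index of the first segment.
    cands = np.arange(min_seg - 1, n - min_seg)
    a = np.full(len(cands), 1.0 / len(cands))      # fuzzy memberships of candidates
    # in_first[i, j] = 1 if point i lies in segment 1 under candidate j.
    in_first = (np.arange(n)[:, None] <= cands[None, :]).astype(float)
    betas = []
    for _ in range(n_iter):
        # Pseudomembership of each point in segment 1 (segment 2 gets the rest).
        u1 = np.clip(in_first @ a, 0.0, 1.0)
        betas = []
        for u in (u1, 1.0 - u1):
            # Fuzzy c-regressions step: least squares with weights u**m.
            sw = np.sqrt(u ** m)
            beta, *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
            betas.append(beta)
        r1 = (y - X @ betas[0]) ** 2               # squared residuals, segment-1 fit
        r2 = (y - X @ betas[1]) ** 2               # squared residuals, segment-2 fit
        head = np.cumsum(r1)                       # head[k] = sum of r1[0..k]
        tail = np.cumsum(r2[::-1])[::-1]           # tail[i] = sum of r2[i..n-1]
        d = head[cands] + tail[cands + 1]          # cost of each candidate CP
        # Fuzzy c-means style update: membership proportional to d**(-1/(m-1)).
        inv = (d + eps) ** (-1.0 / (m - 1.0))
        a_new = inv / inv.sum()
        done = np.max(np.abs(a_new - a)) < tol
        a = a_new
        if done:
            break
    return int(cands[np.argmax(a)]), betas, a

# Synthetic data (an assumption for illustration): the regime changes after index 19.
rng = np.random.default_rng(0)
x = np.arange(40, dtype=float)
y = np.where(x < 20, 1.0 + 0.5 * x, -3.0 + 0.4 * x) + rng.normal(0.0, 0.2, size=40)
k_hat, betas, a = fcp_single_cp(x, y)
print("estimated last index of segment 1:", k_hat)
print("segment coefficients (intercept, slope):", betas)
```

In this sketch the memberships `a` live on candidate CP locations rather than on data points, which mirrors the abstract's idea of treating each CP collection as a fuzzy partition. With more than one CP the candidate set would become the set of CP tuples, and the pseudomembership and cost computations would need to be generalized accordingly.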
