Fuzzy Clusterwise Generalized Structured Component Analysis

Generalized Structured Component Analysis (GSCA) was recently introduced by Hwang and Takane (2004) as a component-based approach to path analysis with latent variables. The parameters of GSCA are estimated by pooling data across respondents under the implicit assumption that they all come from a single, homogenous group. However, as has been empirically demonstrated by various researchers across a number of areas of inquiry, such aggregate analyses can often mask the true structure in data when respondent heterogeneity is present. In this paper, GSCA is generalized to a fuzzy clustering framework so as to account for potential group-level respondent heterogeneity. An alternating least-squares procedure is developed and technically described for parameter estimation. A small-scale Monte Carlo study involving synthetic data is carried out to compare the performance between the proposed method and an extant approach. In addition, an empirical application concerning alcohol use among adolescents from US northwestern urban areas is presented to illustrate the usefulness of the proposed method. Finally, a number of directions for future research are provided.

[1]  A. Goldberger,et al.  Structural Equation Models in the Social Sciences. , 1974 .

[2]  P. Arabie,et al.  Cluster analysis in marketing research , 1994 .

[3]  Daniel J Bauer,et al.  Distributional assumptions of growth mixture models: implications for overextraction of latent trajectory classes. , 2003, Psychological methods.

[4]  Y. Takane,et al.  Generalized structured component analysis , 2004 .

[5]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[6]  William Meredith,et al.  Latent curve analysis , 1990 .

[7]  P. Arabie,et al.  Overlapping Clustering: A New Method for Product Positioning , 1981 .

[8]  Joop J. Hox,et al.  Applied Multilevel Analysis. , 1995 .

[9]  W. Kamakura,et al.  Modeling Preference and Structural Heterogeneity in Consumer Choice , 1996 .

[10]  K. Jöreskog A General Method for Estimating a Linear Structural Equation System. , 1970 .

[11]  T. Moffitt Adolescence-limited and life-course-persistent antisocial behavior: a developmental taxonomy. , 1993, Psychological review.

[12]  W F Velicer,et al.  Component Analysis versus Common Factor Analysis: Some issues in Selecting an Appropriate Procedure. , 1990, Multivariate behavioral research.

[13]  J. William Ahwood,et al.  CLASSIFICATION , 1931, Foundations of Familiar Language.

[14]  D. G. Weeks,et al.  Linear structural equations with latent variables , 1980 .

[15]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[16]  Forrest W. Young Quantitative analysis of qualitative data , 1981 .

[17]  Daniel S. Nagin,et al.  Analyzing developmental trajectories: A semiparametric, group-based approach , 1999 .

[18]  Kenneth G. Manton,et al.  Statistical applications using fuzzy sets , 1994 .

[19]  S. Mulaik Foundations of Factor Analysis , 1975 .

[20]  P. Deb Finite Mixture Models , 2008 .

[21]  R. P. McDonald,et al.  Some algebraic properties of the Reticular Action Model for moment structures. , 1984, The British journal of mathematical and statistical psychology.

[22]  W. DeSarbo,et al.  Finite-Mixture Structural Equation Models for Response-Based Segmentation and Unobserved Heterogeneity , 1997 .

[23]  M. Wedel,et al.  Market Segmentation: Conceptual and Methodological Foundations , 1997 .

[24]  R. D. Bock,et al.  Analysis of covariance structures , 1966, Psychometrika.

[25]  R. Darrell Bock,et al.  Multilevel analysis of educational data , 1989 .

[26]  K. Jöreskog A general method for analysis of covariance structures , 1970 .

[27]  Alex B. McBratney,et al.  Application of fuzzy sets to climatic classification , 1985 .

[28]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .

[29]  W. DeSarbo,et al.  A maximum likelihood methodology for clusterwise linear regression , 1988 .

[30]  Arnon Karnieli,et al.  Linear mixture model approach for selecting fuzzy exponent value in fuzzy c-means algorithm , 2006, Ecol. Informatics.

[31]  H. Hruschka Market definition and segmentation using fuzzy clustering methods , 1986 .

[32]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[33]  Forrest W. Young,et al.  Additive structure in qualitative data: An alternating least squares method with optimal scaling features , 1976 .

[34]  M. Hill,et al.  Nonlinear Multivariate Analysis. , 1990 .

[35]  R.J. Hathaway,et al.  Switching regression models and fuzzy clustering , 1993, IEEE Trans. Fuzzy Syst..

[36]  W. DeSarbo,et al.  An Empirical Pooling Approach for Estimating Marketing Mix Elasticities with PIMS Data , 1993 .

[37]  H. Goldstein,et al.  Multilevel Models in Educational and Social Research. , 1989 .

[38]  M. Wedel,et al.  A fuzzy clusterwise regression approach to benefit segmentation , 1989 .

[39]  R. Bagozzi A Field Investigation of Causal Relations among Cognitions, Affect, Intentions, and Behavior , 1982 .

[40]  M. Wedel,et al.  A Clusterwise Regression Method for Simultaneous Fuzzy Market Structuring and Benefit Segmentation , 1991 .

[41]  M. Roubens Fuzzy clustering algorithms and their cluster validity , 1982 .

[42]  Yoram Wind,et al.  Issues and Advances in Segmentation Research , 1978 .

[43]  J. Bezdek,et al.  Detection and Characterization of Cluster Substructure II. Fuzzy c-Varieties and Convex Combinations Thereof , 1981 .

[44]  R. MacCallum,et al.  Studying Multivariate Change Using Multilevel Models and Latent Curve Models. , 1997, Multivariate behavioral research.

[45]  Terry E. Duncan,et al.  Latent Variable Modeling of Longitudinal and Multilevel Substance Use Data. , 1997, Multivariate behavioral research.

[46]  S. Mulaik,et al.  Foundations of Factor Analysis , 1975 .

[47]  G. A. Marcoulides,et al.  New developments and techniques in structural equation modeling , 2001 .

[48]  F. Bookstein,et al.  Two Structural Equation Models: LISREL and PLS Applied to Consumer Exit-Voice Theory , 1982 .

[49]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[50]  Heungsun Hwang,et al.  Structural Equation Modeling by Extended Redundancy Analysis , 2002 .

[51]  B. Muthén Latent Variable Mixture Modeling , 2001 .

[52]  R. Bagozzi Advanced Methods of Marketing Research , 1994 .

[53]  L. Tucker A METHOD FOR SYNTHESIS OF FACTOR ANALYSIS STUDIES , 1951 .

[54]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[55]  P. Prescott Estimation of the standard deviation of a normal population from doubly censored samples using normal scores , 1970 .

[56]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[57]  Wynne W. Chin Issues and Opinion on Structural Equation Modeling by , 2009 .

[58]  I. Jolliffe,et al.  Nonlinear Multivariate Analysis , 1992 .

[59]  P. Groenen,et al.  Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima , 1997 .

[60]  Anthony S. Bryk,et al.  Hierarchical Linear Models: Applications and Data Analysis Methods , 1992 .

[61]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .