Identifying Most Influential Observations in Factor Analysis

At the mathematical level, a factor or principal component of a factor analysis is simply a linear combination of variables under some constraints. Therefore, as in regression analysis, there are conditions under which individual or joint observations can be influential in the sense that their presence or absence significantly influences the obtained values of the estimated factor loadings. The nature of these effects as well as potential effects due to “gross errors” in the data set should be investigated in order to determine which observations, if any, need to be analyzed separately or excluded entirely. The purpose of this paper is (1) to propose a new technique for identifying influential observations and observations containing “gross errors” and (2) to discuss situations under which each is likely to significantly alter the results of a factor analysis.

[1]  William R. Darden,et al.  Useful component analysis: Graphical views of marketing data , 1988 .

[2]  H. Kaiser A second generation little jiffy , 1970 .

[3]  W. W. Muir,et al.  Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1980 .

[4]  Sanjit Chatterjee,et al.  A note on finding extreme points in multivariate space , 1990 .

[5]  J. Carroll,et al.  Synthesized clustering: A method for amalgamating alternative clustering bases with differential weighting of variables , 1984 .

[6]  Bernard W. Silverman,et al.  Constructing the Convex Hull of a Set of Points in the Plane , 1979, Comput. J..

[7]  David Stewart,et al.  The Application and Misapplication of Factor Analysis in Marketing Research , 1981 .

[8]  Ronald L. Graham,et al.  An Efficient Algorithm for Determining the Convex Hull of a Finite Planar Set , 1972, Inf. Process. Lett..

[9]  G. Duncan,et al.  Multivariate Analysis: With Applications in Education and Psychology. , 1977 .

[10]  Howard B. Lee,et al.  A first course in factor analysis , 1973 .

[11]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[12]  H. Harman Modern factor analysis , 1961 .

[13]  Gene H. Golub,et al.  Matrix computations , 1983 .

[14]  S. Weisberg Plots, transformations, and regression , 1985 .

[15]  Ronald D. Anderson,et al.  A Simulation Study of Factor Score Indeterminacy , 1986 .

[16]  D. F. Andrews,et al.  Finding the Outliers that Matter , 1978 .

[17]  John R. Hauser,et al.  Design and marketing of new products , 1980 .

[18]  S. Chatterjee,et al.  Influential Observations, High Leverage Points, and Outliers in Linear Regression , 1986 .

[19]  Howard B. Lee,et al.  Foundations of Behavioral Research , 1973 .

[20]  Ronald D. Anderson,et al.  A Monté Carlo Comparison of Factor Analytic Methods , 1980 .