The authors study the estimation of factor models and the imputation of missing data and propose an approach that provides direct estimates of factor weights without the replacement of missing data with imputed values. First, the approach is useful in applications of factor analysis in the presence of missing data. Second, the proposed factor analysis model may be used as a vehicle for imputing missing data, producing a complete data set that can be analyzed subsequently with some other method. Here, the factor model itself is not of primary interest but presents a suitable model for purposes of imputation. The proposed method accommodates various patterns of missing data commonly found in marketing. The framework for factor analysis the authors develop deals with both discrete and continuous variables and gives rise to several models not considered previously. The authors illustrate various factor models on synthetic data, investigating their performance when missing data are present and when the distribution of the observed variables is incorrectly specified. The authors provide two empirical studies of the performance of the approach. In the first, the authors demonstrate how the proposed approach recovers the true (complete-data) factor structure in the presence of missing observations that occur because of item nonresponse and compare the procedure with three alternative methods traditionally used for handling missing data in factor analysis. In the second application, the factor model is used as a vehicle to impute data that are missing by design.
[1]
P. McCullagh,et al.
Generalized Linear Models
,
1992
.
[2]
D. Andrich.
Application of a Psychometric Rating Model to Ordered Categories Which Are Scored with Successive Integers
,
1978
.
[3]
Lung-fei Lee,et al.
Simulated Maximum Likelihood Estimation of Dynamic Discrete Choice Statistical Models Some Monte Carlo Results
,
1997
.
[4]
David E. Booth,et al.
Analysis of Incomplete Multivariate Data
,
2000,
Technometrics.
[5]
H. Bozdogan.
Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions
,
1987
.
[6]
J. Murry,et al.
Evaluating the Measurement Validity of Lifestyle Typologies with Qualitative Measures and Multiplicative Factoring
,
1990
.
[7]
John M. McCann,et al.
Item Nonresponse in Mail Surveys: Extent and Correlates
,
1978
.
[8]
Wagner A. Kamakura,et al.
Statistical Data Fusion for Cross-Tabulation
,
1997
.
[9]
Banwari Mittal,et al.
Here's the Beef: Factors, Determinants, and Segments in Consumer Criticism of Advertising
,
1993
.
[10]
Erling B. Andersen,et al.
Discrete Statistical Models with Social Science Applications.
,
1980
.
[11]
S. Haberman.
Analysis of Dispersion of Multinomial Responses
,
1982
.
[12]
D. McFadden.
A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration
,
1989
.
[13]
J. Aaker,et al.
Dimensions of Brand Personality
,
1997
.
[14]
P. McCullagh,et al.
Generalized Linear Models
,
1972,
Predictive Analytics.