论文信息 - A Comparison of Segment Retention Criteria for Finite Mixture Logit Models

A Comparison of Segment Retention Criteria for Finite Mixture Logit Models

Despite the widespread application of finite mixture models in marketing research, the decision of how many segments to retain in the models is an important unresolved issue. Almost all applications of the models in marketing rely on segment retention criteria such as Akaike's information criterion, Bayesian information criterion, consistent Akaike's information criterion, and information complexity to determine the number of latent segments to retain. Because these applications employ real-world data in which the true number of segments is unknown, it is not clear whether these criteria are effective. Retaining the true number of segments is crucial because many product design and marketing decisions depend on it. The purpose of this extensive simulation study is to determine how well commonly used segment retention criteria perform in the context of simulated multinomial choice data, as obtained from supermarket scanner panels, in which the true number of segments is known. The authors find that an Akaike's information criterion with a penalty factor of three rather than the traditional value of two has the highest segment retention success rate across nearly all experimental conditions. Currently, this criterion is rarely, if ever, applied in the marketing literature. Experimental factors of particular interest in marketing contexts, such as the number of choices per household, the number of choice alternatives, the error variance of the choices, and the minimum segment size, have not been considered in the statistics literature. The authors show that they, among other factors, affect the performance of segment retention criteria.

Rick L. Andrews | Imran S. Currim

[1] A. Hope. A Simplified Monte Carlo Significance Test Procedure , 1968 .

[2] Paul I. Feder,et al. On the Distribution of the Log Likelihood Ratio Test Statistic When the True Parameter is "Near" the Boundaries of the Hypothesis Regions , 1968 .

[3] H. Akaike,et al. Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[4] G. Schwarz. Estimating the Dimension of a Model , 1978 .

[5] Hamparsum Bozdogan. MULTI-SAMPLE CLUSTER ANALYSIS AND APPROACHES TO VALIDITY STUDIES IN CLUSTERING INDIVIDUALS. , 1981 .

[6] Murray Aitkin,et al. Statistical Modelling of Data on Teaching Styles , 1981 .

[7] David C. Schmittlein,et al. A Bayesian Cross-Validated Likelihood Method for Comparing Alternative Specifications of Quantitative Models , 1985 .

[8] G. McLachlan. On Bootstrapping the Likelihood Ratio Test Statistic for the Number of Components in a Normal Mixture , 1987 .

[9] H. Bozdogan. Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[10] Geoffrey J. McLachlan,et al. Mixture models : inference and applications to clustering , 1989 .

[11] Gary J. Russell,et al. A Probabilistic Choice Model for Market Segmentation and Elasticity Structure , 1989 .