Paper information - Competing methods for representing random taste heterogeneity in discrete choice models

Competing methods for representing random taste heterogeneity in discrete choice models

The representation of random taste heterogeneity has been a prime research interest in discrete choice modelling in recent years. Introducing random taste heterogeneity brings highly valued gains in model flexibility and in the ability of models to fit particular data. There are, however, drawbacks that must be addressed. Aside from the heightened cost of estimation, the main complication arising with the use of mixture models is the specification of a distribution for those taste parameters that vary randomly across respondents. It is therefore of interest to specify models using mixture distributions that allow the range to be controlled while also yielding sufficient flexibility to fit the data. We further require that flexibility be scalable, so that the flexibility of the mixture can be increased gradually as desired in any given application. This would allow practitioners to start with a standard model and then adapt it to the situation at hand. Finally, we ask that increased flexibility be achievable at minimal additional computational cost, so that there is hope that the methods will be applied in large-scale applications. We summarise these conditions as (a) range control, (b) flexibility, (c) scalable flexibility, and (d) economy. Some effort has gone into advocating the use of discrete mixture models and non-parametric distributions. The distributions afforded by these methods are as flexible as the data allow and also give direct control over the range of the mixture distribution. This meets requirements (a) to (c) but not requirement (d). With discrete mixtures, the number of mass points required may be excessively high and there may be substantial numerical problems involved. Non-parametric methods are generally very computationally intensive. For these reasons, these methods are unlikely to be considered for large-scale applications.
Some authors have investigated the use of more advanced continuous distributions such as Johnson SB or Johnson SU. This is a step in the right direction, but the flexibility of such distributions is still not scalable. They are also mostly unimodal, which might not hold for the true distribution to be estimated. In this paper, we stage a competition between two alternative approaches to the specification of a mixture distribution that both meet our requirements. The competition takes place over a number of matches, where each match is the estimation of a model on simulated datasets, each based on a true distribution to be estimated. These distributions are specified by us in advance so as to pose challenging estimation problems. We mimic what a practitioner might do: we fix the estimation methods without using our a priori knowledge of the true distribution, scale the flexibility as indicated by the data, and in each match evaluate which approach performs best in terms of our criteria. The first of our contenders is a mixture distribution that is itself a discrete mixture of continuous distributions. In principle, the components can be any continuous parametric distributions. However, we restrict attention to the Normal distribution as the base distribution and thus obtain a discrete mixture of Normals. This approach is scalable via the number of Normal distributions used and is a straightforward extension of the standard Normal mixture. It can easily accommodate a multimodal distribution. The second contender is essentially semi-nonparametric (SNP) in nature and uses a representation of densities from Bierens (2005) that can approximate virtually any continuous distribution. For the covering abstract see ITRD E135582.
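
As a rough illustration of the first contender, the following Python sketch draws a random taste coefficient from a discrete mixture of Normals, evaluates the mixture density, and averages a binary logit probability over the draws. It is a minimal sketch under stated assumptions, not the authors' estimation code: the function names, the two-component weights, means and standard deviations, and the single attribute difference `dx` are all hypothetical values chosen for illustration. Adding components to the three lists is the sense in which flexibility is scalable.

```python
import numpy as np

def sample_normal_mixture(weights, means, sds, size, rng):
    """Draw `size` values from a discrete mixture of Normals."""
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    sds = np.asarray(sds, dtype=float)
    # Pick a component for every draw, then draw from that component.
    comp = rng.choice(len(weights), size=size, p=weights)
    return rng.normal(means[comp], sds[comp])

def normal_mixture_pdf(x, weights, means, sds):
    """Mixture density: sum over k of w_k * Normal(x; mu_k, sd_k)."""
    x = np.asarray(x, dtype=float)[..., None]
    w, mu, sd = (np.asarray(a, dtype=float) for a in (weights, means, sds))
    comp_pdf = np.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))
    return comp_pdf @ w

def simulated_binary_logit_prob(dx, beta_draws):
    """Average the logit probability of alternative 1 over draws of beta.

    `dx` is the attribute difference between alternative 1 and alternative 0.
    """
    return float(np.mean(1.0 / (1.0 + np.exp(-beta_draws * dx))))

# Hypothetical bimodal taste distribution (illustrative values only).
rng = np.random.default_rng(0)
weights, means, sds = [0.6, 0.4], [-2.0, 1.0], [0.5, 0.8]

draws = sample_normal_mixture(weights, means, sds, 200_000, rng)
grid = np.linspace(-8.0, 8.0, 4001)
pdf = normal_mixture_pdf(grid, weights, means, sds)
prob = simulated_binary_logit_prob(1.0, draws)
```

With a single component this reduces to the standard Normal mixture. In an estimation setting the weights, means and standard deviations would be parameters to be estimated rather than fixed as they are here.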

Stephane Hess | Mogens Fosgerau