Ti-H CONCERN of this paper is the estimation of the parameters of a probabilistic choice model when choices rather than decision makers are sampled. Existing estimation methods presuppose an exogeneous sampling process, that is one in which a sequence of decision makers are drawn and their choice behaviors observed. In contrast, in choice based sampling processes, a sequence of chosen alternatives are drawn and the characteristics of the decision makers selecting those alternatives are observed. The problem of estimating a choice model from a choice based sample has suibstantive interest because data collection costs for such processes are often considerably smaller than for exogeneous sampling. Particular instances of this differential occur in the analysis of transportation behavior. For example, in studying choice of mode for work trips, it is often less expensive to survey transit users at the station and auto users at the parking lot than to interview commuters at their homes. Similarly, in examining choice of destination for shopping trips, surveys conducted at various shopping centers offer significant cost savings relative to home interviews.2 While interest in transportation applications provided the original motivation for our work, it has become apparent that choice based sampling processes can be cost effective in the analysis of numerous decision problems. In particular, wherever decision makers are physically clustered according to the alternatives they select, choice based sampling processes can achieve economies of scale not available with exogeneous sampling. Some non-transportation decision problems in which decision makers do cluster as described include the schooling decisions of students, the job decisions of workers, the medical care decisions of patients and the residential location decisions of households. Realization of the sampling cost benefits of choice based samples presupposes of course that the parameters of the underlying choice model can logically be inferred from such samples and that a tractable estimator with desirable statistical properties can be found. We shall, in this paper, confirm the logical supposition, develop a suitable estimator, and characterize the behavior of existing, exogeneous sampling, estimators in the context of choice based samples. An outline of the presentation and summary of major results follows.
[1]
D. McFadden.
Conditional logit analysis of qualitative choice behavior
,
1972
.
[2]
C. Manski.
MAXIMUM SCORE ESTIMATION OF THE STOCHASTIC UTILITY MODEL OF CHOICE
,
1975
.
[3]
Roger J. Bowden,et al.
THE THEORY OF PARAMETRIC IDENTIFICATION
,
1973
.
[4]
Calyampudi Radhakrishna Rao,et al.
Linear Statistical Inference and its Applications
,
1967
.
[5]
W. Fleming.
Functions of Several Variables
,
1965
.
[6]
D. McFadden.
Quantal Choice Analysis: A Survey
,
1976
.
[7]
A. Tversky.
Choice by elimination
,
1972
.
[8]
Takeshi Amemiya,et al.
Regression Analysis when the Dependent Variable is Truncated Normal
,
1973
.
[9]
R. Jennrich.
Asymptotic Properties of Non-Linear Least Squares Estimators
,
1969
.
[10]
T. Rothenberg.
Identification in Parametric Models
,
1971
.
[11]
Richard B. Westin,et al.
Predictions from binary choice models
,
1974
.