Some models of loan default are binary, simply modelling the probability of default, while others go further and model the extent of default (eg number of outstanding payments; amount of arrears). The double-hurdle model, originally due to Cragg (Econometrica, 1971), and conventionally applied to household consumption or labour supply decisions, contains two equations, one which determines whether or not a customer is a potential defaulter (the ‘first hurdle’), and the other which determines the extent of default. In separating these two processes, the model recognizes that there exists a subset of the observed non-defaulters who would never default whatever their circumstances. A Box-Cox transformation applied to the dependent variable is a useful generalization to the model. Estimation is relatively easy using the Maximum Likelihood routine available in STATA. The model is applied to a sample of 2515 loan applicants for whom loans were approved, a sizeable proportion of whom defaulted in varying degrees. The dependent variables used are amount in arrears and number of days in arrears. The value of the hurdle approach is confirmed by finding that certain key explanatory variables have very different effects between the two equations. Most notably, the effect of loan amount is strongly positive on arrears, while being U-shaped on the probability of default. The former effect is seriously under-estimated when the first hurdle is ignored.
[1]
Montserrat Guillén,et al.
Count data models for a credit scoring system
,
1996
.
[2]
Allen McDowell,et al.
From the Help Desk: Hurdle Models
,
2003
.
[3]
Alan T. K. Wan,et al.
Handbook of applied econometrics and statistical inference
,
2002
.
[4]
J. G. Cragg.
Some Statistical Models for Limited Dependent Variables with Application to the Demand for Durable Goods
,
1971
.
[5]
W. Cleveland.
Robust Locally Weighted Regression and Smoothing Scatterplots
,
1979
.
[6]
Margaret Irish,et al.
Statistical models for zero expenditures in household budgets
,
1984
.
[7]
Andrew M. Jones.
A double‐hurdle model of cigarette consumption
,
1989
.
[8]
Mark B. Stewart,et al.
On Least Squares Estimation when the Dependent Variable is Grouped
,
1982
.
[9]
Andrew M. Jones,et al.
A Box-Cox Double-Hurdle Model
,
1994
.