Learning the surface structure of wh-questions in English and French with a non-parametric Bayesian model