Corpus studies and probabilistic grammar