Categorization by simplicity: a minimum description length approach to unsupervised clustering