Machine Learning versus Knowledge Based Classification of Legal Texts

This paper presents results of an experiment in which we used machine learning (ML) techniques to classify sentences in Dutch legislation. These results are compared to the results of a pattern-based classifier. Overall, the ML classifier performs as accurate (>90%) as the pattern based one, but seems to generalize worse to new laws. Given these results, the pattern based approach is to be preferred since its reasons for classification are clear and can be used for further modelling of the content of the sentences.