Identifying the Content Zones of German Court Decisions

A central step in the automatic processing of court decisions is identifying their content zones, i.e., breaking each document up into functionally independent areas. We assembled a corpus of German court decisions and argue that this genre belongs to the class of semi-structured text documents. We are currently implementing zone identification with a set of recognition rules, building on our earlier experience with a different genre (film reviews).
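To make the idea of rule-based zone identification concrete, the following is a minimal sketch and not the rule set used in this work. It assumes that zones are introduced by heading lines such as "Tenor", "Tatbestand", and "Entscheidungsgründe"; the zone labels, the patterns in `ZONE_RULES`, and the function `identify_zones` are hypothetical names chosen for illustration.

```python
import re

# Hypothetical recognition rules: each pairs a zone label with a pattern
# that matches a heading line. The actual rules are not given in the text.
ZONE_RULES = [
    ("tenor", re.compile(r"^\s*Tenor\s*:?\s*$", re.IGNORECASE)),
    ("facts", re.compile(r"^\s*Tatbestand\s*:?\s*$", re.IGNORECASE)),
    ("reasons", re.compile(r"^\s*(Entscheidungsgründe|Gründe)\s*:?\s*$", re.IGNORECASE)),
]


def identify_zones(text, initial_zone="header"):
    """Split a decision into (label, lines) pairs using heading rules.

    Everything before the first recognized heading is assigned to
    `initial_zone`. Blank lines are dropped; heading lines are not
    included in the zone content.
    """
    zones = []
    current_label = initial_zone
    current_lines = []
    for line in text.splitlines():
        # First rule whose pattern matches this line, or None.
        matched = next(
            (label for label, pattern in ZONE_RULES if pattern.match(line)),
            None,
        )
        if matched is not None:
            # A heading closes the current zone and opens a new one.
            if current_lines:
                zones.append((current_label, current_lines))
            current_label = matched
            current_lines = []
        elif line.strip():
            current_lines.append(line.strip())
    if current_lines:
        zones.append((current_label, current_lines))
    return zones
```

Calling `identify_zones` on the plain text of a decision returns the zones in document order. A realistic rule set would likely need additional cues beyond headings, for example position in the document or typographic features, since not every zone is guaranteed to carry an explicit heading.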