MULTILINGUAL CORPORA FOR COOPERATION

MLCC was a corpus, acquisition project funded by the EC Telematics program.The aim was to collect a set of texts representing a substantial improvement in range, quantity and quality of corpus material available. Two sub-corpora have been defined to help meet the needs for multilingual data consisting of a comparable set of texts in six languages and a parallel set of data in 9 languages. The comparable text collection includes financial newspaper articles from the early '90s. The parallel data is taken from the Official Journal of the European Commission, sub-series Written Questions to Parliament and from the Proceedings of the European Parliament. The data has been converted to an SGML, TEI-conformant mark-up and is distributed by ELRA.