Regular Paper

Creating a Noisy Parallel Corpus from Newswire Articles Using Cross-language Information Retrieval