Merging Different Languages in a Single Document Collection

Multilingual IR is usually carried out with separate collections, each for a language. Once a set of answers have been found in each language, all the sets have to be merged to produce a unique answer list. In our experiments of CLEF2002, we try to implement a different approach, in which the documents in different languages are mixed in the same collection. Indexes are associated with a language tag so as to distinguish homographs in different languages. The indexing and retrieval processes can then be done once for all the documents. No result merging is required. This report describes our first tests in CLEF2002.