A Comparative Study on English-Chinese Bilingual News Document Clustering

Bilingual or multilingual document clustering is a valuable research topic. Building on a monolingual clustering algorithm, this paper compares monolingual-based and bilingual-based clustering on a corpus of English-Chinese bilingual news text. The experimental results show that the mixed-language-based method achieves better and more stable performance.