English-Chinese Cross-language Information Retrieval Using Lucene System
暂无分享,去创建一个
The paper describes the implementation of an English-Chinese cross-language information retrieval system (CLIR). It focuses on finding effective translation equivalents between English and Chinese, and improving the performance of Chinese IR. On English-Chinese CLIR, it adopts query translation as the dominant strategy, and utilizes English-Chinese bilingual dictionary as the important knowledge resource to acquire correct translations. On Chinese monolingual retrieval, it investigates the use of different entities as indexes and implements retrieval system based on the Lucene toolkit. On system evaluation, it presents a quick method to construct the sets of relevant documents for query topics.