Heterogeneous Multi-core Design for Information Retrieval Efficiency on the Vector Space Model

This paper proposes a hardware approach to improving information retrieval efficiency on the vector space model. We design a heterogeneous multicore system, within which auxiliary retrieval-oriented intellectual properties (IP) cores perform term counting as basic operations. Moreover, a memory system is also designed to supply data efficiently. Since term counting is a highly frequent operation in information retrieval, we hope the overall efficiency will improve significantly as a result of hardware implementation of term counting. The experiment shows that our system has a speedup of 3 to 7 for different input data. We have also analyzed the influence of data patterns on performance.