Searching Tourism Information by Using Vertical Search Engine Based on Nutch and Solr

Since there exist some issues with traditional search engine in information retrieval, such as huge numbers of results, poor profession, low precision rate and other issues, in this paper, we proposed a Vertical Search Engine based on Nutch and Solr. We used forward iteration most granular segmentation algorithm based on dictionary to achieve Chinese word segmentation, employed Vector Space Model (VSM) based on keywords to implement topic relevance, extended the user search module and the tourism domain word library to collect information, filter information retrieval, and relate word various stages. Experiments were also conducted in order to evaluate the algorithm and the results show that the vertical search engine based on Nutch and Solr which is used for tourism information retrieval can improve the user retrieval precision and meet the professional demand of user retrieval.