Non-Relevance Feedback Document Retrieval based on One Class SVM and SVDD

This paper reports a new document retrieval method using non-relevant documents. Especially, this paper reports a comparison of retrieval efficiency between one class support vector machine (SVM) based and support vector data description (SVDD) based interactive document retrieval method using non-relevant documents only. From a large data set of documents, we need to find documents that relate to human interesting in as few iterations of human testing or checking as possible. In each iteration a comparatively small batch of documents is evaluated for relating to the human interesting. We applied active learning techniques based on support vector machine for evaluating successive batches, which is called relevance feedback. Our proposed approach has been very useful for document retrieval with relevance feedback experimentally. The traditional relevance feedback needs a set of relevant and non-relevant documents to work usefully. However, the initial retrieved documents, which are displayed to a user, sometimes don't include relevant documents. In order to solve this problem, we propose a new feedback method using information of non-relevant documents only. We named this method non-relevance feedback document retrieval. The non-relevance feedback document retrievals are based on one class support vector machine and support vector data description. Our experimental results show that one class support vector machine based method can retrieve relevant documents efficiently using information of non-relevant documents only.