AN EFFICIENT INFORMATION RETRIEVAL SYSTEM USING QUERY EXPANSION AND DOCUMENT RANKING

Information retrieval is the process of searching and retrieval of information from documents that matches user query. The user information requirement is represented by a query or profile that contains one or more search terms. Indexing plays important role to retrieve the information. Researchers have been used indexing techniques only for document indexing and not focused on the speed up the search and retrieval time. In this paper, an enhanced inverted indexing technique is proposed to index all root terms of the documents. This approach maintains the weight of each term connected to the document in index structure. Multiple query terms are easy to handle using inverted index. After indexing, the searching and retrieval process is made by matching the query with the indexed terms. In order to solve the problem of false and null information retrieval, the proposed system includes query expansion and document ranking which improves retrieval accuracy.

[1]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[2]  Berkant Barla Cambazoglu,et al.  Performance of query processing implementations in ranking-based text retrieval systems using inverted indices , 2006, Inf. Process. Manag..

[3]  T. Martin McGinnity,et al.  Query Representation through Lexical Association for Information Retrieval , 2012, IEEE Transactions on Knowledge and Data Engineering.

[4]  Ken C. K. Lee,et al.  IR-Tree: An Efficient Index for Geographic Document Search , 2011, IEEE Trans. Knowl. Data Eng..

[5]  Mauro Dragoni,et al.  A conceptual representation of documents and queries for information retrieval systems by using light ontologies , 2012, Expert Syst. Appl..

[6]  Claudio Gennaro,et al.  Combining local and global visual feature similarity using a text search engine , 2011, 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI).

[7]  Ping-I Chen,et al.  Using Google latent semantic distance to extract the most relevant information , 2011, Expert Syst. Appl..

[8]  Syed Raza Ali Rizvi,et al.  Using semantic and structural similarities for indexing and searching scientific papers , 2011, 2011 IEEE International Conference on Computer Science and Automation Engineering.

[9]  Jia-Ling Koh,et al.  A multi-level hierarchical index structure for supporting efficient similarity search on tag sets , 2012, 2012 Sixth International Conference on Research Challenges in Information Science (RCIS).

[10]  N. V. Ganapathi Raju,et al.  An Application of Statistical indexing for Searching and Ranking of documents – A Case Study on Telugu Script , 2011 .

[11]  Danushka Bollegala,et al.  A Web Search Engine-Based Approach to Measure Semantic Similarity between Words , 2011, IEEE Transactions on Knowledge and Data Engineering.

[12]  Yonggang Wen,et al.  Private query processing on inverted index , 2011, 2011 IEEE Third Latin-American Conference on Communications.

[13]  Deepa Gupta,et al.  Improving unsupervised stemming by using partial lemmatization coupled with data-based heuristics for Hindi , 2012 .

[14]  Benoît Gaillard,et al.  Query expansion for Cross Language Information Retrieval Improvement , 2010, 2010 Fourth International Conference on Research Challenges in Information Science (RCIS).

[15]  Xiang Lian,et al.  Ranked Query Processing in Uncertain Databases , 2010, IEEE Transactions on Knowledge and Data Engineering.