Latent Semantic Indexing using Multiresolution Analysis

Latent semantic indexing (LSI) is commonly used to match queries to documents in information retrieval (IR) applications. It has been shown to improve the retrieval performance, as it can deal with synonymy and polysemy problems. This paper proposes a hybrid approach which can improve result accuracy significantly. Evaluation of the approach based on using the Haar wavelet transform (HWT) as a preprocessing step for the singular value decomposition (SVD) in the LSI system is presented, using Donoho′s thresholding with the transformation in HWT. Furthermore, the effect of different levels of decomposition in the HWT process is investigated. The experimental results presented in the paper confirm a significant improvement in performance by applying the HWT as a preprocessing step using Donoho′s thresholding.

[1]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[2]  Abbes Amira,et al.  A Novel Approach for Lexical Noise Analysis and Measurement in Intelligent Information Retrieval , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[3]  Abbes Amira,et al.  Performance evaluation of DCT and wavelet transform for LSI , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[4]  D. Sackett,et al.  Cochrane Collaboration , 1994, BMJ.

[5]  Ioannis Delakis,et al.  Wavelet-based de-noising algorithm for images acquired with parallel magnetic resonance imaging (MRI) , 2007, Physics in medicine and biology.

[6]  Abbes Amira,et al.  An automatic face recognition system based on wavelet transforms , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[7]  Elizabeth R. Jessup,et al.  Matrices, Vector Spaces, and Information Retrieval , 1999, SIAM Rev..

[8]  Eduard Hoenkamp,et al.  Unitary Operators on the Document Spac , 2003, J. Assoc. Inf. Sci. Technol..

[9]  Susan T. Dumais,et al.  Using Linear Algebra for Intelligent Information Retrieval , 1995, SIAM Rev..

[10]  P. P. Vaidyanathan,et al.  Wavelet-based denoising by customized thresholding , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.