An approach for semantic query expansion based on maximum entropy-hidden Markov model

The ineffectiveness of information retrieval systems is mostly caused by the inaccurate query formed by a few keywords that reflect actual user information need. One well known technique to overcome this limitation is Automatic Query Expansion (AQE), whereby the user's original query is improved by adding new features with a related meaning. It has long been accepted that capturing term associations is a vital part of information retrieval. It is therefore mainly to consider whether many sources of support may be combined to forecast term relations more precisely. This is mainly significant when frustrating to predict the probability of relevance of a set of terms given a query, which may involve both lexical and semantic relations between the terms. This paper presents a approach to expand the user query using three level domain model such as conceptual level(underlying Domain knowledge), linguistic level(term vocabulary based on Wordnet), stochastic model ME-HMM2 which combines (HMM (Hidden Markov Model and Maximum Entropy(ME) models) stores the mapping between such levels, taking into account the linguistic context of words.

[1]  Hajer Baazaoui Zghal,et al.  SIRO: On-line semantic information retrieval using ontologies , 2007, 2007 2nd International Conference on Digital Information Management.

[2]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[3]  Andrew McCallum,et al.  Maximum Entropy Markov Models for Information Extraction and Segmentation , 2000, ICML.

[4]  Zhenyu Liu,et al.  Textual Document Indexing and Retrieval via Knowledge Sources and Data Mining , 2002 .

[5]  Zhenyu Liu,et al.  Techniques for Textual Document Indexing and Retrieval Knowledge Sources and Data Mining , 2003, Clustering and Information Retrieval.

[6]  Rila Mandala Improving automatic query expansion by using heterogeneous thesauri , 2001 .

[7]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[8]  F. Jelinek Fast sequential decoding algorithm using a stack , 1969 .

[9]  P. Smith,et al.  A review of ontology based query expansion , 2007, Inf. Process. Manag..

[10]  Antonio Maria Rinaldi,et al.  An ontology-driven approach for semantic information retrieval on the Web , 2009, TOIT.

[11]  Adwait Ratnaparkhi,et al.  A maximum entropy model for parsing , 1994, ICSLP.

[12]  Xiaoying Gao,et al.  Exploiting underrepresented query aspects for automatic query expansion , 2007, KDD '07.

[13]  Wei-Ying Ma,et al.  Query Expansion by Mining User Logs , 2003, IEEE Trans. Knowl. Data Eng..

[14]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[15]  Min Song,et al.  Integration of association rules and ontologies for semantic query expansion , 2007, Data Knowl. Eng..

[16]  Vijayan Sugumaran,et al.  Improving web-query processing through semantic knowledge , 2008, Data Knowl. Eng..

[17]  Roberto Tedesco,et al.  A novel semantic information retrieval system based on a three-level domain model , 2013, J. Syst. Softw..

[18]  Yang He Extended Viterbi algorithm for second order hidden Markov process , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[19]  Claudio Carpineto,et al.  A Survey of Automatic Query Expansion in Information Retrieval , 2012, CSUR.

[20]  Jaime G. Carbonell,et al.  Document Representation and Query Expansion Models for Blog Recommendation , 2008, ICWSM.

[21]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[22]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.