Developing a semantic-enable information retrieval mechanism

The existing information retrieval systems are mostly keyword-based and retrieve relevant documents or information by matching keywords. Keyword-based search, in spite of its merits of expedient query for information and ease-of-use, has failed to represent the complete semantics contained in the content and has let to the retrieval failure. In a textual content, the author's intention is represented in a semantic format of various combinations of word-word relations that are comprehensible to human beings. Query constructed by descriptions in natural language best reflects querist's intention. This study developed a semantic-enable information retrieval mechanism that handles the processing, recognition, extraction, extensions and matching of content semantics to achieve the following objectives: (1) to analyze and determine the semantic features of content, to develop a semantic pattern that represents semantic features of the content, and to structuralize and materialize semantic features; (2) to analyze user's query and extend its implied semantics through semantic extension so as to identify more semantic features for matching; and (3) to generate contents with approximate semantics by matching against the extended query to provide correct contents to the querist. This mechanism is capable of improving the traditional problem of keyword search and enables the user to perform a semantic-based query and search for the required information, thereby improving the reusing and sharing of information.

[1]  Faker Ben Belgacem,et al.  The Mortar finite element method with Lagrange multipliers , 1999, Numerische Mathematik.

[2]  Byron L. D. Bezerra,et al.  A symbolic approach for content-based information filtering , 2004, Inf. Process. Lett..

[3]  Chih-Jen Lin,et al.  The analysis of decomposition methods for support vector machines , 2000, IEEE Trans. Neural Networks Learn. Syst..

[4]  Susan T. Dumais,et al.  Using Linear Algebra for Intelligent Information Retrieval , 1995, SIAM Rev..

[5]  Hsinchun Chen,et al.  Intelligent internet searching agent based on hybrid simulated annealing , 2000, Decis. Support Syst..

[6]  Hugh C. Davis,et al.  Managing the semantic aspects of learning using the knowledge life cycle , 2005, Fifth IEEE International Conference on Advanced Learning Technologies (ICALT'05).

[7]  Ahmed Abdelali,et al.  Improving query precision using semantic expansion , 2007, Inf. Process. Manag..

[8]  Gerald Salton,et al.  Automatic text processing , 1988 .

[9]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[10]  Maria Vargas-Vera,et al.  A Question-Answering System Using Argumentation , 2004, MICAI.

[11]  Sanda M. Harabagiu,et al.  LASSO: A Tool for Surfing the Answer Net , 1999, TREC.

[12]  Hyunbo Cho,et al.  A novel method for measuring semantic similarity for XML schema matching , 2008, Expert Syst. Appl..

[13]  Paul Johannesson,et al.  Classifying the Semantics of Relationships in Conceptual Modeling by Categorization of Roles , 2001, NLDB.

[14]  Arthur B. Markman,et al.  Knowledge Representation , 1998 .

[15]  Veda C. Storey,et al.  Understanding semantic relationships , 1993, The VLDB Journal.

[16]  Stephen E. Robertson,et al.  On document relevance and lexical cohesion between query terms , 2006, Inf. Process. Manag..

[17]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[18]  Hsin-Chang Yang,et al.  A classifier-based text mining approach for evaluating semantic relatedness using support vector machines , 2005, International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II.

[19]  Dieter Fensel,et al.  Towards the Semantic Web: Ontology-driven Knowledge Management , 2002 .

[20]  Farhi Marir,et al.  Document management systems from current capabilities towards intelligent information retrieval: an overview , 1999 .

[21]  Wei-Pang Yang,et al.  Text summarization using a trainable summarizer and latent semantic analysis , 2005, Inf. Process. Manag..

[22]  Dieter Fensel,et al.  Conclusions: Ontology‐driven Knowledge Management – Towards the Semantic Web? , 2003 .

[23]  Sung-Hyon Myaeng,et al.  Semantic passage segmentation based on sentence topics for question answering , 2007, Inf. Sci..

[24]  Der-Chiang Li,et al.  An algorithm to cluster data for efficient classification of support vector machines , 2008, Expert Syst. Appl..

[25]  Marlene Scardamalia,et al.  Computer Support for Knowledge-Building Communities , 1994 .

[26]  Peter W. Foltz,et al.  An introduction to latent semantic analysis , 1998 .

[27]  Milad Shokouhi,et al.  Using query logs to establish vocabularies in distributed information retrieval , 2007, Inf. Process. Manag..

[28]  Philippe Blache,et al.  A semantic vector space and features-based approach for automatic information filtering , 2004, Expert Syst. Appl..

[29]  Kyo Kageura,et al.  Virtual relevant documents in text categorization with support vector machines , 2007, Inf. Process. Manag..

[30]  Veda C. Storey,et al.  Comparing relationships in conceptual modeling: mapping to semantic classifications , 2005, IEEE Transactions on Knowledge and Data Engineering.

[31]  Daniel E. O'Leary,et al.  Internet-based information and retrieval systems , 1999, Decis. Support Syst..