A parametric linguistics based approach for cross-lingual web querying

Developing efficient and meaningful search mechanisms for the Web is an active area of research in Information Management. With information explosion on the Internet, existing search engines encounter difficulty in accurate document positioning and retrieval. This situation is exacerbated by the language barrier for accessing web content provided in different languages. Sophisticated content-based search engines are needed for helping users find useful information quickly from multi-lingual knowledge sources on the Web. This paper presents a parametric linguistics based approach for flexible and scalable cross-lingual web querying with low complexity in query translation. The proposed methodology and the system architecture are discussed.

[1]  Peter Wegner,et al.  Why interaction is more powerful than algorithms , 1997, CACM.

[2]  A. Turing On Computable Numbers, with an Application to the Entscheidungsproblem. , 1937 .

[3]  S. Pinker The language instinct : how the mind creates language , 1995 .

[4]  Vesper Owei,et al.  An intelligent approach to handling imperfect information in concept-based natural language queries , 2002, TOIS.

[5]  David A. Hull Using Structured Queries for Disambiguation in Cross-Language Information Retrieval , 1997 .

[6]  Ray Jackendoff Semantics and Cognition , 1983 .

[7]  Berthier A. Ribeiro-Neto,et al.  Concept-based interactive query expansion , 2005, CIKM '05.

[8]  Sandeep Purao,et al.  A Heuristic-Based Methodology for Semantic Augmentation of User Queries on the Web , 2003, ER.

[9]  Jorge Morato,et al.  WordNet Applications , 2004 .

[10]  Hsi-Jian Lee,et al.  Anchor text mining for translation of Web queries: A transitive translation approach , 2004, TOIS.

[11]  Pu-Jen Cheng,et al.  Translating unknown cross-lingual queries in digital libraries using a Web-based approach , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[12]  Jianfeng Gao,et al.  Statistical query translation models for cross-language information retrieval , 2006, TALIP.

[13]  Georgia Koutrika,et al.  Personalized Systems: Models and Methods from an IR and DB Perspective , 2005, VLDB.

[14]  Robin Milner,et al.  Elements of interaction: Turing award lecture , 1993, CACM.

[15]  Mark C. Baker The Atoms of Language , 1987 .

[16]  Bonnie J. Dorr,et al.  Solving Thematic Divergences in Machine Translation , 1990, ACL.

[17]  Epaminondas Kapetanios,et al.  Simplifying Syntactic and Semantic Parsing of NL Based Queries in Advanced Application Domains , 2005, NLDB.

[18]  Stefan Decker,et al.  TRIPLE - A Query, Inference, and Transformation Language for the Semantic Web , 2002, SEMWEB.

[19]  Zhenyu Liu,et al.  Knowledge-based query expansion to support scenario-specific retrieval of medical free text , 2005, SAC '05.

[20]  John Fry Assembling a Parallel Corpus from RSS News Feeds , 2005, MTSUMMIT.

[21]  Kai-Uwe Sattler,et al.  Concept-based querying in mediator systems , 2005, The VLDB Journal.

[22]  Noam Chomsky,et al.  On Nature and Language , 2002 .

[23]  Jay F. Nunamaker,et al.  Multilingual Web Retrieval: An Experiment on a Multilingual Business Intelligence Portal , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[24]  Takenobu Tokunaga,et al.  Combining multiple evidence from different types of thesaurus for query expansion , 1999, SIGIR '99.

[25]  Djoerd Hiemstra,et al.  Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002 , 2003, SIGF.

[26]  Sihem Amer-Yahia,et al.  Report on the DB/IR panel at SIGMOD 2005 , 2005, SGMD.