Multilingual Information Retrieval: State of the Art

We present a survey of the state of the art in multilingual information retrieval, with particular attention to the different linguistic resources used and to the interactive aspects of searching for documents in languages unknown to the user.
