Advances in natural language processing

Adaptive Selection of Base Classifiers in One-Against-All Learning for Large Multi-labeled Collections
Automatic Acquisition of Transfer Rules from Translation Examples
Automatic Assessment of Open Ended Questions with a Bleu-Inspired Algorithm and Shallow NLP
Automatic Phonetic Alignment and Its Confidence Measures
Automatic Spelling Correction in Galician
Baseline Methods for Automatic Disambiguation of Abbreviations in Jewish Law Documents
Bayes Decision Rules and Confidence Measures for Statistical Machine Translation
Character Identification in Children Stories
Comparison and Evaluation of Two Approaches of a Multilayered QA System Applied to Temporality
The Contents and Structure of the Context Base, and Its Application
Developing a Minimalist Parser for Free Word Order Languages with Discontinuous Constituency
Developing Competitive HMM PoS Taggers Using Small Training Corpora
Exploring the Use of Target-Language Information to Train the Part-of-Speech Tagger of Machine Translation Systems
Expressive Power and Consistency Properties of State-of-the-Art Natural Language Parsers
An Independent Domain Dialogue System Through a Service Manager
Information Retrieval in Digital Theses Based on Natural Language Processing Tools
Integrating Conceptual Density with WordNet Domains and CALD Glosses for Noun Sense Disambiguation
Intertwining Deep Syntactic Processing and Named Entity Detection
Language Understanding Using n-multigram Models
Multi-label Text Classification Using Multinomial Models
A Multi-use Incremental Syntax-Semantic Interface
Multiword Expression Translation Using Generative Dependency Grammar
Named Entity Recognition Through Corpus Transformation and System Combination
One Size Fits All? A Simple Technique to Perform Several NLP Tasks
Ontology-Based Feature Transformations: A Data-Driven Approach
On the Quality of Lexical Resources for Word Sense Disambiguation
Reuse of Free Online MT Engines to Develop a Meta-system of Multilingual Machine Translation
Semantic-Aided Anaphora Resolution in Large Corpora Development
SemRol: Recognition of Semantic Roles
Significance of Syntactic Features for Word Sense Disambiguation
SisHiTra: A Hybrid Machine Translation System from Spanish to Catalan
Smoothing and Word Sense Disambiguation
Spelling Correction for Search Engine Queries
A Statistical Study of the WPT-03 Corpus
A Study of Chunk-Based and Keyword-Based Approaches for Generating Headlines
Suffixal and Prefixal Morpholexical Relationships of the Spanish
SuPor: An Environment for AS of Texts in Brazilian Portuguese
Systemic Analysis Applied to Problem Solving: The Case of the Past Participle in French
The Merging Problem in Distributed Information Retrieval and the 2-Step RSV Merging Algorithm
Unsupervised Training of a Finite-State Sliding-Window Part-of-Speech Tagger
Using Seed Words to Learn to Categorize Chinese Text
On Word Frequency Information and Negative Evidence in Naive Bayes Text Classification

[1]  John Nerbonne,et al.  Measuring Dialect Distance Phonetically , 1997, SIGMORPHON@EACL.

[2]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[3]  James P. Callan,et al.  Training algorithms for linear text classifiers , 1996, SIGIR '96.

[4]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[5]  R. A. Redol JaVaLI!: understanding real questions∗ , 2003 .

[6]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[7]  Enrico Motta,et al.  Reusable components for knowledge modelling , 1998 .

[8]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[9]  Hang Li,et al.  Word Translation Disambiguation Using Bilingual Bootstrapping , 2004, Computational Linguistics.

[10]  I. Bosque,et al.  Gramática descriptiva de la lengua española , 1999 .

[11]  Derrick Higgins,et al.  A transformation-based approach to argument labeling , 2004, CoNLL.

[12]  Makoto Nagao,et al.  General Word Sense Disambiguation Method Based on a Full Sentential Context , 1998, WordNet@ACL/COLING.

[13]  Dragos Stefan Munteanu,et al.  GLEANS: A Generator of Logical Extracts and Abstracts for Nice Summaries , 2002 .

[14]  Fabio Rinaldi,et al.  ExtrAns: Extracting Answers from Technical Texts , 2003, IEEE Intell. Syst..

[15]  Alexander Budanitsky,et al.  Lexical Semantic Relatedness and Its Application in Natural Language Processing , 1999 .

[16]  Jaime G. Carbonell,et al.  Anaphora Resolution: A Multi-Strategy Approach , 1988, COLING.

[17]  Siobhan Devlin,et al.  Feature Selection for Summarising: The Sunderland DUC 2004 Experience , 2004 .

[18]  Asunción Gómez-Pérez,et al.  Building Ontologies at the Knowledge Level using the Ontology Design Environment , 1998 .

[19]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[20]  Penelope Sibun,et al.  A Practical Part-of-Speech Tagger , 1992, ANLP.

[21]  Bento Carlos Dias-da-Silva,et al.  A construção de um thesaurus eletrônico para o Português do Brasil , 2003 .

[22]  Kemal Oflazer,et al.  Error-tolerant Finite-state Recognition with Applications to Morphological Analysis and Spelling Correction , 1995, CL.

[23]  Carl Pollard,et al.  A Centering Approach to Pronouns , 1987, ACL.

[24]  W. B. Cavnar,et al.  Using An N-Gram-Based Document Representation With A Vector Processing Retrieval Model , 1994, TREC.

[25]  Ramesh Krishnamurthy,et al.  Peeling an Onion: The Lexicographer's Experience ofManual Sense-Tagging , 2000, Comput. Humanit..

[26]  Manuel Palomar,et al.  Semantic Knowledge-Driven Method to Solve Pronominal Anaphora in Spanish Texts , 2000, Natural Language Processing.

[27]  Silvia Faitelson-Weiser,et al.  Sufijación y derivación sufijal: sentido y forma , 1993 .

[28]  Rafael Muñoz,et al.  Splitting Complex Temporal Questions for Question Answering Systems , 2004, ACL.

[29]  Jason Eisner,et al.  Three New Probabilistic Models for Dependency Parsing: An Exploration , 1996, COLING.

[30]  Van Nostrand,et al.  Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .

[31]  Marti A. Hearst Text tiling: A quantitative approach to discourse segmentation , 1993, ACL 1993.

[32]  Manuel Palomar,et al.  A Maximum Entropy-based Word Sense Disambiguation System , 2002, COLING.

[33]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[34]  Mehryar Mohri,et al.  Finite-State Transducers in Language and Speech Processing , 1997, CL.

[35]  John Hale,et al.  A Statistical Approach to Anaphora Resolution , 1998, VLC@COLING/ACL.

[36]  Jerry R. Hobbs Resolving pronoun references , 1986 .

[37]  Encarna Segarra,et al.  Construction of language models using the morphic generator grammatical inference (MGGI) methodology , 1997, EUROSPEECH.

[38]  Mehryar Mohri,et al.  The Design Principles of a Weighted Finite-State Transducer Library , 2000, Theor. Comput. Sci..

[39]  Hwee Tou Ng,et al.  An Empirical Evaluation of Knowledge Sources and Learning Algorithms for Word Sense Disambiguation , 2002, EMNLP.

[40]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[41]  Mário J. Silva,et al.  Language identification in web pages , 2005, SAC '05.

[42]  Maria das Graças Volpe Nunes,et al.  DIADORIM - A Lexical Database for Brazilian Portuguese , 2002, LREC.

[43]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[44]  Yves Schabes,et al.  Deterministic Part-of-Speech Tagging with Finite-State Transducers , 1995, Comput. Linguistics.

[45]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2002 Shared Task: Language-Independent Named Entity Recognition , 2002, CoNLL.

[46]  M. F. Lang,et al.  Formación de palabras en español: morfología derivativa productiva en el léxico moderno , 1992 .

[47]  Michael F. McTear,et al.  Book Review: Spoken Dialogue Technology: Toward the Conversational User Interface, by Michael F. McTear , 2002, CL.

[48]  Sanda M. Harabagiu,et al.  Using Predicate-Argument Structures for Information Extraction , 2003, ACL.

[49]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[50]  Robert Gaizauskas,et al.  Quantitative evaluation of coreference algorithms in an information extraction system , 2000 .

[51]  Shankar Kumar,et al.  Minimum Bayes-Risk Decoding for Statistical Machine Translation , 2004, NAACL.

[52]  John F. Sowa,et al.  Construing Experience through Meaning: A Language-based Approach to Cognition by M. A. K. Halliday and Christian M. I. M. Matthiessen , 2001, Computational Linguistics.

[53]  Ben Hutchinson,et al.  Dependency-based semantic interpretation for answer extraction , 2002 .

[54]  Douglas-Val Ziegler The automatic identification of languages using linguistic recognition signals , 1992 .

[55]  João C. O. Matias,et al.  Edite - A Natural Language Interface to Databases A new dimension for an old approach , 1997 .

[56]  Sandro Pedrazzini Treating Terms As Idioms , 1999 .

[57]  King-Lup Liu,et al.  Discovering the representative of a search engine , 2001, CIKM '01.

[58]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[59]  Mitchell P. Marcus,et al.  Three machine learning algorithms for lexical ambiguity resolution , 1996 .

[60]  B. John Oommen,et al.  Spelling correction using probabilistic methods , 1984, Pattern Recognit. Lett..

[61]  Fred J. Damerau,et al.  A technique for computer detection and correction of spelling errors , 1964, CACM.

[62]  Ted Pedersen,et al.  A Decision Tree of Bigrams is an Accurate Predictor of Word Sense , 2001, NAACL.

[63]  William M. Fisher A statistical text-to-phone function using ngrams and rules , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[64]  Wayne H. Ward,et al.  Target Word Detection and Semantic Role Chunking using Support Vector Machines , 2003, NAACL.

[65]  Werner Winiwarter Incremental Learning of Transfer Rules for Customized Machine Translation , 2004, INAP/WLP.

[66]  Abdelmajid Ben Hamadou,et al.  Automatic Text Summarization of Scientific Articles Based on Classification of Extract's Population , 2003, CICLing.

[67]  Jes Us Gim Enez And Llu Fast and Accurate Part{of{speech Tagging: the Svm Approach Revisited , 2003 .

[68]  Helmut Schmid,et al.  Improvements in Part-of-Speech Tagging with an Application to German , 1999 .

[69]  Karen Kukich,et al.  Techniques for automatically correcting words in text , 1992, CSUR.

[70]  Ferran Plà,et al.  Improving part-of-speech tagging using lexicalized HMMs , 2004, Natural Language Engineering.

[71]  Grace Ngai,et al.  Transformation Based Learning in the Fast Lane , 2001, NAACL.

[72]  Luísa Coheur,et al.  From a Surface Analysis to a Dependency Structure , 2004 .

[73]  Andreas Rauber,et al.  Text Classification and Labelling of Document Clusters with Self-Organising Maps , 2000 .

[74]  Eugene Charniak,et al.  An Approach to Anaphoric Pronouns , 2000 .

[75]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[76]  Ramón Almela Pérez Procedimientos de formación de palabras en español , 1999 .

[77]  Ray A. Jarvis,et al.  Clustering Using a Similarity Measure Based on Shared Near Neighbors , 1973, IEEE Transactions on Computers.

[78]  Constantin Orasan,et al.  CAST: A computer-aided summarisation tool , 2003, EACL.

[79]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[80]  P. Laplace A Philosophical Essay On Probabilities , 1902 .

[81]  German Rigau,et al.  Spanish WordNet 1.6: Porting the Spanish Wordnet Across Princeton Versions , 2004, LREC.

[82]  Martha W. Evens,et al.  Spelling Correction using Context , 1998, ACL.

[83]  M. de Rijke,et al.  Comparing the Ambiguity Reduction Abilities of Probabilistic Context-Free Grammars , 2004, LREC.

[84]  Bruce W. Watson,et al.  Incremental construction of minimal acyclic finite state automata , 2000, CL.

[85]  German Rigau,et al.  Towards the Meaning Top Ontology: Sources of Ontological Meaning , 2004, LREC.

[86]  David D. McDonald Internal and External Evidence in the Identification and Semantic Categorization of Proper Names , 1993 .

[87]  Luísa Coheur,et al.  A step towards incremental generation of logical forms , 2004 .

[88]  Sergei Nirenburg,et al.  Lexical Acquisition with WordNet and the Mikrokosmos Ontology , 1998, WordNet@ACL/COLING.

[89]  F. Verdejo The web as a resource for WSD , 2003 .

[90]  James F. Allen,et al.  An architecture for more realistic conversational systems , 2001, IUI '01.

[91]  Zenón José Hernández Figueroa,et al.  FLAVER : reflexionador y lematizador automático de formas verbales , 1996 .

[92]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[93]  Thierry Dutoit,et al.  High-quality speech synthesis for phonetic speech segmentation , 1997, EUROSPEECH.

[94]  Ellen M. Voorhees,et al.  The Collection Fusion Problem , 1994, TREC.

[95]  Jean Véronis,et al.  Sense tagging: does it make sense? , 2001 .

[96]  Thierry Poibeau,et al.  Parsing natural language idioms with bidirectional finite-state machines , 2001, Theor. Comput. Sci..

[97]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[98]  Caroline Brun,et al.  Semantic Encoding of Electronic Documents , 2001 .

[99]  Hermann Ney,et al.  Confidence measures for statistical machine translation , 2003, MTSUMMIT.

[100]  David Yarowsky,et al.  Combining Classifiers for word sense disambiguation , 2002, Nat. Lang. Eng..

[101]  Joana Paulo Pardal,et al.  Using Morphological, Syntactical, and Statistical Information for Automatic Term Acquisition , 2002, PorTAL.

[102]  Satoshi Sekine,et al.  Japanese Named Entity Extraction Evaluation - Analysis of Results - , 2000, COLING.

[103]  Ralf Krestel,et al.  {Using Knowledge-poor Coreference Resolution for Text Summarization} , 2003 .

[104]  Daniel Gildea,et al.  The Necessity of Parsing for Predicate Argument Recognition , 2002, ACL.

[105]  William A. Gale,et al.  Good-Turing Smoothing Without Tears , 2001 .

[106]  Raymond J. D'Amore,et al.  One-time complete indexing of text: theory and practice , 1985, SIGIR '85.

[107]  Ted Pedersen,et al.  Combining Lexical and Syntactic Features for Supervised Word Sense Disambiguation , 2004, CoNLL.

[108]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[109]  Maria das Graças Volpe Nunes,et al.  GistSumm: A Summarization Tool Based on a New Extractive Method , 2003, PROPOR.

[110]  Bernardo Magnini,et al.  Integrating Subject Field Codes into WordNet , 2000, LREC.

[111]  George A. Miller,et al.  A Semantic Concordance , 1993, HLT.

[112]  João P. Campos Versus: a Web Data Repository with Time Support , 2003 .

[113]  Beata Kouchnir,et al.  A Memory-Based Approach for Semantic Role Labeling , 2004, CoNLL.

[114]  José B. Mariño,et al.  Language modeling using x-grams , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[115]  M Damashek,et al.  Gauging Similarity with n-Grams: Language-Independent Categorization of Text , 1995, Science.

[116]  Hae-Chang Rim,et al.  Two-Phase Semantic Role Labeling based on Support Vector Machines , 2004, CoNLL.

[117]  Sergi Cervell,et al.  An environment for mophosyntactic processing of unrestricted Spanish text , 1998 .

[118]  Roy E. Kimbrell,et al.  Searching for text? Send an N-gram] , 1988 .

[119]  Jean-Pierre Chanod,et al.  Incremental Finite-State Parsing , 1997, ANLP.

[120]  Nitesh V. Chawla,et al.  C4.5 and Imbalanced Data sets: Investigating the eect of sampling method, probabilistic estimate, and decision tree structure , 2003 .

[121]  Victoria J. Hodge,et al.  An Evaluation of Phonetic Spell Checkers , 2001 .

[122]  Elena Bajo Pérez,et al.  La derivación nominal en español , 1997 .

[123]  Scott Bennett,et al.  Evaluating Automated and Manual Acquisition of Anaphora Resolution Strategies , 1995, ACL.

[124]  Daniel Gildea,et al.  Identifying Semantic Roles Using Combinatory Categorial Grammar , 2003, EMNLP.

[125]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[126]  T. Katerina,et al.  Automatic Term Recognition using Contextual Cues , 1997 .

[127]  W. J. Hutchins Machine translation over fifty years , 2001 .

[128]  Richard Sproat,et al.  Identifying speakers in children's stories for speech synthesis , 2003, INTERSPEECH.

[129]  Luís C. Oliveira,et al.  DTW-based phonetic alignment using multiple acoustic features , 2003, INTERSPEECH.

[130]  John C. Henderson Exploiting Diversity for Natural Language Processing , 1998, AAAI/IAAI.

[131]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[132]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[133]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[134]  Walter Daelemans,et al.  Improving Accuracy in word class tagging through the Combination of Machine Learning Systems , 2001, CL.

[135]  Emmanuel Roche,et al.  Finite state transducers: parsing free and frozen sentences , 1999 .

[136]  Martin L. King,et al.  Towards a Methodology for Building Ontologies , 1995 .

[137]  Richard M. Schwartz,et al.  Hedge Trimmer: A Parse-and-Trim Approach to Headline Generation , 2003, HLT-NAACL 2003.

[138]  J. van Leeuwen,et al.  Automata Implementation , 2001, Lecture Notes in Computer Science.

[139]  Giacomo Ferrari,et al.  State of the art in Computational Linguistics , 2004 .

[140]  José João Almeida,et al.  jspell.pm: um módulo de análise morfológica para uso em processamento de linguagem natural , 2001 .

[141]  Tong Zhang,et al.  Named Entity Recognition through Classifier Combination , 2003, CoNLL.

[142]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[143]  Tatsuya Kawahara,et al.  Flexible Mixed-Initiative Dialogue Management using Concept-Level Confidence Measures of Speech Recognizer Output , 2000, COLING.

[144]  Emilio Sanchis Arnal,et al.  3LB-SAT: Una herramienta de anotación semántica , 2003, Proces. del Leng. Natural.

[145]  Roberto Pieraccini,et al.  Stochastic automata for language modeling , 1996, Comput. Speech Lang..

[146]  Eneko Agirre,et al.  The Basque Country University system: English and Basque tasks , 2004, SENSEVAL@ACL.

[147]  Hercules Dalianis,et al.  Evaluating a Spelling Support in a Search Engine , 2002, NLDB.

[148]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[149]  Rafael Muñoz,et al.  An Algorithm for Anaphora Resolution in Spanish Texts , 2001, CL.

[150]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[151]  Ted Pedersen,et al.  A Simple Approach to Building Ensembles of Naive Bayesian Classifiers for Word Sense Disambiguation , 2000, ANLP.

[152]  Vaibhava Goel,et al.  Minimum Bayes-risk automatic speech recognition , 2000, Comput. Speech Lang..

[153]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[154]  Philippe Flajolet,et al.  The analysis of hybrid trie structures , 1998, SODA '98.

[155]  Encarna Segarra,et al.  Extracting Semantic Information Through Automatic Learning Techniques , 2002, Int. J. Pattern Recognit. Artif. Intell..

[156]  Hermann Ney,et al.  The Alignment Template Approach to Statistical Machine Translation , 2004, CL.

[157]  Christiane Fellbaum,et al.  Analysis of a Hand-Tagging Task , 1997, Workshop On Tagging Text With Lexical Semantics: Why, What, And How?.

[158]  Emilio Sanchis Arnal,et al.  Uniclass and Multiclass Connectionist Classification of Dialogue Acts , 2003, CIARP.

[159]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[160]  John D. Lafferty,et al.  A Robust Parsing Algorithm for Link Grammars , 1995, IWPT.

[161]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[162]  Johnny Bigert Probabilistic Detection of Context-Sensitive Spelling Errors , 2004, LREC.

[163]  Michio Sugeno,et al.  Toward Everyday Language Computing - Computing from a Viewpoint of Linguistic Theory - , 1997, J. Adv. Comput. Intell. Intell. Informatics.

[164]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[165]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[166]  Maurice Gross,et al.  Lexicon - Grammar The Representation of Compound Words , 1986, COLING.

[167]  Eneko Agirre,et al.  Syntactic Features for High Precision Word Sense Disambiguation , 2002, COLING.

[168]  Walter Daelemans,et al.  Memory-based semantic role labeling: Optimizing features, algorithm, and output , 2004, CoNLL.

[169]  Henrik Eriksson,et al.  Using knowledge engineering support for a Java documentation viewer , 2002, SEKE '02.

[170]  H. Giles,et al.  Speech Accommodation Theory: The First Decade and Beyond , 1987 .

[171]  Vittorio Castelli,et al.  The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter , 1996, IEEE Trans. Inf. Theory.

[172]  E. Vajda Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet , 2000 .

[173]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[174]  Eneko Agirre,et al.  Word Sense Disambiguation using Conceptual Density , 1996, COLING.

[175]  W. Bruce Croft,et al.  Searching distributed collections with inference networks , 1995, SIGIR '95.

[176]  Inderjeet Mani,et al.  Another Sys Called Qanda , 2000, TREC.

[177]  Enrique Alfonseca,et al.  Description of the UAM system for generating very short summaries at DUC-2004 ∗ , 2003 .

[178]  David Madigan,et al.  On the Naive Bayes Model for Text Categorization , 2003, AISTATS.

[179]  Francisco Casacuberta,et al.  A MORPHOLOGICAL ANALYSER FOR MACHINE TRANSLATION BASED ON FINITE-STATE TRANSDUCERS , 2001 .

[180]  Daniel Gomes,et al.  A Characterization of the Portuguese Web , 2003 .

[181]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[182]  Robert Gaizauskas,et al.  Coreference Resolution in a Multilingual Information Extraction System , 1998 .

[183]  Mitchell P. Marcus,et al.  Maximum entropy models for natural language ambiguity resolution , 1998 .

[184]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[185]  Sanda M. Harabagiu,et al.  Knowledge-Lean Coreference Resolution and its Relation to Textual Cohesion and Coherence , 1999, Workshop On The Relation Of Discourse/Dialogue Structure And Reference.

[186]  Slava M. Katz Distribution of content words and phrases in text and language modelling , 1996, Natural Language Engineering.

[187]  Fred Karlsson,et al.  Constraint Grammar as a Framework for Parsing Running Text , 1990, COLING.

[188]  Justin Zobel,et al.  Phonetic string matching: lessons from information retrieval , 1996, SIGIR '96.

[189]  Frederick Jelinek,et al.  Interpolated estimation of Markov source parameters from sparse data , 1980 .

[190]  João Paulo da Silva Neto,et al.  The development of a multi-purpose spoken dialogue system , 2003, INTERSPEECH.

[191]  Nuno J. Mamede,et al.  Interpretations and Discourse Obligations in a Dialog System , 2003, PROPOR.

[192]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[193]  Carl Gutwin,et al.  KEA: practical automatic keyphrase extraction , 1999, DL '99.

[194]  Encarna Segarra,et al.  On the Use of the Morphic Generator Grammatical Inference (MGGI) Methodology in Automatic speech Recognition , 1990, Int. J. Pattern Recognit. Artif. Intell..

[195]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[196]  Robert Larose,et al.  « Méthodologie de l'évaluation des traductions » , 1998 .

[197]  V. Barnett,et al.  Applied Linear Statistical Models , 1975 .

[198]  Stephen E. Robertson,et al.  Experimentation as a way of life: Okapi at TREC , 2000, Inf. Process. Manag..

[199]  Khalil Sima'an,et al.  Backoff Parameter Estimation for the DOP Model , 2003, ECML.

[200]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[201]  Werner Winiwarter PETRA - the Personal Embedded Translation and Reading Assistant , 2004 .

[202]  Maximiliano Saiz-Noeda Influencia y aplicación de papeles sintácticos e información semántica en la resolución de la anáfora pronominal en español , 2002, Proces. del Leng. Natural.

[203]  Stanley F. Chen,et al.  Building Probabilistic Models for Natural Language , 1996, ArXiv.

[204]  Sandra M. Aluísio,et al.  Combining Multiple Classifiers to Improve Part of Speech Tagging : A Case Study for Brazilian Portuguese , 2000 .

[205]  Robert C. Holte,et al.  C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .

[206]  Bernard Mérialdo,et al.  Tagging English Text with a Probabilistic Model , 1994, CL.

[207]  David D. Lewis,et al.  Text categorization of low quality images , 1995 .

[208]  Yakov Malkiel El análisis genético de la formación de palabras , 1993 .

[209]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[210]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[211]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[212]  Xavier Carreras,et al.  Named Entity Extraction using AdaBoost , 2002, CoNLL.

[213]  Adriaan van Wijngaarden,et al.  Orthogonal design and description of a formal language , 1965 .

[214]  Lluís Màrquez i Villodre,et al.  MiniCors and Cast3LB: Two Semantically Tagged Spanish Corpora , 2004, LREC.

[215]  Rafael Muñoz,et al.  Multilayered Question Answering system applied to Temporality evaluation , 2004, Proces. del Leng. Natural.

[216]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[217]  J. Bentley,et al.  TERNARY SEARCH TREES , 1998 .

[218]  David R. Traum,et al.  Discourse Obligations in Dialogue Processing , 1994, ACL.

[219]  Namhee Kwon,et al.  Maximum Entropy Models for FrameNet Classification , 2003, EMNLP.

[220]  Ronald Rosenfeld,et al.  Statistical language modeling using the CMU-cambridge toolkit , 1997, EUROSPEECH.

[221]  Proceedings of the First Conference of the Association for Machine Translation in the Americas, AMTA 1994, Columbia, Maryland, USA, October 5-8, 1994 , 1994, AMTA.

[222]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[223]  Leon Davidson,et al.  Retrieval of misspelled names in an airlines passenger record system , 1962, CACM.

[224]  Ellen Riloff,et al.  Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[225]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[226]  Eric Brill,et al.  An Improved Error Model for Noisy Channel Spelling Correction , 2000, ACL.

[227]  Xavier Carreras,et al.  Hierarchical Recognition of Propositional Arguments with Perceptrons , 2004, CoNLL.

[228]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[229]  Dan Roth,et al.  Semantic Role Labeling Via Generalized Inference Over Classifiers , 2004, CoNLL.

[230]  Mark Johnson The DOP Estimation Method Is Biased and Inconsistent , 2002, Computational Linguistics.

[231]  Franz Josef Och,et al.  Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[232]  Andrew McCallum,et al.  Distributional clustering of words for text classification , 1998, SIGIR '98.

[233]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[234]  Joel R. Tetreault Analysis of Syntax-Based Pronoun Resolution Methods , 1999, ACL.

[235]  Kenneth Ward Church,et al.  Poisson mixtures , 1995, Natural Language Engineering.

[236]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[237]  Montserrat Civit Torruella Criterios de etiquetación y desambiguación morfosintáctica de corpus en español , 2003 .

[238]  Alex Kulesza,et al.  Confidence Estimation for Machine Translation , 2004, COLING.

[239]  Satoshi Sekine,et al.  Extended Named Entity Hierarchy , 2002, LREC.

[240]  Wai Lam,et al.  Evaluation Challenges in Large-Scale Document Summarization , 2003, ACL.

[241]  Johannes Matiasek The Generation of Idiomatic and Collocational Expressions , 1995 .

[242]  Walter Daelemans,et al.  Applying System Combination to Base Noun Phrase Identification , 2000, COLING.

[243]  Ted E. Dunning,et al.  Statistical Identification of Language , 1994 .

[244]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[245]  Lauri Karttunen,et al.  Finite-state lexicon compiler , 1993 .

[246]  James P. Callan,et al.  The effectiveness of query expansion for distributed information retrieval , 2001, CIKM '01.

[247]  Ellen M. Voorhees,et al.  Learning collection fusion strategies , 1995, SIGIR '95.

[248]  Carlo Strapparava,et al.  The "Meaning" system on the English all-words task , 2004, SENSEVAL@ACL.

[249]  Owen Rambow,et al.  Use of Deep Linguistic Features for the Recognition and Labeling of Semantic Arguments , 2003, EMNLP.

[250]  Schubert Foo,et al.  Ontology research and development. Part 1 - a review of ontology generation , 2002, J. Inf. Sci..

[251]  Elena M. Zamora,et al.  The use of trigram analysis for spelling error detection , 1981, Inf. Process. Manag..

[252]  Nigel Gilbert,et al.  Simulating speech systems , 1991 .

[253]  Alex Waibel,et al.  The CMU statistical machine translation system , 2003, MTSUMMIT.

[254]  L. Philips,et al.  Hanging on the metaphone , 1990 .

[255]  M. W. Du,et al.  A model and a fast algorithm for multiple errors spelling correction , 1992, Acta Informatica.

[256]  Thorsten Brants,et al.  TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[257]  John B. Smith,et al.  Author's Argumentation Assistant (AAA): A Hypertext-Based Authoring Tool for Argumentative Texts , 1992, ECHT.

[258]  Borja Navarro-Colorado,et al.  Issues in the Syntactic Annotation of Cast3LB , 2003, LINC@EACL.

[259]  Eneko Agirre,et al.  A Proposal for Word Sense Disambiguation using Conceptual Distance , 1995, ArXiv.

[260]  Stefan Stelian Diaconescu,et al.  Natural Language Understanding Using Generative Dependency Grammar , 2003 .

[261]  Christiane Fellbaum,et al.  Towards a Representation of Idioms in WordNet , 1998, WordNet@ACL/COLING.

[262]  Enrico Motta,et al.  Scholarly Discourse as Computable Structure , 2000, OHS-6/SC-2.

[263]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[264]  Edward P. Stabler,et al.  Minimalist grammars and recognition , 2000 .

[265]  Andrew McCallum,et al.  Using Maximum Entropy for Text Classification , 1999 .

[266]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[267]  Dragomir R. Radev,et al.  The University of Michigan at DUC 2004 , 2004 .

[268]  Jonathan D. Cohen,et al.  Highlights: Language- and Domain-Independent Automatic Indexing Terms for Abstracting , 1995, J. Am. Soc. Inf. Sci..

[269]  Daniel Jurafsky,et al.  Semantic Role Labeling by Tagging Syntactic Chunks , 2004, CoNLL.

[270]  Gerard Salton,et al.  Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[271]  Stefan Diaconescu.  Natural Language Agreement Description for Reversible Grammars , 2003, Australian Conference on Artificial Intelligence.

[272]  Roger Levy,et al.  A Generative Model for Semantic Role Labeling , 2003, ECML.

[273]  Tom M. Mitchell,et al.  Learning to Extract Symbolic Knowledge from the World Wide Web , 1998, AAAI/IAAI.

[274]  Sven C. Martin,et al.  Statistical Language Modeling Using Leaving-One-Out , 1997 .

[275]  Jorn Kreutel.  Modelling Questions and Assertions in Dialogue Using Obligations , 1999 .

[276]  Timothy Baldwin,et al.  Multiword Expressions: A Pain in the Neck for NLP , 2002, CICLing.

[277]  Mário J. Silva,et al.  The Case for a Portuguese Web Search Engine , 2003, ICWI.

[278]  Paolo Rosso,et al.  The upv-unige-CIAOSENSO WSD system , 2004, SENSEVAL@ACL.

[279]  Kristina Toutanova,et al.  Pronunciation Modeling for Improved Spelling Correction , 2002, ACL.

[280]  Stefan Diaconescu,et al.  Morphological Categorization Attribute Value Trees and XML , 2003, Intelligent Information Systems.

[281]  Gregor Erbach.  Head-Driven Lexical Representation of Idioms in HPSG , 1992 .

[282]  Thorsten Brants,et al.  Tagging the Teleman Corpus , 1995, ArXiv.

[283]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[284]  Harold L. Somers,et al.  An introduction to machine translation , 1992 .

[285]  Henrik Eriksson,et al.  Knowledge modeling at the millennium : The design and evolution of Protégé-2000 , 1999 .

[286]  Julian M. Kupiec,et al.  Robust part-of-speech tagging using a hidden Markov model , 1992 .

[287]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.

[288]  Branimir Boguraev,et al.  Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser , 1996, COLING.

[289]  Ronald L. Rivest,et al.  Learning decision lists , 1987, Machine Learning.

[290]  Joaquim Ferreira da Silva.  Extracting Multiword Terms from Document Collections , 1999 .

[291]  Wessel Kraaij,et al.  Headline extraction based on a combination of uni- and multidocument summarization techniques , 2002 .

[292]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[293]  Agata Savary.  Typographical Nearest-Neighbor Search in a Finite-State Lexicon and Its Application to Spelling Correction , 2001, CIAA.

[294]  Ruslan Mitkov,et al.  Robust Pronoun Resolution with Limited Knowledge , 1998, ACL.

[295]  G. Bès.  La phrase verbale noyau en français , 1999 .

[296]  David Yarowsky,et al.  One Sense Per Discourse , 1992, HLT.

[297]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[298]  Inderjeet Mani,et al.  Robust Temporal Processing of News , 2000, ACL.

[299]  Steve Pulman.  The recognition and interpretation of idioms , 1993 .

[300]  Luís Sarmento,et al.  O projecto AC/DC: acesso a corpora/disponibilização de corpora , 2003 .

[301]  Robert Sedgewick,et al.  Fast algorithms for sorting and searching strings , 1997, SODA '97.

[302]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[303]  Jianying Wang,et al.  A corpus analysis approach for automatic query expansion and its extension to multiple databases , 1999, TOIS.

[304]  Alex A. Freitas,et al.  Document Clustering and Text Summarization , 2000 .

[305]  George R. Doddington,et al.  Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics , 2002 .

[306]  David Traum,et al.  Speech Acts for Dialogue Agents , 1999 .

[307]  David R. Karger,et al.  Tackling the Poor Assumptions of Naive Bayes Text Classifiers , 2003, ICML.

[308]  Daniel Jurafsky,et al.  Support Vector Learning for Semantic Argument Classification , 2005, Machine Learning.

[309]  Christopher Culy,et al.  The limits of n-gram translation evaluation metrics , 2003, MTSUMMIT.

[310]  Campbell B. Read,et al.  Zipf's Law , 2004 .

[311]  Lluís Màrquez i Villodre,et al.  Senseval-3: The Spanish lexical sample task , 2004, SENSEVAL@ACL.

[312]  Joan-Andreu Sánchez,et al.  Consistency of Stochastic Context-Free Grammars From Probabilistic Estimation Based on Growth Transformations , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[313]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[314]  George A. Miller,et al.  Using Corpus Statistics and WordNet Relations for Sense Identification , 1998, CL.

[315]  Xavier Carreras,et al.  Introduction to the CoNLL-2004 Shared Task: Semantic Role Labeling , 2004, CoNLL.

[316]  Georgios Paliouras,et al.  Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach , 2000, ArXiv.

[317]  Claire Cardie,et al.  Noun Phrase Coreference as Clustering , 1999, EMNLP.

[318]  Lluís Màrquez i Villodre,et al.  TALP system for the English lexical sample task , 2004, SENSEVAL@ACL.

[319]  Yorick Wilks,et al.  University of Sheffield: description of the LaSIE system as used for MUC-6 , 1995, MUC.

[320]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[321]  Yorick Wilks,et al.  Word Sense Disambiguation using Optimised Combinations of Knowledge Sources , 1998, COLING-ACL.

[322]  関口 洋一,et al.  Web Corpus Construction with Quality Improvement , 2003 .

[323]  Allen R. Hanson,et al.  A Contextual Postprocessing System for Error Correction Using Binary n-Grams , 1974, IEEE Transactions on Computers.

[324]  Scott Cotton,et al.  SENSEVAL-2: Overview , 2001, *SEMEVAL.

[325]  Emmanuel Giguet.  Methode pour l'analyse automatique de structures formelles sur documents multilingues , 1998 .

[326]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[327]  Emmanuel Roche,et al.  Finite-State Language Processing , 1997 .

[328]  I. Good.  The Population Frequencies of Species and the Estimation of Population Parameters , 1953 .

[329]  Gabriel G. Bès,et al.  Properties in 5P , 2001 .

[330]  Eduard H. Hovy,et al.  The Automated Acquisition of Topic Signatures for Text Summarization , 2000, COLING.

[331]  James F. Allen,et al.  Applying Genetic Algorithms to Pronoun Resolution , 1999, AAAI/IAAI.

[332]  David Yarowsky,et al.  Minimally Supervised Induction of Grammatical Gender , 2003, HLT-NAACL.

[333]  Hae-Chang Rim,et al.  Semantic Role Labeling using Maximum Entropy Model , 2004, CoNLL.

[334]  Hermann Ney,et al.  Reversing and Smoothing the Multinomial Naive Bayes Text Classifier , 2002, PRIS.

[335]  David Yarowsky,et al.  Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French , 1994, ACL.

[336]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[337]  Ken Williams,et al.  Learning Transformation Rules for Semantic Role Labeling , 2004, CoNLL.

[338]  R. Mitkov,et al.  Coreference and anaphora: developing annotating tools, annotated resources and annotation strategies , 2000 .

[339]  William B. Dolan,et al.  Word Sense Ambiguation: Clustering Related Senses , 1994, COLING.

[340]  Claudio L. Lucchesi,et al.  Applications of finite automata representing large vocabularies , 1993, Softw. Pract. Exp..

[341]  Lucia Helena Machado Rino,et al.  SuPor: An Environment for AS of Texts in Brazilian Portuguese , 2004, EsTAL.

[342]  Darío Álvarez Gutiérrez,et al.  Naïve Algorithms for Keyphrase Extraction and Text Summarization from a Single Document Inspired by the Protein Biosynthesis Process , 2004, BioADIT.

[343]  David Yarowsky,et al.  Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs , 1992, ACL.

[344]  Joaquín A. García-Medall Villanueva.  Formaciones prefijales en español: morfología derivativa del verbo , 1992 .

[345]  Janyce Wiebe,et al.  Word-Sense Disambiguation Using Decomposable Models , 1994, ACL.

[346]  Ido Dagan,et al.  Syntax and lexical statistics in Anaphora , 1995, Appl. Artif. Intell..

[347]  Eric Brill,et al.  A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[348]  Zhiyi Chi,et al.  Estimation of Probabilistic Context-Free Grammars , 1998, Comput. Linguistics.

[350]  David Alex Lamb,et al.  Spelling correction in user interfaces , 1983, CACM.

[351]  鎌田 浩二.  Book review: Juan Uriagereka, Rhyme and Reason: An Introduction to Minimalist Syntax , 2000 .

[352]  Hermann Ney,et al.  Error Measures and Bayes Decision Rules Revisited with Applications to POS Tagging , 2004, EMNLP.

[353]  Stephen Huffman.  Acquaintance: Language-Independent Document Categorization by N-Grams , 1995, TREC.

[354]  Shigeo Abe.  Pattern Classification , 2001, Springer London.

[355]  Walter Daelemans,et al.  MBT: A Memory-Based Part of Speech Tagger-Generator , 1996, VLC@COLING.

[356]  Victoria J. Hodge,et al.  A Novel Binary Spell Checker , 2001, ICANN.

[357]  Giuseppe Valetto.  Idarex: Formal Description of Multi-word Lexemes with Regular Expressions , 1995 .

[358]  Liang Zhou,et al.  Headline Summarization at ISI , 2003 .

[359]  Breck Baldwin,et al.  CogNIAC: high precision coreference with limited knowledge and linguistic resources , 1997 .

[360]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[361]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002 .

[362]  David Yarowsky,et al.  Hierarchical Decision Lists for Word Sense Disambiguation , 2000, Comput. Humanit..

[363]  Maarten de Rijke,et al.  Alternative approaches for Generating Bodies of Grammar Rules , 2004, ACL.

[364]  Xavier Carreras,et al.  FreeLing: An Open-Source Suite of Language Analyzers , 2004, LREC.

[365]  Jason Eisner,et al.  Bilexical Grammars and their Cubic-Time Parsing Algorithms , 2000 .

[366]  Andreia Gentil Bonfante.  Uso de Redes Neurais para Correção Gramatical do Português: Um Estudo de Caso , 1997 .

[367]  Mark A. Musen,et al.  The Knowledge Model of Protégé-2000: Combining Interoperability and Flexibility , 2000, EKAW.

[368]  Philip Resnik,et al.  Supervised Sense Tagging using Support Vector Machines , 2001, *SEMEVAL.

[369]  James H. Dee.  A lexicon of Latin derivatives in Italian, Spanish, French, and English : a synoptic etymological thesaurus with full indices for each language , 1997 .

[370]  Marie-Francine Moens,et al.  K.U.Leuven summarization system - DUC 2003 , 2003 .

[371]  Chris Buckley,et al.  OHSUMED: an interactive retrieval evaluation and new large test collection for research , 1994, SIGIR '94.

[372]  Diana Santos,et al.  Evaluating CETEMPúblico, a Free Resource for Portuguese , 2001, ACL.

[373]  Ethem Alpaydın,et al.  Combined 5 x 2 cv F Test for Comparing Supervised Classification Learning Algorithms , 1999, Neural Comput..

[374]  Ah-Hwee Tan,et al.  Adaptive resonance associative map , 1995, Neural Networks.

[375]  Marc Rossler,et al.  Using Markov Models for Named Entity recognition in German newspapers , 2002 .

[376]  Sergio Ortiz Rojas,et al.  The Spanish↔Catalan machine translation system interNOSTRUM , 2001, MTSUMMIT.

[377]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[378]  José B. Mariño,et al.  Using x-gram for efficient speech recognition , 1998, ICSLP.

[379]  Enrico Motta,et al.  ScholOnto: an ontology-based digital library server for research documents and discourse , 2000, International Journal on Digital Libraries.

[380]  Rada Mihalcea,et al.  A Method for Word Sense Disambiguation of Unrestricted Text , 1999, ACL.

[381]  Enrique Vidal,et al.  Learning language models through the ECGI method , 1991, Speech Commun..

[382]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[383]  Youngjoong Ko,et al.  Automatic Text Categorization by Unsupervised Learning , 2000, COLING.

[384]  Dan Jurafsky,et al.  A Cognitive Model of Sentence Interpretation: the Construction Grammar approach , 1993 .

[385]  Frederique Segond,et al.  Using a Finite-State Based Formalism to Identify and Generate Multiword Expressions , 1995 .

[386]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[387]  Stan Matwin,et al.  Text Classification Using WordNet Hypernyms , 1998, WordNet@ACL/COLING.

[388]  Natalya F. Noy,et al.  Component-Based Support for Building Knowledge-Acquisition Systems , 2000 .

[389]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[390]  Kenneth R. Beesley,et al.  Language Identifier: A Computer Program for Automatic Natural-Language Identification of On-line Text , 1988 .

[391]  Andrei Popescu-Belis.  An experiment in comparative evaluation: humans vs. computers , 2003, MTSUMMIT.

[392]  Lucien Tesnière.  Éléments de syntaxe structurale , 1959 .

[393]  P. Greenfield.  Exploiting the model theory and proof theory of propositional logic in the microsystem approach to natural language processing , 1996 .

[394]  Yiming Yang,et al.  An Evaluation of Statistical Approaches to Text Categorization , 1999, Information Retrieval.

[395]  Michael Hoey,et al.  Patterns of Lexis In Text , 1991 .

[396]  Adam Kilgarriff,et al.  95% Replicability for Manual Word Sense Tagging , 1999, EACL.

[397]  Edward P. Stabler,et al.  Derivational Minimalism , 1996, LACL.

[398]  Sriram Raghavan,et al.  Searching the Web , 2001, ACM Trans. Internet Techn..

[399]  Thomas G. Dietterich.  Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[400]  Hwee Tou Ng,et al.  Exemplar-Based Word Sense Disambiguation: Some Recent Improvements , 1997, EMNLP.

[401]  Alan W. Black,et al.  Evaluating and correcting phoneme segmentation for unit selection synthesis , 2003, INTERSPEECH.

[402]  Frédéric Bimbot,et al.  Inference of variable-length acoustic units for continuous speech recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[403]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[404]  Enrique Vidal,et al.  Inference of k-Testable Languages in the Strict Sense and Application to Syntactic Pattern Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[405]  Elaine Rich,et al.  An Architecture for Anaphora Resolution , 1988, ANLP.

[406]  Adam Kilgarriff,et al.  Gold standard datasets for evaluating word sense disambiguation programs , 1998, Comput. Speech Lang..

[407]  André Kempe,et al.  Finite State Transducers Approximating Hidden Markov Models , 1997, ACL.

[408]  Ann Bies,et al.  The Penn Treebank: Annotating Predicate Argument Structure , 1994, HLT.

[409]  Josette Rebeyrolle,et al.  Construction d'une base de connaissances terminologiques à partir de textes : expérimentation et définition d'une méthode , 2000 .

[410]  Emilio Sanchis Arnal,et al.  Continuous speech understanding based on automatic learning of acoustic and semantic models , 1994, ICSLP.

[411]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[412]  David D. Lewis,et al.  A comparison of two learning algorithms for text categorization , 1994 .

[413]  Eric Brill,et al.  A corpus-based approach to language learning , 1993 .

[414]  Peter Weinstein,et al.  Seed ontologies: growing digital libraries as distributed, intelligent systems , 1997, DL '97.

[415]  Jean-Pierre Chanod,et al.  Robustness beyond shallowness: incremental deep parsing , 2002, Natural Language Engineering.

[416]  Frédéric Bimbot,et al.  Language modeling by variable length sequences: theoretical formulation and evaluation of multigrams , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[417]  Piek Vossen.  Building a Multilingual Database with Wordnets for European Languages , 1998 .

[418]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.