Natural language processing for semiautomatic semantics extractio: encyclopedic entry disambiguation and relationship extraction using wikipedia and wordnet

Tesis doctoral inedita. Universidad Autonoma de Madrid, Escuela Politecnica Superior, septiembre de 2009

[1]  Ruslan Mitkov,et al.  The Oxford handbook of computational linguistics , 2003 .

[2]  Ellen Riloff,et al.  Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs , 2008, ACL.

[3]  Cecile Paris,et al.  Virtual Museums on the Information Superhighway: Prospects and Potholes , 1998 .

[4]  Marti A. Hearst Noun Homograph Disambiguation Using Local Context in Large Text Corpora , 1991 .

[5]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[6]  Ellen Riloff,et al.  Automatically Generating Extraction Patterns from Untagged Text , 1996, AAAI/IAAI, Vol. 2.

[7]  Partha Pratim Talukdar,et al.  Weakly-Supervised Acquisition of Labeled Class Instances using Graph Random Walks , 2008, EMNLP.

[8]  Lucy Vanderwende,et al.  MindNet: Acquiring and Structuring Semantic Information from Text , 1998, COLING-ACL.

[9]  Maria Ruiz-Casado,et al.  Automatic Acquisition of Semantics from Text for Semantic Work Environments , 2008 .

[10]  James L. McClelland,et al.  An interactive activation model of context effects in letter perception: I. An account of basic findings. , 1981 .

[11]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[12]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[13]  Cristina Nicolae,et al.  BESTCUT: A Graph Algorithm for Coreference Resolution , 2006, EMNLP.

[14]  Ted Pedersen,et al.  Name Discrimination and Email Clustering using Unsupervised Clustering and Labeling of Similar Contexts , 2005, IICAI.

[15]  Rada Mihalcea,et al.  Using Wikipedia for Automatic Word Sense Disambiguation , 2007, NAACL.

[16]  Dan Roth,et al.  Semantic Integration in Text: From Ambiguous Names to Identifiable Entities , 2005, AI Mag..

[17]  Andrei Mikheev,et al.  Periods, Capitalized Words, etc. , 2002, CL.

[18]  German Rigau Automatic Acquisition of Lexical Knowl-edge from MRDs , 1998 .

[19]  Mitchell P. Marcus,et al.  Maximum entropy models for natural language ambiguity resolution , 1998 .

[20]  Chuck Rieger,et al.  Parsing and comprehending with word experts (a theory and its realization) , 1982 .

[21]  Hideki Isozaki,et al.  Efficient Support Vector Classifiers for Named Entity Recognition , 2002, COLING.

[22]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[23]  David Yarowsky,et al.  Unsupervised Personal Name Disambiguation , 2003, CoNLL.

[24]  Zornitsa Kozareva,et al.  Combining data-driven systems for improving Named Entity Recognition , 2005, Data Knowl. Eng..

[25]  Marius Pasca,et al.  Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds , 2007, WWW '07.

[26]  Michel C. A. Klein,et al.  Ontology versioning on the Semantic Web , 2001, SWWS.

[27]  B. Hammond Ontology , 2004, Lawrence Booth’s Book of Visions.

[28]  Kalina Bontcheva,et al.  Using Uneven Margins SVM and Perceptron for Information Extraction , 2005, CoNLL.

[29]  Daniel S. Weld,et al.  Using Wikipedia to bootstrap open information extraction , 2009, SGMD.

[30]  Yorick Wilks Right Attachment and Preference Semantics , 1985, EACL.

[31]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[32]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[33]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[34]  Jean Véronis,et al.  MACHINE READABLE DICTIONARIES: WHAT HAVE WE LEARNED, WHERE DO WE GO? , 1999 .

[35]  Maria Ruiz-Casado,et al.  Automatic inference of word meaning using phonosemantic patterns , 2002 .

[36]  Christian Posse,et al.  PNNL: A Supervised Maximum Entropy Approach to Word Sense Disambiguation , 2007, SemEval@ACL.

[37]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[38]  Atanas Kiryakov,et al.  Towards Semantic Web Information Extraction , 2003 .

[39]  Joakim Nivre,et al.  Deterministic Dependency Parsing of English Text , 2004, COLING.

[40]  Bernardo Magnini,et al.  Integrating Subject Field Codes into WordNet , 2000, LREC.

[41]  Mitchell P. Marcus,et al.  OntoNotes: The 90% Solution , 2006, NAACL.

[42]  George A. Miller,et al.  A Semantic Concordance , 1993, HLT.

[43]  Paola Velardi,et al.  The Usable Ontology: An Environment for Building and Assessing a Domain Ontology , 2002, SEMWEB.

[44]  Ralph Grishman,et al.  Extracting Relations with Integrated Information Using Kernel Methods , 2005, ACL.

[45]  Manabu Okumura,et al.  Information Extraction and Semantic Annotation of Wikipedia , 2008, Ontology Learning and Population.

[46]  Yuji Matsumoto,et al.  Lexical Knowledge Acquisition , 2005 .

[47]  Tobias Hawker USYD: WSD and Lexical Substitution using the Web1T corpus , 2007, SemEval@ACL.

[48]  Yorick Wilks,et al.  Providing machine tractable dictionary tools , 1990, Machine Translation.

[49]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[50]  Maria Ruiz-Casado,et al.  Automatic Assignment of Wikipedia Encyclopedic Entries to WordNet Synsets , 2005, AWIC.

[51]  Maria Ruiz-Casado,et al.  Automatising the learning of lexical patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia , 2007, Data Knowl. Eng..

[52]  Marc Moens,et al.  Named Entity Recognition without Gazetteers , 1999, EACL.

[53]  Kentaro Torisawa,et al.  Automatic Discovery of Attribute Words from Web Documents , 2005, IJCNLP.

[54]  Maria Ruiz-Casado,et al.  Automatic Extraction of Semantic Relationships for WordNet by Means of Pattern Learning from Wikipedia , 2005, NLDB.

[55]  Ian H. Witten,et al.  Mining Meaning from Wikipedia , 2008, Int. J. Hum. Comput. Stud..

[56]  George A. Miller,et al.  Using a Semantic Concordance for Sense Identification , 1994, HLT.

[57]  Brian M. Slator,et al.  Providing machine tractable dictionary tools , 1990 .

[58]  Gang Wang,et al.  Enhancing Relation Extraction by Eliciting Selectional Constraint Features from Wikipedia , 2007, NLDB.

[59]  Hwee Tou Ng,et al.  Corpus-Based Approaches to Semantic Interpretation in Natural Language Processing , 1997 .

[60]  Le Sun,et al.  Study of Kernel-Based Methods for Chinese Relation Extraction , 2008, AIRS.

[61]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[62]  David Yarowsky,et al.  One Sense per Collocation , 1993, HLT.

[63]  Ian H. Witten,et al.  Topic indexing with Wikipedia , 2008 .

[64]  M. Magnus What's in a word? : Studies in phonosemantics , 2001 .

[65]  M. Ross Quillian,et al.  The teachable language comprehender: a simulation program and theory of language , 1969, CACM.

[66]  Rada Mihalcea,et al.  PageRank on Semantic Networks, with Application to Word Sense Disambiguation , 2004, COLING.

[67]  Feiyu Xu,et al.  A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping , 2002, LREC.

[68]  Claire Cardie,et al.  Evaluating an Information Extraction System , 1994 .

[69]  Patrick Pantel,et al.  Ontologizing Semantic Relations , 2006, ACL.

[70]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[71]  Mitsuru Ishizuka,et al.  Exploiting Syntactic and Semantic Information for Relation Extraction from Wikipedia , 2006 .

[72]  Adam Kilgarriff,et al.  Introduction to the Special Issue on SENSEVAL , 2000, Comput. Humanit..

[73]  Aurélie Herbelot,et al.  Acquiring Ontological Relationships from Wikipedia Using RMRS , 2006 .

[74]  Harald Trost Computational Morphology , 2003 .

[75]  Christine D. Piatko,et al.  Named Entity Recognition using Hundreds of Thousands of Features , 2003, CoNLL.

[76]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[77]  Dan I. Moldovan,et al.  Domain-Specific Knowledge Acquisition and Classification Using WordNet , 2000, FLAIRS Conference.

[78]  Hwee Tou Ng,et al.  Supervised Word Sense Disambiguation with Support Vector Machines and multiple knowledge sources , 2004, SENSEVAL@ACL.

[79]  ChengXiang Zhai,et al.  A Systematic Exploration of the Feature Space for Relation Extraction , 2007, NAACL.

[80]  Aldo Gangemi,et al.  Ontology Learning and Its Application to Automated Terminology Translation , 2003, IEEE Intell. Syst..

[81]  David W. Embley,et al.  Peppering knowledge sources with SALT: Boosting conceptual content for ontology generation , 2002 .

[82]  David Yarowsky,et al.  Multi-Field Information Extraction and Cross-Document Fusion , 2005, ACL.

[83]  George A. Miller,et al.  WordNet 2 - A Morphologically and Semantically Enhanced Resource , 1999 .

[84]  Marc Moens,et al.  Description of the LTG System Used for MUC-7 , 1998, MUC.

[85]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[86]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[87]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[88]  Stephen Soderland,et al.  Learning Information Extraction Rules for Semi-Structured and Free Text , 1999, Machine Learning.

[89]  Adam Kilgarriff,et al.  SENSEVAL: an exercise in evaluating world sense disambiguation programs , 1998, LREC.

[90]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[91]  Carlo Strapparava,et al.  Pattern abstraction and term similarity for Word Sense Disambiguation: IRST at Senseval-3 , 2004 .

[92]  Rada Mihalcea,et al.  A Method for Word Sense Disambiguation of Unrestricted Text , 1999, ACL.

[93]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[94]  Benjamin Van Durme,et al.  Finding Cars, Goddesses and Enzymes: Parametrizable Acquisition of Labeled Instances for Open-Domain Information Extraction , 2008, AAAI.

[95]  Paola Velardi,et al.  Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites , 2004, CL.

[96]  Dmitri V. Kalashnikov,et al.  Disambiguation Algorithm for People Search on the Web , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[97]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[98]  Chung Hee Hwang,et al.  Incompletely and Imprecisely Speaking: Using Dynamic Ontologies for Representing and Retrieving Information , 1999, KRDB.

[99]  Raphael Volz,et al.  The Ontology Extraction & Maintenance Framework Text-To-Onto , 2001 .

[100]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[101]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[102]  Michael Sussna,et al.  Word sense disambiguation for free-text indexing using a massive semantic network , 1993, CIKM '93.

[103]  Mitsuru Ishizuka,et al.  Relation Extraction from Wikipedia Using Subtree Mining , 2007, AAAI.

[104]  Martha Palmer,et al.  SemEval-2007 Task-17: English Lexical Sample, SRL and All Words , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[105]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[106]  Feng Luo,et al.  Ontology construction for information selection , 2002, 14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings..

[107]  Adrian Novischi Accurate Semantic Annotations via Pattern Matching , 2002, FLAIRS Conference.

[108]  Benjamin Van Durme,et al.  What You Seek Is What You Get: Extraction of Class Attributes from Query Logs , 2007, IJCAI.

[109]  Maria Ruiz-Casado,et al.  From Wikipedia to Semantic Relationships: a Semi-automated Annotation Approach , 2006, SemWiki.

[110]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[111]  D. Tufis,et al.  BalkaNet : Aims , Methods , Results and Perspectives . A General Overview , 2004 .

[112]  Gerhard Weikum,et al.  LEILA: Learning to Extract Information by Linguistic Analysis , 2006, OntologyLearning@COLING/ACL.

[113]  Rada Mihalcea,et al.  UNT-Yahoo: SuperSenseLearner: Combining SenseLearner with SuperSense and other Coarse Semantic Features , 2007, SemEval@ACL.

[114]  Steffen Staab,et al.  Word classification based on combined measures of distributional and semantic similarity , 2003, EACL.

[115]  Philip Resnik,et al.  Disambiguating Noun Groupings with Respect to Wordnet Senses , 1995, VLC@ACL.

[116]  Markus Krötzsch,et al.  Semantic Wikipedia , 2006, WikiSym '06.

[117]  Steffen Staab,et al.  Discovering Conceptual Relations from Text , 2000, ECAI.

[118]  Raphael Volz,et al.  Semi-automatic Ontology Acquisition from a Corporate Intranet , 2000 .

[119]  Daniel Jurafsky,et al.  Semantic Taxonomy Induction from Heterogenous Evidence , 2006, ACL.

[120]  Christiane Fellbaum,et al.  Lexical Chains as Representations of Context for the Detection and Correction of Malapropisms , 1998 .

[121]  Nanda Kambhatla,et al.  Combining Lexical, Syntactic, and Semantic Features with Maximum Entropy Models for Information Extraction , 2004, ACL.

[122]  Dan Klein,et al.  Named Entity Recognition with Character-Level Models , 2003, CoNLL.

[123]  Andreas Wagner,et al.  Enriching a lexical semantic net with selectional preferences by means of statistical corpus analysis , 2000, ECAI Workshop on Ontology Learning.

[124]  Claude Roux,et al.  An Ontology Enrichment Method for a Pragmatic Information Extraction System gathering Data on Genetic Interactions , 2000, ECAI Workshop on Ontology Learning.

[125]  Marti A. Hearst Automated Discovery of WordNet Relations , 2004 .

[126]  Satoshi Sekine,et al.  On-Demand Information Extraction , 2006, ACL.

[127]  Giuseppe Attardi,et al.  Semantically Annotated Snapshot of the English Wikipedia , 2008, LREC.

[128]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[129]  Slava M. Katz,et al.  Technical terminology: some linguistic properties and an algorithm for identification in text , 1995, Natural Language Engineering.

[130]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[131]  James A. Hendler,et al.  Agents and the Semantic Web , 2001, IEEE Intell. Syst..