Natural Language Processing as a Foundation of the Semantic Web

The main argument of this paper is that Natural Language Processing (NLP) does, and will continue to, underlie the Semantic Web (SW), including its initial construction from unstructured sources like the World Wide Web (WWW), whether its advocates realise this or not. Chiefly, we argue, such NLP activity is the only way up to a defensible notion of meaning at conceptual levels (in the original SW diagram) based on lower level empirical computations over usage. Our aim is definitely not to claim logic-bad, NLP-good in any simple-minded way, but to argue that the SW will be a fascinating interaction of these two methodologies, again like the WWW (which has been basically a field for statistical NLP research) but with deeper content. Only NLP technologies (and chiefly information extraction) will be able to provide the requisite RDF knowledge stores for the SW from existing unstructured text databases in the WWW, and in the vast quantities needed. There is no alternative at this point, since a wholly or mostly hand-crafted SW is also unthinkable, as is a SW built from scratch and without reference to the WWW. We also assume that, whatever the limitations on current SW representational power we have drawn attention to here, the SW will continue to grow in a distributed manner so as to serve the needs of scientists, even if it is not perfect. The WWW has already shown how an imperfect artefact can become indispensable.

[1]  Ian Horrocks,et al.  Description Logics in Ontology Applications , 2005, KI.

[2]  Yorick Wilks,et al.  Information Extraction: Beyond Document Retrieval , 1998, Int. J. Comput. Linguistics Chin. Lang. Process..

[3]  Karen Sparck Jones Retrieving information or answering questions ? , 2006 .

[4]  Graeme Hirst,et al.  Context as a Spurious Concept , 1997, ArXiv.

[5]  Steffen Staab,et al.  Learning Taxonomic Relations from Heterogeneous Sources of Evidence , 2005 .

[6]  Richard M. Schwartz,et al.  Nymble: a High-Performance Learning Name-finder , 1997, ANLP.

[7]  Sanda M. Harabagiu,et al.  Answering Complex, List and Context Questions with LCC's Question-Answering Server , 2001, TREC.

[8]  David Marr,et al.  VISION A Computational Investigation into the Human Representation and Processing of Visual Information , 2009 .

[9]  Karen Sparck Jones What is the Role of NLP in Text Retrieval , 1999 .

[10]  Richard Granger,et al.  FOUL-UP: A Program that Figures Out Meanings of Words from Context , 1977, IJCAI.

[11]  Sergei Nirenburg,et al.  What's in a symbol: ontology, representation and language , 2001, J. Exp. Theor. Artif. Intell..

[12]  Shih-Hung Wu,et al.  SOAT: A Semi-Automatic Domain Ontology Acquisition Tool from Chinese Corpus , 2002, COLING.

[13]  Mark Sanderson,et al.  Improving cross language retrieval with triangulated translation , 2001, SIGIR '01.

[14]  Hugh Christopher Longuet-Higgins Review Lecture - The algorithmic description of natural language , 1972, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[15]  John Cocke,et al.  A Statistical Approach to Language Translation , 1988, COLING.

[16]  Marc Vilain,et al.  Validation of Terminological Inference in an Information Extraction Task , 1993, HLT.

[17]  C. K. Ogden,et al.  The Meaning of Meaning , 1923 .

[18]  Robert Stevens,et al.  Using OWL to model biological knowledge , 2007, Int. J. Hum. Comput. Stud..

[19]  Nicholas L. Henry,et al.  The future as information , 1973 .

[20]  P. Johnson-Laird Procedural semantics , 1977, Cognition.

[21]  Yorick Wilks,et al.  Can We Make Information Extraction More Adaptive , 1999 .

[22]  J. Wilkins An essay towards a real character, and a philosophical language, 1668 , 1968 .

[23]  Robert J. Gaizauskas,et al.  Using Coreference Chains for Text Summarization , 1999, COREF@ACL.

[24]  J. Pustejovsky The Language of Word Meaning: Type Construction and the Logic of Concepts , 2001 .

[25]  A. Polguère Electric words: Dictionaries, computers, and meanings , 1997 .

[26]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[27]  Sergei Nirenburg,et al.  Readings in Machine Translation , 2003 .

[28]  William A. Gerber The Internet in Britain 2009 , 2009 .

[29]  Yves Schabes,et al.  Deterministic Part-of-Speech Tagging with Finite-State Transducers , 1995, Comput. Linguistics.

[30]  Henri Bergson,et al.  Le rire : essai sur la signification du comique , 1936 .

[31]  Patrick J. Hayes,et al.  The Naive Physics Manifesto , 1990, The Philosophy of Artificial Intelligence.

[32]  Sergei Nirenburg,et al.  Two Types of Adaptive MT Environments , 1994, COLING.

[33]  H. Putnam IS SEMANTICS POSSIBLE , 1970 .

[34]  Alexiei Dingli,et al.  Mining web sites using adaptive information extraction , 2003 .

[35]  Harith Alani,et al.  Trust Strategies for the Semantic Web , 2004, Trust@ISWC.

[36]  Ian Horrocks,et al.  Using an Expressive Description Logic: FaCT or Fiction? , 1998, KR.

[37]  Eugene Charniak,et al.  Jack and Janet in Search of a Theory of Knowledge , 1973, IJCAI.

[38]  Drew McDermott,et al.  Artificial intelligence meets natural stupidity , 1976, SGAR.

[39]  Alexander Bird,et al.  Natural Kinds , 1988, Philosophy.

[40]  Wim Peters,et al.  Distribution-oriented Extension of WordNet's Ontological Framework , 2001 .

[41]  Maria Teresa Pazienza Information Extraction: Towards Scalable, Adaptable Systems , 1999 .

[42]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[43]  Ralph Grishman,et al.  Bootstrapped Learning of Semantic Classes from Positive and Negative Examples , 2003 .

[44]  M. Strevens Scientific Explanation , 2005 .

[45]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[46]  Yorick Wilks,et al.  Knowledge Structures and Language Boundaries , 1977, IJCAI.

[47]  Robin Collier,et al.  Automatic template creation for information extraction , 1998 .

[48]  Steffen Schulze-Kremer,et al.  The Ontology of the Gene Ontology , 2003, AMIA.

[49]  Kalina Bontcheva,et al.  Semantic Web Enabled, Open Source Language Technology , 2003 .

[50]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[51]  Geoffrey Leech,et al.  CLAWS4: The Tagging of the British National Corpus , 1994, COLING.

[52]  Alexiei Dingli,et al.  Learning to Harvest Information for the Semantic Web , 2004, ESWS.

[53]  L. A. Miller The Process of Question Answering - A Computer Simulation of Cognition , 1980, CL.

[54]  Yorick Wilks,et al.  Background and Foreground Knowledge in Dynamic Ontology Construction: Viewing Text as Knowledge Maintenance , 2003 .

[55]  Yolanda Gil,et al.  A survey of trust in computer science and the Semantic Web , 2007, J. Web Semant..

[56]  Céline Van Damme,et al.  FolksOntology : An Integrated Approach for Turning Folksonomies into Ontologies , 2007 .

[57]  W. Quine The two dogmas of empiricism , 1951 .

[58]  Jordan B. Pollack,et al.  Recursive Distributed Representations , 1990, Artif. Intell..

[59]  Hamish Cunningham,et al.  GATE - a TIPSTER-based General Architecture for Text Engineering , 1997 .

[60]  Robert Krovetz,et al.  More than One Sense Per Discourse , 1998 .

[61]  Ralph Grishman,et al.  Acquisition of Selectional Patterns , 1992, COLING.

[62]  J. Sowa The challenge of knowledge soup , 2006 .

[63]  John Bear,et al.  Using Information Extraction to Improve Document Retrieval , 1998, TREC.

[64]  Robert J. Gaizauskas,et al.  Mining On-line Sources for Definition Knowledge , 2004, FLAIRS.

[65]  Simon Foster,et al.  A Compositional Operational Semantics for OWL-S , 2005, EPEW/WS-FM.

[66]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[67]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[68]  Michael Colclough The Process of Question Answering — A Computer Simulation of Cognition , 1979 .

[69]  Fabio Ciravegna,et al.  (LP) 2 , an Adaptive Algorithm for Information Extraction from Web-related Texts , 2001 .

[70]  Jerry R. Hobbs The Generic Information Extraction System , 1993, MUC.

[71]  Frank van Harmelen,et al.  Reviewing the design of DAML+OIL: an ontology language for the semantic web , 2002, AAAI/IAAI.

[72]  Roberto Basili,et al.  Identification of Relevant Terms to Support the Construction of Domain Ontologies , 2001, HTLKM@ACL.

[73]  J J Gibson,et al.  What gives rise to the perception of motion? , 1968, Psychological review.

[74]  Marti A. Hearst,et al.  A Method for Re ning Automatically-Discovered Lexical Relations: Combining Weak Techniques for Stronger Results , 1992 .

[75]  John McCarthy,et al.  SOME PHILOSOPHICAL PROBLEMS FROM THE STANDPOINT OF ARTI CIAL INTELLIGENCE , 1987 .

[76]  Harry Halpin The Semantic Web: The Origins of Artificial Intelligence Redux , 2005 .

[77]  Steffen Staab,et al.  On How to Perform a Gold Standard Based Evaluation of Ontology Learning , 2006, SEMWEB.

[78]  John Haugeland,et al.  Artificial intelligence - the very idea , 1987 .

[79]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[80]  R. Ackermann Minnesota Studies in the Philosophy of Science , 1975 .

[81]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[82]  Bert F. Green,et al.  Baseball: an automatic question-answerer , 1899, IRE-AIEE-ACM '61 (Western).

[83]  Bijan Parsia,et al.  Pellet: An OWL DL Reasoner , 2004, Description Logics.

[84]  Yorick Wilks,et al.  Stone Soup and the French Room , 1994 .

[85]  Asunción Gómez-Pérez,et al.  Evaluation of ontologies , 2001, International Journal of Intelligent Systems.

[86]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[87]  Sergei Nirenburg,et al.  Toward full-text ontology-based word sense disambiguation , 2000 .

[88]  Christopher Arthur Brewster Mind the gap : bridging from text to ontological knowledge , 2008 .

[89]  Karen Spärck Jones,et al.  Information Retrieval and Artificial Intelligence , 1999, Artif. Intell..

[90]  Barry Smith,et al.  Formal ontology, common sense and cognitive science , 1995, Int. J. Hum. Comput. Stud..

[91]  Yorick Wilks,et al.  Preference Semantics, III-Formedness, and Metaphor , 1983, Am. J. Comput. Linguistics.

[92]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[93]  John Lafferty,et al.  Information retrieval as statistical translation , 1999, SIGIR 1999.

[94]  Yorick Wilks,et al.  Data Driven Ontology Evaluation , 2004, LREC.

[95]  Claire Cardie,et al.  Empirical Methods in Information Extraction , 1997, AI Mag..

[96]  David Yarowsky,et al.  Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora , 2010, COLING.

[97]  M. Kendall,et al.  The Logic of Scientific Discovery. , 1959 .

[98]  Frank M. Shipman,et al.  Which semantic web? , 2003, HYPERTEXT '03.

[99]  John I. Tait,et al.  Charting a new course : natural language processing and information retrieval : essays in honour of Karen Spärck Jones , 2005 .

[100]  Maurice Gross,et al.  On the Equivalence of Models of Language used in the Fields of Mechanical Translation and Information Retrieval , 1964, EARLYMT.

[101]  Ani Nenkova,et al.  Automatic Summarization , 2011, ACL.

[102]  Eric Brill,et al.  Some Advances in Transformation-Based Part of Speech Tagging , 1994, AAAI.

[103]  Ziqi Zhang,et al.  Dynamic iterative ontology learning , 2007 .

[104]  Robert Stevens,et al.  e-Science and biological pathway semantics , 2007, BMC Bioinformatics.

[105]  Marti A. Hearst,et al.  Refining Automatically-Discovered Lexical Relations: Combining Weak Techniques for Stronger Results , 1992 .

[106]  Nicola Guarino,et al.  Restructuring WordNet's Top-Level: The OntoClean approach , 2002 .

[107]  James A. Hendler,et al.  Knowledge Is Power: A View from the Semantic Web , 2005, AI Mag..

[108]  Yorick Wilks,et al.  Combining Weak Knowledge Sources for Sense Disambiguation , 1999, IJCAI.

[109]  Eelco Herder An Analysis of User Behavior on the Web - Understanding the Web and Its Users , 2007 .

[110]  Karen Spärck Jones What's new about the Semantic Web?: some questions , 2004, SIGF.

[111]  John D. Lafferty,et al.  Computation of the Probability of Initial Substring Generation by Stochastic Context-Free Grammars , 1991, Comput. Linguistics.

[112]  Jordan B. Peterson The Meaning of Meaning , 2007 .

[113]  Yorick Wilks,et al.  Ontologies, taxonomies, thesauri:learning from texts , 2004 .

[114]  Ellen Riloff,et al.  Automatically Acquiring Conceptual Patterns without an Annotated Corpus , 1995, VLC@ACL.

[115]  K. Popper,et al.  The Logic of Scientific Discovery , 1960 .

[116]  William A. Woods,et al.  What's in a Link: Foundations for Semantic Networks , 1975 .

[117]  Wendy Grace Lehnert,et al.  The Process of Question Answering , 2022 .

[118]  Nathalie Bely,et al.  Procédures d'analyse sémantique appliquées à la documentation scientifique , 1970 .

[119]  Sergei Nirenburg Pangloss: A Machine Translation Project , 1994, HLT.

[120]  Shao Fen Liang,et al.  Using query term order for result summarisation , 2005, SIGIR '05.

[121]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[122]  Eric Atwell,et al.  Generic template for the evaluation of dialogue management systems , 1997, EUROSPEECH.

[123]  Nicola Guarino,et al.  Some Ontological Principles for Designing Upper Level Lexical Resources , 1998, LREC.

[124]  Anne Kao,et al.  Natural Language Processing and Text Mining , 2006 .

[125]  Hans Uszkoreit,et al.  Language, Cohesion and Form (Studies in Natural Language Processing) , 2005 .

[126]  Dominique Estival,et al.  Karen Sparck Jones & Julia R. Galliers, Evaluating Natural Language Processing Systems: An Analysis and Review. Lecture Notes in Artificial Intelligence 1083 , 1998, Machine Translation.

[127]  Gerald DeJong Prediction and substantiation: A new approach to natural language processing , 1979 .

[128]  Piek Vossen Introduction to EuroWordNet , 1998 .

[129]  Yorick Wilks,et al.  Text Searching with Templates , 2007 .

[130]  Yorick Wilks,et al.  Book Reviews: Electric Words: Dictionaries, Computers, and Meanings , 1996, CL.

[131]  Mark Smith,et al.  University of Durham: description of the LOLITA system as used in MUC-6 , 1995, MUC.

[132]  Ken Samuel,et al.  Dialogue Act Tagging with Transformation-Based Learning , 1998, ACL.

[133]  Douglas B. Lenat,et al.  CYC: Using Common Sense Knowledge to Overcome Brittleness and Knowledge Acquisition Bottlenecks , 1986, AI Mag..

[134]  Jin Wang,et al.  A Self-Learning Universal Concept Spotter , 1996, COLING.

[135]  Gregory Grefenstette,et al.  The World Wide Web as a Resource for Example-Based Machine Translation Tasks , 1999, TC.

[136]  James A. Hendler,et al.  A Framework for Web Science , 2006, Found. Trends Web Sci..

[137]  Mark Sanderson,et al.  Improving Cross Language Information Retrieval with Triangulated Translation. , 2001, SIGIR 2002.

[138]  Frederick Jelinek,et al.  Exploiting Syntactic Structure for Language Modeling , 1998, ACL.

[139]  Roger W. Schvaneveldt,et al.  Pathfinder associative networks: studies in knowledge organization , 1990 .

[140]  Wolfgang Nedobity Terminology and artificial intelligence , 1985 .

[141]  Terry Winograd,et al.  Understanding natural language , 1974 .

[142]  Charles F. Goldfarb,et al.  SGML: the reason why and the first published hint , 1997 .

[143]  Gerard Salton,et al.  A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) , 1972, J. Am. Soc. Inf. Sci..

[144]  Michael L. Mauldin,et al.  Retrieval performance in Ferret a conceptual information retrieval system , 1991, SIGIR '91.

[145]  Yorick Wilks,et al.  Designing Adaptive Information Extraction for the Semantic Web in Amilcare , 2003 .

[146]  Daniel G. Bobrow,et al.  On Overview of KRL, a Knowledge Representation Language , 1976, Cogn. Sci..

[147]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[148]  Eduard H. Hovy Toward Large-Scale Shallow Semantics for Higher-Quality NLP , 2005, CLIN.

[149]  Claire Cardie,et al.  University of Massachusetts: Description of the CIRCUS System as Used for MUC-4 , 1992, MUC.

[150]  Khurshid Ahmad,et al.  Pragmatics of Specialist Terms: The Acquisition and Representation of Terminology , 1993, EAMT Workshop.

[151]  Jerry R. Hobbs Ontological Promiscuity , 1985, ACL.

[152]  Alexander A. Morgan,et al.  Gene name identification and normalization using a model organism database , 2004, J. Biomed. Informatics.

[153]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[154]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[155]  Olivier Bodenreider,et al.  Session Introduction , 2005, Pacific Symposium on Biocomputing.

[156]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[157]  E. Doerr,et al.  General Semantics. , 1958, Science.

[158]  Terry Winograd,et al.  Understanding computers and cognition - a new foundation for design , 1987 .

[159]  W. Bruce Croft,et al.  Resolving ambiguity for cross-language retrieval , 1998, SIGIR '98.

[160]  Yorick Wilks,et al.  A Closer Look at Skip-gram Modelling , 2006, LREC.

[161]  Hans Peter Luhn,et al.  A Statistical Approach to Mechanized Encoding and Searching of Literary Information , 1957, IBM J. Res. Dev..

[162]  E. Riloff,et al.  Automated dictionary construction for information extraction from text , 1993, Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications.

[163]  Stephen Muggleton,et al.  Bayesian inductive logic programming , 1994, COLT '94.

[164]  W. J. Hutchins,et al.  LINGUISTIC PROCESSES IN THE INDEXING AND RETRIEVAL OF DOCUMENTS , 1970 .

[165]  Yorick Wilks,et al.  A Retrospective View of Synonymy and Semantic Classification , 2005 .

[166]  Jin Yang,et al.  The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval , 1998 .

[167]  Ralph Grishman,et al.  NYU: Description of the MENE Named Entity System as Used in MUC-7 , 1998, MUC.

[168]  Ellen Helsper,et al.  Internet in Britain: 2007 , 2007 .

[169]  T. Kuhn,et al.  The Structure of Scientific Revolutions. , 1964 .

[170]  Alexiei Dingli,et al.  Mining Web Sites Using Unsupervised Adaptive Information Extraction , 2003, EACL.

[171]  Nicola Guarino,et al.  Conceptual analysis of lexical taxonomies: the case of WordNet top-level , 2001, FOIS.

[172]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[173]  Roger C. Schank,et al.  Conceptual dependency: A theory of natural language understanding , 1972 .

[174]  P. Resnik Selectional constraints: an information-theoretic model and its computational realization , 1996, Cognition.

[175]  Yorick Wilks,et al.  Frames, Semantics and Novelty , 1979 .

[176]  Yorick Wilks,et al.  Knowledge acquisition for knowledge management: position paper , 2001 .

[177]  Jorge Morato,et al.  WordNet Applications , 2004 .

[178]  Peter F. Patel-Schneider,et al.  Three theses of representation in the semantic web , 2003, WWW '03.

[179]  Thomas R. Gruber,et al.  Collective knowledge systems: Where the Social Web meets the Semantic Web , 2008, J. Web Semant..

[180]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[181]  Jian-Yun Nie,et al.  A retrieval model based on an extended modal logic and its application to the RIME experimental approach , 1989, SIGIR '90.

[182]  Naomi Sager,et al.  Natural Language Information Processing: A Computer Grammar of English and Its Applications , 1980 .

[183]  Yorick Wilks,et al.  Is There Progress on Talking Sensibly to Machines? , 2007, Science.

[184]  Mark T. Maybury,et al.  Automatic Summarization , 2002, Computational Linguistics.

[185]  Peter Mark Roget Thesaurus of English Words and Phrases: So Classified and Arranged as to Facilitate the Expression of Ideas and Assist in Literary Composition , 2009 .

[186]  J. Fodor,et al.  Connectionism and the problem of systematicity: Why Smolensky's solution doesn't work , 1990, Cognition.

[187]  Karen Sparck Jones,et al.  Book Reviews: Evaluating Natural Language Processing Systems: An Analysis and Review , 1996, CL.

[188]  Diana Maynard,et al.  JAPE: a Java Annotation Patterns Engine , 2000 .

[189]  Rong Wang,et al.  CRL/Brandeis: The Diderot System , 1993, TIPSTER.