Cross-language Ontology Learning

A formal ontology does not contain lexical knowledge; it is by nature language-independent. Mappings can be added between the ontology and, arbitrarily, many lexica in any number of languages. The result of this operation is what is here referred to as a cross-language ontology. A cross-language ontology can be a useful resource for machine translation or cross-language information retrieval. This chapter focuses on ways of automatically building an ontology by exploiting cross-language information from parallel corpora. The goal is to improve the automatic learning results compared to learning an ontology from resources in a single language. The authors present a framework for cross-language ontology learning, providing a setting in which cross-language evidence (data) can be integrated and quantified. The aim is to investigate the following question: Can cross-language data teach us more than data from a single language for the ontology learning task?

[1]  Eugene Albert Nida,et al.  A Componential Analysis of Meaning: An Introduction to Semantic Structures , 1979 .

[2]  Hans Hjelm,et al.  Identifying Cross Language Term Equivalents Using Statistical Machine Translation and Distributional Association Measures , 2007, NODALIDA.

[3]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[4]  Yorick Wilks The "Fodor"-FODOR fallacy bites back , 1999, ArXiv.

[5]  Christian Jacquemin,et al.  Spotting and Discovering Terms through Natural Language Processing , 1997 .

[6]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[7]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[8]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[9]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[10]  Jörg Tiedemann,et al.  Optimization of word alignment clues , 2005, Natural Language Engineering.

[11]  Fabio Rinaldi,et al.  A symbolic approach to automatic multiword term structuring , 2005, Comput. Speech Lang..

[12]  Jing Jin,et al.  Towards a weighted-tree similarity algorithm for RNA secondary structure comparison , 2005, Eighth International Conference on High-Performance Computing in Asia-Pacific Region (HPCASIA'05).

[13]  Steffen Staab,et al.  Ontology-based text clustering , 2001, IJCAI 2001.

[14]  José Gabriel Pereira Lopes,et al.  Extracting Equivalents from Aligned Parallel Texts: Comparison of Measures of Similarity , 2000, IBERAMIA-SBIA.

[15]  John Sinclair The Empty Lexicon , 1996 .

[16]  I. Dan Melamed,et al.  Models of translation equivalence among words , 2000, CL.

[17]  Olivier Bodenreider,et al.  Lexically-suggested hyponymic relations among medical terms and their representation in the UMLS , 2001 .

[18]  Peter Koch,et al.  Lexical typology from a cognitive and linguistic point of view , 2001 .

[19]  Dominic Abrams,et al.  Language, Speech, and Communication , 2006 .

[20]  Stephan Bloehdorn,et al.  Learning Ontologies to Improve Text Clustering and Classification , 2005, GfKl.

[21]  Leif Grönqvist Exploring Latent Semantic Vector Models Enriched With N-grams , 2006 .

[22]  Michel Simard,et al.  Multialignement vs bialignement : à plusieurs, c’est mieux ! , 2015, JEPTALNRECITAL.

[23]  Graham Priest,et al.  Willard Van Orman Quine , 1979 .

[24]  Dominic Widdows A Mathematical Model for Context and Word-Meaning , 2003, CONTEXT.

[25]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[26]  Gábor Nagypál,et al.  Possibly imperfect ontologies for effective information retrieval , 2007 .

[27]  Hans Hjelm,et al.  Cross-language Ontology Learning: Incorporating and Exploiting Cross-language Data in the Ontology Learning Process , 2009 .

[28]  Philip Resnik,et al.  Exploiting Hidden Meanings: Using Bilingual Text for Monolingual Annotation , 2004, CICLing.

[29]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[30]  Karen Sparck Jones A statistical interpretation of term specificity and its application in retrieval , 1972 .

[31]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[32]  Q. Mcnemar Note on the sampling error of the difference between correlated proportions or percentages , 1947, Psychometrika.

[33]  Helena Ahonen-Myka,et al.  Simple Semantics in Topic Detection and Tracking , 2004, Information Retrieval.

[34]  Pascale Fung,et al.  Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora , 2005, IJCNLP.

[35]  Martin Volk,et al.  The length factor in automatic bilingual terminology extraction , 2002 .

[36]  Christoph Schwarz,et al.  LiSa-morphological analysis for information retrieval , 2005, NODALIDA.

[37]  Paul Buitelaar,et al.  Ontology-based Information Extraction with SOBA , 2006, LREC.

[38]  Reinhard Rapp,et al.  Automatic Identification of Word Translations from Unrelated English and German Corpora , 1999, ACL.

[39]  Paul Buitelaar,et al.  Multilingual Evidence Improves Clustering-based Taxonomy Extraction , 2008, ECAI.

[40]  Paul Buitelaar,et al.  A systematic evaluation of concept-based cross-language information retrieval in the medical domain , 2002 .

[41]  Christian Biemann,et al.  Ontology Learning from Text: A Survey of Methods , 2005, LDV Forum.

[42]  Hardy Hansen,et al.  Dionysos: Archetypal Image of Indestructible Life , 1977 .

[43]  R. Mooney,et al.  Impact of Similarity Measures on Web-page Clustering , 2000 .

[44]  Pierre Zweigenbaum,et al.  Mining defining contexts to help structuring differential ontologies , 2005 .

[45]  Olivier Kraif From translational data to contrastive knowledge: Using bi-text for bilingual lexicons extraction , 2003 .

[46]  Magnus Merkel,et al.  Understanding and enhancing translation by parallel text processing , 1999 .

[47]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[48]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[49]  Gene H. Golub,et al.  Matrix computations , 1983 .

[50]  Stephan Bloehdorn,et al.  Boosting for Text Classification with Semantic Features , 2004, WebKDD.

[51]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[52]  C. K. Ogden,et al.  The Meaning of Meaning , 1923 .

[53]  Kuo-Chung Tai,et al.  The Tree-to-Tree Correction Problem , 1979, JACM.

[54]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[55]  Patrick Pantel,et al.  Automatically Labeling Semantic Classes , 2004, NAACL.

[56]  Kristina Nilsson,et al.  Using Semantic Features Derived from Word-Space Models for Swedish Coreference Resolution , 2009, NODALIDA.

[57]  Brian Roark,et al.  Noun-Phrase Co-Occurence Statistics for Semi-Automatic Semantic Lexicon Construction , 1998, COLING-ACL.

[58]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[59]  Hinrich Schütze,et al.  Customizing a Lexicon to Better Suit a Computational Task , 1996 .

[60]  Min Song,et al.  Integration of association rules and ontologies for semantic query expansion , 2007, Data Knowl. Eng..

[61]  Key-Sun Choi,et al.  Taxonomy Learning using Term Specificity and Similarity , 2006, OntologyLearning@COLING/ACL.

[62]  Steffen Staab,et al.  Ontology Learning Part One - On Discoverying Taxonomic Relations from the Web , 2002 .

[63]  John Tait,et al.  Karen Spärck Jones , 2008 .

[64]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[65]  Sergei Nirenburg,et al.  Ontological Semantics (Language, Speech, and Communication) , 2004 .

[66]  Sara Rydin,et al.  Building a hyponymy lexicon with hierarchical structure , 2002, ACL 2002.

[67]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[68]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[69]  Martin Haspelmath,et al.  Lexical typology from a cognitive and linguistic point of view , 2001 .

[70]  Gerold Schneider,et al.  Using distributional similarity to organise biomedical terminology , 2005 .

[71]  Paola Velardi,et al.  Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites , 2004, CL.

[72]  Jörg Tiedemann,et al.  Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity , 2006, ACL.

[73]  Magnus Sahlgren,et al.  Automatic bilingual lexicon acquisition using random indexing of parallel corpora , 2005, Nat. Lang. Eng..

[74]  Noah A. Smith,et al.  The Web as a Parallel Corpus , 2003, CL.

[75]  Fabio Rinaldi,et al.  An environment for relation mining over richly annotated corpora: the case of GENIA , 2006, BMC Bioinformatics.

[76]  G. Ruge,et al.  Term associations and computational linguistics , 1991 .

[77]  Goran Nenadic,et al.  Mining term similarities from corpora , 2004 .

[78]  Mohammed Bennamoun,et al.  Tree-Traversing Ant Algorithm for term clustering based on featureless similarities , 2007, Data Mining and Knowledge Discovery.

[79]  David Yarowsky,et al.  Inducing Multilingual Text Analysis Tools via Robust Projection across Aligned Corpora , 2001, HLT.

[80]  Mirella Lapata,et al.  Dependency-Based Construction of Semantic Space Models , 2007, CL.

[81]  Zhenyu Liu,et al.  Knowledge-based query expansion to support scenario-specific retrieval of medical free text , 2005, SAC '05.

[82]  G. Chartrand,et al.  Graph similarity and distance in graphs , 1998 .

[83]  Jörg Tiedemann Recycling Translations : Extraction of Lexical Data from Parallel Corpora and their Application in Natural Language Processing , 2003 .

[84]  Adam Kilgarriff Googleology is Bad Science , 2007, Computational Linguistics.

[85]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[86]  N. Goodman,et al.  Ways of Worldmaking , 1980 .

[87]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[88]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[89]  Pernilla Danielsson Automatic extraction of meaningful units from corpora: A corpus-driven approach using the word stroke , 2003 .

[90]  Lars Borin You'll Take the High Road and I'll Take the Low Road: Using a Third Language to Improve Bilingual Word Alignment , 2000, COLING.

[91]  Hinrich Schütze,et al.  A comparison of classifiers and document representations for the routing problem , 1995, SIGIR '95.

[92]  Magnus Rosell Clustering in Swedish : The Impact of some Properties of the Swedish Language on Document Clustering and an Evaluation Method , 2005 .

[93]  W. Alston Philosophy of Language , 1964 .

[94]  Steffen Staab,et al.  Comparing conceptual, parti-tional and agglomerative clustering for learning taxonomies from text , 2004 .

[95]  Inderjeet Mani,et al.  Automatically Inducing Ontologies from Corpora , 2004 .

[96]  Bernard Wuellner Foundations of Philosophy , 1950 .

[97]  Josef Kittler,et al.  Combining multiple classifiers by averaging or by multiplying? , 2000, Pattern Recognit..

[98]  Magnus Sahlgren,et al.  Creating bilingual lexica using reference wordlists for alignment of monolingual semantic vector spaces , 2005, NODALIDA.

[99]  Paola Velardi,et al.  TermExtractor: a Web Application to Learn the Shared Terminology of Emergent Web Communities , 2007, IESA.

[100]  H. Schütze,et al.  Dimensions of meaning , 1992, Supercomputing '92.

[101]  Sergei Nirenburg,et al.  Book Review: Ontological Semantics, by Sergei Nirenburg and Victor Raskin , 2004, CL.

[102]  Martin Haspelmath,et al.  Universal units in the lexicon , 2001 .

[103]  Magnus Sahlgren,et al.  The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces , 2006 .

[104]  Harold L. Somers,et al.  Knowledge Extraction from Bilingual Corpora , 1999, SCIE.

[105]  P. Kanerva,et al.  Permutations as a means to encode order in word space , 2008 .

[106]  Reinhard Rapp,et al.  Identifying Word Translations in Non-Parallel Texts , 1995, ACL.

[107]  Pascale Fung,et al.  Finding Terminology Translations from Non-parallel Corpora , 1997, VLC.

[108]  David J. Weir,et al.  Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity , 2005, CL.

[109]  W. Quine Ontological Relativity and Other Essays , 1969 .

[110]  Helge Dyvik,et al.  Translations as a Semantic Knowledge Source , 2003 .

[111]  Steffen Staab,et al.  On How to Perform a Gold Standard Based Evaluation of Ontology Learning , 2006, SEMWEB.

[112]  Pascale Fung,et al.  Creating a Bilingual Ontology: A Corpus-Based Approach for Aligning WordNet and HowNet , 2002 .

[113]  Stephan Bloehdorn,et al.  Ontology-Based Question Answering for Digital Libraries , 2007, ECDL.

[114]  George Boolos,et al.  Computability and logic , 1974 .

[115]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[116]  James A. Hendler,et al.  Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential , 2002 .

[117]  John Murdoch,et al.  Statistics: Problems and Solutions , 1973 .

[118]  Key-Sun Choi,et al.  Determining the Specificity of Terms based on Information Theoretic Measures , 2004 .

[119]  Jussi Piitulainen,et al.  Discovering Synonyms and Other Related Words , 2004 .

[120]  Tomaz Erjavec,et al.  The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages , 2006, LREC.

[121]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[122]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.