Cross-language Ontology Learning: Incorporating and Exploiting Cross-language Data in the Ontology Learning Process

An ontology is a knowledge-representation structure, where words, terms or concepts are defined by their mutual hierarchical relations. Ontologies are becoming ever more prevalent in the world of n ...

[1]  M. Teresa Cabré Castellví,et al.  Automatic term detection: A review of current systems , 2001 .

[2]  Jussi Piitulainen,et al.  Discovering Synonyms and Other Related Words , 2004 .

[3]  Nina Wacholder,et al.  Spotting and Discovering Terms Through Natural Language Processing , 2003, Information Retrieval.

[4]  Brian Roark,et al.  Noun-Phrase Co-Occurence Statistics for Semi-Automatic Semantic Lexicon Construction , 1998, COLING-ACL.

[5]  Dan I. Moldovan,et al.  Automatic Discovery of Part-Whole Relations , 2006, CL.

[6]  Josef Kittler,et al.  Combining multiple classifiers by averaging or by multiplying? , 2000, Pattern Recognit..

[7]  Magnus Sahlgren,et al.  Creating bilingual lexica using reference wordlists for alignment of monolingual semantic vector spaces , 2005, NODALIDA.

[8]  Steffen Staab,et al.  Learning Taxonomic Relations from Heterogeneous Sources of Evidence , 2005 .

[9]  Ralf Steinmetz,et al.  Precision and Recall for Ontology Enrichment , 2004 .

[10]  Zhenyu Liu,et al.  Knowledge-based query expansion to support scenario-specific retrieval of medical free text , 2005, SAC '05.

[11]  G. Ruge,et al.  Term associations and computational linguistics , 1991 .

[12]  Hardy Hansen,et al.  Dionysos: Archetypal Image of Indestructible Life , 1977 .

[13]  Magnus Sahlgren,et al.  The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces , 2006 .

[14]  R. Mooney,et al.  Impact of Similarity Measures on Web-page Clustering , 2000 .

[15]  Pierre Zweigenbaum,et al.  Mining defining contexts to help structuring differential ontologies , 2005 .

[16]  Paul Buitelaar,et al.  A systematic evaluation of concept-based cross-language information retrieval in the medical domain , 2002 .

[17]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[18]  Goran Nenadic,et al.  Mining term similarities from corpora , 2004 .

[19]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[20]  Peter Koch,et al.  Lexical typology from a cognitive and linguistic point of view , 2001 .

[21]  Leif Grönqvist Exploring Latent Semantic Vector Models Enriched With N-grams , 2006 .

[22]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[23]  Anders Holst,et al.  Random indexing of text samples for latent semantic analysis , 2000 .

[24]  Viggo Kann,et al.  Finding the Correct Interpretation of Swedish Compounds, a Statistical Approach , 2004, LREC.

[25]  C. K. Ogden,et al.  The Meaning of Meaning , 1923 .

[26]  Kuo-Chung Tai,et al.  The Tree-to-Tree Correction Problem , 1979, JACM.

[27]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[28]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[29]  Pascale Fung,et al.  Creating a Bilingual Ontology: A Corpus-Based Approach for Aligning WordNet and HowNet , 2002 .

[30]  Stephan Bloehdorn,et al.  Ontology-Based Question Answering for Digital Libraries , 2007, ECDL.

[31]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[32]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[33]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[34]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[35]  Jörg Tiedemann,et al.  Optimization of word alignment clues , 2005, Natural Language Engineering.

[36]  Fabio Rinaldi,et al.  A symbolic approach to automatic multiword term structuring , 2005, Comput. Speech Lang..

[37]  Dominic Widdows A Mathematical Model for Context and Word-Meaning , 2003, CONTEXT.

[38]  Steffen Staab,et al.  Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis , 2005, J. Artif. Intell. Res..

[39]  Pascale Fung,et al.  Finding Terminology Translations from Non-parallel Corpora , 1997, VLC.

[40]  W. Quine Ontological Relativity and Other Essays , 1969 .

[41]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[42]  Éric Gaussier,et al.  Bilingual terminology extraction : an approach based on a multilingual thesaurus applicable to comparable corpora , 2002 .

[43]  Magnus Sahlgren,et al.  Automatic bilingual lexicon acquisition using random indexing of parallel corpora , 2005, Nat. Lang. Eng..

[44]  Harold L. Somers,et al.  Knowledge Extraction from Bilingual Corpora , 1999, SCIE.

[45]  P. Kanerva,et al.  Permutations as a means to encode order in word space , 2008 .

[46]  Noah A. Smith,et al.  The Web as a Parallel Corpus , 2003, CL.

[47]  Fabio Rinaldi,et al.  An environment for relation mining over richly annotated corpora: the case of GENIA , 2006, BMC Bioinformatics.

[48]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[49]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[50]  Balakrishnan Chandrasekaran,et al.  What are ontologies, and why do we need them? , 1999, IEEE Intell. Syst..

[51]  Hinrich Schütze,et al.  Customizing a Lexicon to Better Suit a Computational Task , 1996 .

[52]  Min Song,et al.  Integration of association rules and ontologies for semantic query expansion , 2007, Data Knowl. Eng..

[53]  Marti A. Hearst,et al.  SVMs—a practical consequence of learning theory , 1998 .

[54]  Key-Sun Choi,et al.  Taxonomy Learning using Term Specificity and Similarity , 2006, OntologyLearning@COLING/ACL.

[55]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[56]  Hans Hjelm,et al.  Identifying Cross Language Term Equivalents Using Statistical Machine Translation and Distributional Association Measures , 2007, NODALIDA.

[57]  Olivier Bodenreider,et al.  Lexically-suggested hyponymic relations among medical terms and their representation in the UMLS , 2001 .

[58]  Steffen Staab,et al.  Comparing Conceptual, Divise and Agglomerative Clustering for Learning Taxonomies from Text , 2004, ECAI.

[59]  Christoph Schwarz,et al.  Automatic syntactic analysis of free text , 1990, J. Am. Soc. Inf. Sci..

[60]  Hang Li,et al.  Word Clustering and Disambiguation Based on Co-occurrence Data , 1998, COLING.

[61]  Mirella Lapata,et al.  Dependency-Based Construction of Semantic Space Models , 2007, CL.

[62]  G. Chartrand,et al.  Graph similarity and distance in graphs , 1998 .

[63]  Jörg Tiedemann Recycling Translations : Extraction of Lexical Data from Parallel Corpora and their Application in Natural Language Processing , 2003 .

[64]  Gene H. Golub,et al.  Matrix computations , 1983 .

[65]  Adam Kilgarriff Googleology is Bad Science , 2007, Computational Linguistics.

[66]  Eugene Albert Nida,et al.  A Componential Analysis of Meaning: An Introduction to Semantic Structures , 1979 .

[67]  Carolyn E. Begg,et al.  Database Systems: A Practical Approach to Design, Implementation and Management , 1998 .

[68]  Tomaz Erjavec,et al.  The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages , 2006, LREC.

[69]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[70]  Sergei Nirenburg,et al.  Ontological Semantics (Language, Speech, and Communication) , 2004 .

[71]  Sara Rydin,et al.  Building a hyponymy lexicon with hierarchical structure , 2002, ACL 2002.

[72]  Reinhard Rapp,et al.  Identifying Word Translations in Non-Parallel Texts , 1995, ACL.

[73]  George A. Miller,et al.  Squibs and Discussions: WordNet Nouns: Classes and Instances , 2006, CL.

[74]  Hinrich Schütze,et al.  A comparison of classifiers and document representations for the routing problem , 1995, SIGIR '95.

[75]  Heike Zinsmeister,et al.  Rezension: Tony McEnery, Richard Xiao, Yukio Tono: Corpus-Based Language Studies. 2006 : [Rezension] , 2008 .

[76]  Nigel Collier,et al.  Ontology Express : Statistical and Non-Monotonic Learning of Domain Ontologies from Text , 2004 .

[77]  Q. Mcnemar Note on the sampling error of the difference between correlated proportions or percentages , 1947, Psychometrika.

[78]  Helena Ahonen-Myka,et al.  Simple Semantics in Topic Detection and Tracking , 2004, Information Retrieval.

[79]  Pascale Fung,et al.  Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora , 2005, IJCNLP.

[80]  Nelson Goodman,et al.  Ways of Worldmaking , 1979 .

[81]  Martin Volk,et al.  The length factor in automatic bilingual terminology extraction , 2002 .

[82]  Christoph Schwarz,et al.  LiSa-morphological analysis for information retrieval , 2005, NODALIDA.

[83]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[84]  David Yarowsky,et al.  Inducing Multilingual Text Analysis Tools via Robust Projection across Aligned Corpora , 2001, HLT.

[85]  Helge Dyvik,et al.  Translations as a Semantic Knowledge Source , 2003 .

[86]  Steffen Staab,et al.  On How to Perform a Gold Standard Based Evaluation of Ontology Learning , 2006, SEMWEB.

[87]  Jing Jin,et al.  Towards a weighted-tree similarity algorithm for RNA secondary structure comparison , 2005, Eighth International Conference on High-Performance Computing in Asia-Pacific Region (HPCASIA'05).

[88]  Steffen Staab,et al.  Ontology-based text clustering , 2001, IJCAI 2001.

[89]  Eugene Charniak,et al.  Determining the specificity of nouns from text , 1999, EMNLP.

[90]  José Gabriel Pereira Lopes,et al.  Extracting Equivalents from Aligned Parallel Texts: Comparison of Measures of Similarity , 2000, IBERAMIA-SBIA.

[91]  Daniel Jurafsky,et al.  Semantic Taxonomy Induction from Heterogenous Evidence , 2006, ACL.

[92]  Christian Biemann,et al.  Ontology Learning from Text: A Survey of Methods , 2005, LDV Forum.

[93]  Paul Buitelaar,et al.  Multilingual Evidence Improves Clustering-based Taxonomy Extraction , 2008, ECAI.

[94]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[95]  Paola Velardi,et al.  Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites , 2004, CL.

[96]  Steffen Staab,et al.  Ontology Learning Part One - On Discoverying Taxonomic Relations from the Web , 2002 .

[97]  Walter Daelemans,et al.  Semantic and Syntactic Features for Dutch Coreference Resolution , 2008, CICLing.

[98]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[99]  H. Schütze,et al.  Dimensions of meaning , 1992, Supercomputing '92.

[100]  Sergei Nirenburg,et al.  Book Review: Ontological Semantics, by Sergei Nirenburg and Victor Raskin , 2004, CL.

[101]  Martin Haspelmath,et al.  Universal units in the lexicon , 2001 .

[102]  Fred Karlsson,et al.  SWETWOL: A Comprehensive Morphological Analyser for Swedish , 1992 .

[103]  Ulrike Oster Classifying domain-specific intraterm relations: a schema-based approach , 2006 .

[104]  Gerold Schneider,et al.  Using distributional similarity to organise biomedical terminology , 2005 .

[105]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[106]  John Sinclair The Empty Lexicon , 1996 .

[107]  I. Dan Melamed,et al.  Models of translation equivalence among words , 2000, CL.

[108]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[109]  Stephan Bloehdorn,et al.  Learning Ontologies to Improve Text Clustering and Classification , 2005, GfKl.

[110]  Johan Carlberger,et al.  Implementing an Efficient Part-Of-Speech Tagger , 1999, Softw. Pract. Exp..

[111]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[112]  Stephan Bloehdorn,et al.  Boosting for Text Classification with Semantic Features , 2004, WebKDD.

[113]  Patrick Pantel,et al.  Automatically Labeling Semantic Classes , 2004, NAACL.

[114]  George Boolos,et al.  Computability and logic , 1974 .

[115]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[116]  James A. Hendler,et al.  Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential , 2002 .

[117]  Jean-Michel Renders,et al.  Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval , 2005, Artif. Intell. Medicine.

[118]  Key-Sun Choi,et al.  Determining the Specificity of Terms based on Information Theoretic Measures , 2004 .

[119]  Vasileios Hatzivassiloglou,et al.  Translating Collocations for Bilingual Lexicons: A Statistical Approach , 1996, CL.

[120]  Inderjeet Mani,et al.  Automatically Inducing Ontologies from Corpora , 2004 .

[121]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[122]  Gábor Nagypál,et al.  Possibly imperfect ontologies for effective information retrieval , 2007 .

[123]  Philip Resnik,et al.  Exploiting Hidden Meanings: Using Bilingual Text for Monolingual Annotation , 2004, CICLing.

[124]  Frank Schilder,et al.  TLR at DUC : Tree similarity , 2005 .

[125]  Paul Buitelaar,et al.  Ontology-based Information Extraction with SOBA , 2006, LREC.

[126]  Patrick Pantel,et al.  Discovering word senses from text , 2002, KDD.

[127]  Reinhard Rapp,et al.  Automatic Identification of Word Translations from Unrelated English and German Corpora , 1999, ACL.