KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures

We outline work to be carried out within the framework of an impending EC project. The goal is to construct a language-independent information system for a specific domain (environment/ecology) anchored in a language-independent ontology that is linked to WordNets in several languages. For each language, information extraction and identification of lexicalized concepts with ontological entries will be done by text miners ("Kybots"). The mapping of language-specific lexemes to the ontology allows for crosslinguistic identification and translation of equivalent terms. The infrastructure developed within this project will enable long-range knowledge sharing and transfer to many languages and cultures, addressing the need for global and uniform transition of knowledge beyond the domain of ecology and environment addressed here.

[1]  German Rigau,et al.  A Proposal for a Shallow Ontologization of Wordnet , 2005, Proces. del Leng. Natural.

[2]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[3]  Adam Pease,et al.  Towards a standard upper ontology , 2001, FOIS.

[4]  Eneko Agirre,et al.  Integrating selectional preferences in WordNet , 2002, ArXiv.

[5]  Federico Neri,et al.  Text Mining Applied to Multilingual Corpora , 2005 .

[6]  Chu-Ren Huang,et al.  Exploring interoperability of language resources: the case of cross-lingual semi-automatic enrichment of wordnets , 2009, Lang. Resour. Evaluation.

[7]  Piek Vossen,et al.  The MEANING Multilingual Central Repository , 2004 .

[8]  Chu-Ren Huang,et al.  Fostering Intercultural Collaboration: A Web Service Architecture for Cross-Fertilization of Distributed Wordnets , 2007, IWIC.

[9]  Adam Pease,et al.  Linking Lixicons and Ontologies: Mapping WordNet to the Suggested Upper Merged Ontology , 2003, IKE.

[10]  Lluís Padró,et al.  An Empirical Study for the Automatic Acquisition of Topic Signatures , 2006 .

[11]  Claudia Soria,et al.  Moving to dynamic computational lexicons with LeXFlow , 2006, LREC.

[12]  Bernardo Magnini,et al.  Integrating Subject Field Codes into WordNet , 2000, LREC.

[13]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[14]  Chu-Ren Huang,et al.  Towards Agent-based Cross-Lingual Interoperability of Distributed Lexical Resources , 2006, Proceedings of the Workshop on Multilingual Language Resources and Interoperability - MLRI '06.

[15]  C. Fellbaum An Electronic Lexical Database , 1998 .

[16]  Chu-Ren Huang,et al.  Hantology-A Linguistic Resource for Chinese Language Processing and Studying , 2006, LREC.

[17]  Allan Terry,et al.  The MILO: A General-purpose, Mid-level Ontology , 2004, IKE.

[18]  Chu-Ren Huang,et al.  Automatic Discovery of Named Entity Variants: Grammar-driven Approaches to Non-Alphabetical Transliterations , 2007, ACL.

[19]  Michael R. Genesereth,et al.  Knowledge Interchange Format , 1991, KR.

[20]  G. Miller,et al.  Semantic networks of english , 1991, Cognition.

[21]  Francesca Bertagna,et al.  Toward an Architecture for the Global Wordnet Initiative , 2006, SWAP.

[22]  Andrea Marchetti,et al.  XFlow: An XML-Based Document-Centric Workflow , 2005, WISE.

[23]  Nicola Guarino,et al.  WonderWeb Deliverable D18 Ontology Library , 2003 .

[24]  Piek T. J. M. Vossen,et al.  MEANING: a Roadmap to Knowledge Technologies , 2002, RAODMAP@COLING.

[25]  F. Neri,et al.  A Multilingual Text Mining based content gathering system for Open Source Intelligence , 2006 .

[26]  Aleš Horák,et al.  DEBVisDic - First Version of New Client-Server Wordnet Browsing and Editing Tool , 2005 .

[27]  Roy Bar-Haim,et al.  The Second PASCAL Recognising Textual Entailment Challenge , 2006 .

[28]  Steffen Staab,et al.  Semantic Web and Peer-to-Peer - Decentralized Management and Exchange of Knowledge and Information , 2006 .

[29]  Steffen Staab,et al.  WonderWeb: Ontology Infrastructure for the Semantic Web , 2004 .

[30]  German Rigau,et al.  Towards the Meaning Top Ontology: Sources of Ontological Meaning , 2004, LREC.

[31]  German Rigau,et al.  Exploring the Automatic Selection of Basic Level Concepts , 2006 .

[32]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[33]  The Sigma Ontology Development Environment , 2003 .

[34]  Eneko Agirre,et al.  Learning class-to-class selectional preferences , 2001, CoNLL.

[35]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[36]  Awais Rashid,et al.  XML Data Management: Native XML and XML-Enabled Database Systems , 2003 .

[37]  Chu-Ren Huang,et al.  Hanzi grid: toward a knowledge infrastructure for Chinese character-based cultures , 2007 .

[38]  Lluís Padró,et al.  Comparing methods for automatic acquisition of Topic Signatures , 2005 .

[39]  Christiane Fellbaum,et al.  Connecting the Universal to the Specific: Towards the Global Grid , 2007, IWIC.