Promoting Interoperability of Resources in META-SHARE

META-NET is a Network of Excellence aiming to improve significantly on the number of language technologies that can assist European citizens, by enabling enhanced communication and cooperation across languages. A major outcome will be META- SHARE, a searchable network of repositories that collect resources such as language data, tools and related web services, covering a large number of European languages. These resources are intended to facilitate the development and evaluation of a wide range of new language processing applications and services. An important aim of META-SHARE is the promotion of interoperability amongst resources. In this paper, we describe our planned efforts to help to achieve this aim, through the adoption of the UIMA framework and the integration of the U-Compare system within the META-SHARE network. U- Compare facilitates the rapid construction and evaluation of NLP applications that make use of interoperable components, and, as such, can help to speed up the development of a new generation of European language technology applications.

[1]  Matthieu Vernier,et al.  Building a French-speaking community around UIMA, gathering research, education and industrial partners, mainly in Natural Language Processing and Speech Recognizing domains , 2010 .

[2]  Sophia Ananiadou,et al.  Disambiguating the species of biomedical named entities using natural language parsers , 2010, Bioinform..

[3]  Simone Teufel,et al.  An Architecture for Language Processing for Scientific Texts , 2006 .

[4]  Sophia Ananiadou,et al.  Using Workflows to Explore and Optimise Named Entity Recognition for Chemistry , 2011, PloS one.

[5]  Sampo Pyysalo,et al.  Overview of BioNLP’09 Shared Task on Event Extraction , 2009, BioNLP@HLT-NAACL.

[6]  Rune Sætre,et al.  AkaneRE relation extraction: Protein interaction and normalization in the BioCreAtIvE II. 5 challeng , 2009 .

[7]  K. Bretonnel Cohen,et al.  U-Compare: A modular NLP workflow construction and evaluation system , 2011, IBM J. Res. Dev..

[8]  Amol Kamat,et al.  A Metadata Search Engine for Digital Language Archives , 2005, D Lib Mag..

[9]  Núria Bel Rafecas,et al.  Platform for automatic, normalized annotation and cost-effective acquisition of language resources for human language technologies: PANACEA , 2010 .

[10]  Sophia Ananiadou,et al.  Text mining meets workflow: linking U-Compare with Taverna. , 2010, Bioinformatics.

[11]  Peter Murray-Rust,et al.  High-Throughput Identification of Chemistry in Life Science Texts , 2006, CompLife.

[12]  Montserrat Marimon,et al.  CLARIN: Common Language Resources and Technology Infrastructure , 2008, Proces. del Leng. Natural.

[13]  K Bretonnel Cohen,et al.  Journal of Biomedical Discovery and Collaboration Open Access an Open-source Framework for Large-scale, Flexible Evaluation of Biomedical Text Mining Systems , 2008 .

[14]  János Csirik,et al.  The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text , 2010, CoNLL Shared Task.

[15]  Jonathan G. Fiscus,et al.  A Pratical Introduction to ATLAS , 2002, LREC.

[16]  Jonathan G. Fiscus,et al.  A Practical Introduction to ATLAS , 2002 .

[17]  K. Bretonnel Cohen,et al.  U-Compare: share and compare text mining tools with UIMA , 2009, Bioinform..

[18]  Mikel L. Forcada,et al.  Open-Source Portuguese-Spanish Machine Translation , 2006, PROPOR.