European Language Grid: An Overview

With 24 official EU languages and many additional ones, multilingualism in Europe and an inclusive Digital Single Market can only be enabled through Language Technologies (LTs). The European LT business is dominated by hundreds of SMEs and a few large players. Many are world-class, with technologies that outperform the global players. However, the European LT business is also fragmented by nation states, languages, verticals and sectors, which significantly holds back its impact. The European Language Grid (ELG) project addresses this fragmentation by establishing the ELG as the primary platform for LT in Europe. The ELG is a scalable cloud platform that provides easy-to-integrate access to hundreds of commercial and non-commercial LTs for all European languages, including running tools and services as well as data sets and resources. Once fully operational, it will enable the commercial and non-commercial European LT community to deposit their technologies and data sets in the ELG, to deploy them through the grid, and to connect with other resources. The ELG will boost the Multilingual Digital Single Market and help build a thriving European LT community, creating new jobs and opportunities. Furthermore, the ELG project organises two open calls for up to 20 pilot projects. It also sets up 32 National Competence Centres (NCCs) and the European LT Council (LTC) for outreach and coordination purposes.
