A prototype system for multilingual data discovery of International Long-Term Ecological Research (ILTER) Network data

Shared ecological data have the potential to revolutionize ecological research just as shared genetic sequence data have done for biological research. However, for ecological data to be useful, it must first be discoverable. A broad-scale research topic may require that a researcher be able to locate suitable data from a variety of global, regional and national data providers, which often use different local languages to describe their data. Thus, one of the challenges of international sharing of long-term data is facilitation of multilingual searches. Such searches are hindered by lack of equivalent terms across languages and by uneven application of keywords in ecological metadata. To test whether a thesaurus-based approach to multilingual data searching might be effective, we implemented a prototype web-services-based system for searching International Long-Term Ecological Research Network data repositories. The system builds on the use of a multilingual thesaurus to make searches more complete than would be obtained through search term-translation alone. The resulting system, when coupled to commodity online translation systems, demonstrates the possibility of achieving multilingual searches for ecological data.

[1]  Vanda Broughton Essential Thesaurus Construction , 2006 .

[2]  Jennifer E. Rowley,et al.  Relationships in the Organization of Knowledge , 2002, J. Documentation.

[3]  Shawn Bowers,et al.  An ontology for describing and synthesizing ecological observation data , 2007, Ecol. Informatics.

[4]  Johannes Keizer,et al.  The AGROVOC Concept Scheme : A Walkthrough , 2012 .

[5]  J. Chave The problem of pattern and scale in ecology: what have we learned in 20 years? , 2013, Ecology letters.

[6]  Honglin He,et al.  Fostering ecological data sharing: collaborations in the International Long Term Ecological Research Network , 2015 .

[7]  S. Levin The problem of pattern and scale in ecology , 1992 .

[8]  Sean Bechhofer,et al.  SKOS Simple Knowledge Organization System Reference , 2009 .

[9]  R. Macarthur The Problem of Pattern and Scale in Ecology: The Robert H. MacArthur Award Lecture , 2005 .

[10]  Shawn Bowers,et al.  Advancing ecological research with ontologies. , 2008, Trends in ecology & evolution.

[11]  J. Lawton Are there general laws in ecology , 1999 .

[12]  Kristin Vanderbilt,et al.  A multilingual metadata catalog for the ILTER: Issues and approaches , 2010, Ecol. Informatics.

[13]  Stefan Stoll,et al.  The long-term ecological research (LTER) network: Relevance, current status, future perspective and examples from marine, freshwater and terrestrial long-term observation , 2016 .

[14]  Katharina Schleidt,et al.  SERONTO: a Socio-Ecological Research and Observation oNTOlogy. , 2008 .

[15]  Guan-Shuo Mai,et al.  Linked Open Data of Ecology (LODE): A New Approach for Ecological Data Sharing , 2011 .

[16]  Sylvie Davies,et al.  Multilingual thesauri for the modern world - no ideal solution? , 2001, J. Documentation.

[17]  Matthew B. Jones,et al.  Managing Scientific Metadata , 2001, IEEE Internet Comput..

[18]  Barry Smith,et al.  Semantics in Support of Biodiversity Knowledge Discovery: An Introduction to the Biological Collections Ontology and Related Ontologies , 2014, PloS one.

[19]  Matthew Jones,et al.  Maximizing the Value of Ecological Data with Structured Metadata: An Introduction to Ecological Metadata Language (EML) and Principles for Metadata Creation , 2005 .

[20]  José João Almeida,et al.  T2O - Recycling Thesauri into a Multilingual Ontology , 2006, LREC.

[21]  Dikshant Shahi Apache Solr: An Introduction , 2015 .

[22]  Anthony J. G. Hey,et al.  Jim Gray on eScience: a transformed scientific method , 2009, The Fourth Paradigm.

[23]  Ana Domínguez,et al.  Estimating effective landscape distances and movement corridors: Comparison of habitat and genetic data , 2015 .

[24]  H. Shibata,et al.  ILTER and JaLTER: Their Missions and Linkage to Database Development in the Asia-Pacific Region , 2012 .

[25]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[26]  Matthew B Jones,et al.  Ecoinformatics: supporting ecology as a data-intensive science. , 2012, Trends in ecology & evolution.

[27]  John H. Porter,et al.  A Metadata-based Framework for Multilingual Ecological Information Management , 2006 .

[28]  Lesley Mackenzie-Robb,et al.  Controlled vocabularies vs. full text indexing , 2010 .

[29]  Carol A. Bean,et al.  Relationships in the Organization of Knowledge , 2001, Information Science and Knowledge Management.

[30]  David Wood,et al.  Linked Data , 2014 .

[31]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[32]  P. Balvanera,et al.  Using long-term ecosystem service and biodiversity data to study the impacts and adaptation options , 2013 .

[33]  J. Magnuson,et al.  Intercontinental Comparison of Small-Lake Fish Assemblages: The Balance between Local and Regional Processes , 1990, The American Naturalist.

[34]  M. Willig,et al.  Patterns of species density and productivity at different spatial scales in herbaceous plant communities , 2000 .

[35]  Herbert Schentz,et al.  EnvThes - interlinked thesaurus for long term ecological research, monitoring, and experiments , 2013, EnviroInfo.

[36]  Matthew B. Jones,et al.  Metacat: a schema-independent XML database system , 2001, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001.