Towards a national bio-environmental data facility: experiences from the Atlas of Living Australia

The Atlas of Living Australia (ALA: http://www.ala.org.au) provides the largest free and open repository of integrated biological and environmental information in a consistent format for the Australian region. As of June 2015, the ALA contained over 55 million records (10% of Global Biodiversity Information Facility’s (GBIF’s) total), consisting of 150,000+ native and alien species, nearly 500 layers of gridded and polygonal bio-environmental data, 39+ million pages of biological literature, and 45,000+ images of species and other integrated biological data. The development of the research interface to the ALA (http://spatial.ala.org.au) was the trigger to develop an architecture designed to tightly integrate environmental data for online use with biological data. Environmental layers are classed as environmental (gridded with continuous values) or contextual (polygonal with discrete class values). A suite of analysis and visualization tools have been developed to demonstrate the value of integrating the ALA’s biological and environmental data. This paper outlines the purpose and process of establishing the ALA and discusses the integration of environmental data relevant to biodiversity research in the Australian region and the vision for continually improved services for research, area management, education, and citizen science. The ALA’s environmental infrastructure addresses current needs but increased data types, volumes, and resolution suggests new directions are needed to provide quality services into the future. The experience of building the ALA has relevance for other agencies setting up similar infrastructure which supports integrated access to and use of their national biological and environmental information.

[1]  P. Barber,et al.  MARSPEC: ocean climate layers for marine spatial ecology , 2013 .

[2]  Miroslav Dudík,et al.  Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation , 2008 .

[3]  Michael F. Hutchinson,et al.  A new stream and nested catchment framework for Australia , 2013 .

[4]  Natasha Simons,et al.  Implementing DOIs for Research Data , 2012, D Lib Mag..

[5]  David Kent,et al.  Use of Representative Climate Futures in impact and adaptation assessment , 2012, Climatic Change.

[6]  Andre Zerger,et al.  Biodiversity profiling: components of a continental biodiversity information capability , 2013 .

[7]  John Deck,et al.  The Trouble with Triplets in Biodiversity Informatics: A Data-Driven Case against Current Identifier Practices , 2014, PloS one.

[8]  Lee Belbin,et al.  Which environmental variables should I use in my biodiversity model? , 2012, Int. J. Geogr. Inf. Sci..

[9]  J L Edwards,et al.  Interoperability of biodiversity databases: biodiversity information on every desktop. , 2000, Science.

[10]  J. Wilkin,et al.  Ocean Interpolation by Four-Dimensional Weighted Least Squares—Application to the Waters around Australasia , 2002 .

[11]  L. Belbin,et al.  Developing biodiverse plantings suitable for changing climatic conditions 2: Using the Atlas of Living Australia , 2012 .

[12]  A. H. Ball,et al.  How to Cite Datasets and Link to Publications:A Report of the Digital Curation Centre , 2012 .

[13]  Dave Roberts,et al.  Towards mainstreaming of biodiversity data publishing: recommendations of the GBIF Data Publishing Framework Task Group , 2011, BMC Bioinformatics.

[14]  Lee Belbin,et al.  Fuse: A FORTRAN V program for agglomerative fusion for minicomputers , 1984 .

[15]  Michael F. Hutchinson,et al.  New developments and applications in the ANUCLIM spatial climatic and bioclimatic modelling package , 2013, Environ. Model. Softw..

[16]  John Wieczorek,et al.  Darwin Core: An Evolving Community-Developed Biodiversity Data Standard , 2012, PloS one.

[17]  Antoine Guisan,et al.  Spatial modelling of biodiversity at the community level , 2006 .

[18]  John Kunze,et al.  DataONE: Data Observation Network for Earth - Preserving Data and Enabling Innovation in the Biological and Environmental Sciences , 2011, D Lib Mag..

[19]  A. Townsend Peterson,et al.  The Importance of Biodiversity E-infrastructures for Megadiverse Countries , 2015, PLoS biology.

[20]  Joseph Antony,et al.  The NCI High Performance Computing and High Performance Data Platform to Support the Analysis of Petascale Environmental Data Collections , 2014, ISESS.

[21]  K. F. レンツ,et al.  the Creative Commons , 2011 .

[22]  J. L. Parra,et al.  Very high resolution interpolated climate surfaces for global land areas , 2005 .

[23]  Roderic D. M. Page Taxonomic names, metadata, and the Semantic Web , 2006 .

[24]  J Tann,et al.  Atlas of Living Australia User Needs Analysis , 2008 .

[25]  Shawn W. Laffan,et al.  Endemism in the Australian flora , 2001 .

[26]  Chris Park,et al.  The Environment , 2010 .

[27]  David Manset,et al.  Flock together with CReATIVE-B: A roadmap of global research data infrastructures supporting biodiversity and ecosystem science , 2014 .

[28]  J. Elith,et al.  Using generalized dissimilarity modelling to analyse and predict patterns of beta diversity in regional biodiversity assessment , 2007 .

[29]  Irina Sens,et al.  The Tenth Anniversary of Assigning DOI Names to Scientific Data and a Five Year History of DataCite , 2015, D Lib Mag..

[30]  John La Salle,et al.  A specialist’s audit of aggregated occurrence records: An ‘aggregator’s’ perspective , 2013, ZooKeys.

[31]  J. Busby BIOCLIM - a bioclimate analysis and prediction system , 1991 .

[32]  Charles Troupin,et al.  Bio‐ORACLE: a global environmental dataset for marine species distribution modelling , 2012 .

[33]  Matthew B. Jones,et al.  Challenges and Opportunities of Open Data in Ecology , 2011, Science.

[34]  Trevor H. Booth,et al.  Using biodiversity databases to verify and improve descriptions of tree species climatic requirements , 2014 .

[35]  Diane M. Griffiths,et al.  THE REGENTS OF THE UNIVERSITY OF CALIFORNIA , 2007 .

[36]  Arno Scharl,et al.  The Geospatial Web: How Geobrowsers, Social Software and the Web 2.0 are Shaping the Network Society , 2007, The Geospatial Web.

[37]  E. Hand,et al.  Citizen science: People power , 2010, Nature.

[38]  Andrew C. Jones,et al.  Identifying and relating biological concepts in the Catalogue of Life , 2011, J. Biomed. Semant..

[39]  Millicent Abell,et al.  Project Steering Committee , 2000 .

[40]  L. Belbin The Use of Non-hierarchical Allocation Methods for Clustering Large Sets of Data , 1987, Aust. Comput. J..

[41]  Marc Hockings,et al.  Managers consider multiple lines of evidence important for biodiversity management decisions. , 2012, Journal of environmental management.

[42]  Daniel P. Miranker,et al.  Schema Driven Assignment and Implementation of Life Science Identifiers (lsids) , 2006 .

[43]  W. Sutherland,et al.  The need for evidence-based conservation. , 2004, Trends in ecology & evolution.

[44]  Robert P. Anderson,et al.  Maximum entropy modeling of species geographic distributions , 2006 .

[45]  Lee Belbin,et al.  Semi‐strong Hybrid Scaling, a new ordination algorithm , 1991 .

[46]  Lee Belbin,et al.  The Atlas of Living Australia‟s Spatial Portal , 2011 .