A high-performance web-based information system for publishing large-scale species range maps in support of biodiversity studies

Abstract Functionality, performance and scalability are critical to Web-based information systems for publishing and disseminating large-scale species distribution data. Existing systems do not support dynamic spatial window queries on large-scale species range maps that are important to compute alpha and beta diversities for biodiversity analysis and modeling. In this study, we have developed a main-memory based novel quadtree data structure to represent large-scale species range maps and support dynamic spatial window queries to retrieve a list of species and their area sizes within a query window efficiently. Using the NatureServe's 400 0 + bird species range maps, experiment results have shown that the memory footprint of the proposed quadtree data structure representing the range maps of all the species is about 1/6 of the quadtree derived by combining individual quadtrees each representing a species range map. The experiment results have also demonstrated that the query response times of our main-memory spatial database are well below a fraction of a second for query windows as large as 10 × 10°, which are 2–3 orders better than using a typical disk-resident spatial database system.

[1]  Hanan Samet,et al.  Object-based and image-based object representations , 2004, CSUR.

[2]  Le Gruenwald,et al.  Embedding and extending GIS for exploratory analysis of large-scale species distribution data , 2008, GIS '08.

[3]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[4]  G. Foody GIS: biodiversity applications , 2008 .

[5]  Jesse Cleary,et al.  OBIS-SEAMAP: The World Data Center for Marine Mammal, Sea Bird, and Sea Turtle Distributions , 2009 .

[6]  Carlo Ricotta,et al.  Through the Jungle of Biological Diversity , 2005, Acta biotheoretica.

[7]  Richard Field,et al.  Spatial species‐richness gradients across scales: a meta‐analysis , 2009 .

[8]  Jianting Zhang Efficient managing large scale species range maps in a spatial database environment , 2009, 2009 17th International Conference on Geoinformatics.

[9]  Robert P. Guralnick,et al.  A web-based GIS tool for exploring the world's biodiversity: The Global Biodiversity Information Facility Mapping and Analysis Portal Application (GBIF-MAPA) , 2007, Ecol. Informatics.

[10]  H. Qian Environment–richness relationships for mammals, birds, reptiles, and amphibians at global and regional scales , 2010, Ecological Research.

[11]  Jianting Zhang,et al.  Testing the correlation between beta diversity and differences in productivity among global ecoregions, biomes, and biogeographical realms , 2009, Ecol. Informatics.

[12]  Gregory Leptoukh,et al.  Giovanni: A Web Service Workflow-Based Data Visualization and Analysis System , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Jianting Zhang Speeding up large-scale geospatial polygon rasterization on GPGPUs , 2011, HPDGIS '11.

[14]  Alex P. Oberle,et al.  GIS Concepts and ArcGIS Methods , 2004 .

[15]  Jianting Zhang,et al.  GBD-Explorer: Extending open source java GIS for exploring ecoregion-based biodiversity data , 2007, Ecol. Informatics.

[16]  D. S. Hammond,et al.  TRENDS IN THE MEASUREMENT OF ALPHA DIVERSITY IN THE LAST TWO DECADES , 2005 .

[17]  Lammert Kooistra,et al.  Development of a Dynamic Web Mapping Service for Vegetation Productivity Using Earth Observation and in situ Sensors in a Sensor Web Based Approach , 2009, Sensors.

[18]  Hanan Samet,et al.  Foundations of multidimensional and metric data structures , 2006, Morgan Kaufmann series in data management systems.

[19]  Ş. Procheş,et al.  The world's biogeographical regions: cluster analyses based on bat distributions , 2005 .

[20]  Bertram Ludäscher,et al.  A Scientific Workflow Approach to Distributed Geospatial Data Processing using Web Services , 2005, SSDBM.

[21]  David A. Patterson,et al.  Computer Architecture - A Quantitative Approach, 5th Edition , 1996 .

[22]  A. Peterson,et al.  Biodiversity informatics: managing and applying primary biodiversity data. , 2004, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[23]  F. Bisby The quiet revolution: biodiversity informatics and the internet. , 2000, Science.

[24]  Kevin J. Gaston,et al.  Measuring beta diversity for presence–absence data , 2003 .

[25]  Mark Gahegan,et al.  Geospatial Cyberinfrastructure: Past, present and future , 2010, Comput. Environ. Urban Syst..

[26]  Jeffery S. Horsburgh,et al.  A first approach to web services for the National Water Information System , 2008, Environ. Model. Softw..

[27]  Geoffrey J. Hay,et al.  Free and open source geographic information tools for landscape ecology , 2009, Ecol. Informatics.

[28]  Antoine Guisan,et al.  Predictive habitat distribution models in ecology , 2000 .

[29]  Michael Gertz,et al.  Sensor data dissemination systems using Web-based standards: a case study of publishing data in support of evapotranspiration models in California , 2009 .

[30]  Kate S. He,et al.  Linking variability in species composition and MODIS NDVI based on beta diversity measurements , 2009 .

[31]  R. Guralnick,et al.  Biodiversity informatics: automated approaches for documenting global biodiversity patterns and processes , 2009, Bioinform..

[32]  Toshimi Minoura,et al.  WebGRMS: Prototype software for web-based mapping of biological collections , 2007, Biodiversity and Conservation.

[33]  Katherine L. Gross,et al.  WHAT IS THE OBSERVED RELATIONSHIP BETWEEN SPECIES RICHNESS AND PRODUCTIVITY , 2001 .

[34]  Giles M. Foody,et al.  An overview of recent remote sensing and GIS based research in ecological informatics , 2011, Ecol. Informatics.

[35]  Alexander Zeier,et al.  HYRISE - A Main Memory Hybrid Storage Engine , 2010, Proc. VLDB Endow..

[36]  Jeffrey A. Cardille,et al.  SFMN GeoSearch: An interactive approach to the visualization and exchange of point-based ecological data , 2009, Ecol. Informatics.

[37]  Marco A. Casanova,et al.  Geoweb Services for Sharing Modelling Results in Biodiversity Networks , 2009, Trans. GIS.

[38]  Jianting Zhang,et al.  Using Web Services and Scientific Workflow for Species Distribution Prediction Modeling , 2005, WAIM.

[39]  Patrick N. Halpin,et al.  Geospatial web services within a scientific workflow: Predicting marine mammal habitats in a dynamic environment , 2007, Ecol. Informatics.

[40]  Daniel A. Keim,et al.  Challenges in Visual Data Analysis , 2006, Tenth International Conference on Information Visualisation (IV'06).

[41]  Michael Gertz,et al.  Efficiently managing large-scale raster species distribution data in PostgreSQL , 2009, GIS.

[42]  T. Edwin Chow,et al.  The Potential of Maps APIs for Internet GIS Applications , 2008, Trans. GIS.

[43]  Marcel Frehner,et al.  Virtual database: Spatial analysis in a Web-based data management system for distributed ecological data , 2006, Environ. Model. Softw..

[44]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[45]  M. Willig,et al.  The Relationship Between Productivity and Species Richness , 1999 .