Inselect: Automating the Digitization of Natural History Collections

The world’s natural history collections constitute an enormous evidence base for scientific research on the natural world. To facilitate these studies and improve access to collections, many organisations are embarking on major programmes of digitization. This requires automated approaches to mass-digitization that support rapid imaging of specimens and associated data capture, in order to process the tens of millions of specimens common to most natural history collections. In this paper we present Inselect—a modular, easy-to-use, cross-platform suite of open-source software tools that supports the semi-automated processing of specimen images generated by natural history digitization programmes. The software is made up of a Windows, Mac OS X, and Linux desktop application, together with command-line tools that are designed for unattended operation on batches of images. Blending image visualisation algorithms that automatically recognise specimens together with workflows to support post-processing tasks such as barcode reading, label transcription and metadata capture, Inselect fills a critical gap to increase the rate of specimen digitization.

[1]  A. Peterson,et al.  New developments in museum-based informatics and applications in biodiversity analysis. , 2004, Trends in ecology & evolution.

[2]  A. Lister Natural history collections as sources of long-term datasets. , 2011, Trends in ecology & evolution.

[3]  Flavia Toloni,et al.  Natural history museum collections provide information on phenological change in British butterflies since the late-nineteenth century , 2014, International Journal of Biometeorology.

[4]  John La Salle,et al.  Whole-drawer imaging for digital management and curation of a large entomological collection , 2012, ZooKeys.

[5]  Nico Cellinese,et al.  Mass digitization of scientific collections: New opportunities to transform the use of biological specimens and underwrite biodiversity science , 2012, ZooKeys.

[6]  Stefan Schmidt,et al.  DScan – a high-performance digital scanning system for entomological collections , 2012, ZooKeys.

[7]  David Raila,et al.  InvertNet: a new paradigm for digital access to invertebrate collections , 2012, ZooKeys.

[8]  John Wieczorek,et al.  Darwin Core: An Evolving Community-Developed Biodiversity Data Standard , 2012, PloS one.

[9]  Gil Nelson,et al.  Five task clusters that enable efficient and effective digitization of biological collections , 2012, ZooKeys.

[10]  Vincent S. Smith,et al.  No specimen left behind: industrial scale digitization of natural history collections , 2012, ZooKeys.

[11]  D. R. Robertson,et al.  Specimen collection: an essential tool. , 2014, Science.

[12]  Oleksandr Holovachov,et al.  Whole-Drawer Imaging of Entomological Collections: Benefits, Limitations and Alternative Applications , 2014 .

[13]  Tim Newbold,et al.  Applications and limitations of museum data for conservation and ecology, with particular attention to species distribution models , 2010 .

[14]  I. Kitching,et al.  Estimating regional species richness of tropical insects from museum data: a comparison of a geography‐based and sample‐based methods , 2007 .

[15]  Arturo H. Ariño APPROACHES TO ESTIMATING THE UNIVERSE OF NATURAL HISTORY COLLECTIONS DATA , 2010 .

[16]  D. Wake,et al.  Coincident mass extirpation of neotropical amphibians with the emergence of the infectious fungal pathogen Batrachochytrium dendrobatidis , 2011, Proceedings of the National Academy of Sciences.

[17]  A. Suarez,et al.  The Value of Museum Collections for Research and Society , 2004 .