Lawrence Berkeley National Laboratory Recent Work Title The environment ontology in 2016 : Bridging domains with increased scope , semantic density , and interoperation Permalink

Background: The Environment Ontology (ENVO; http://www.environmentontology.org/), first described in 2013, is a resource and research target for the semantically controlled description of environmental entities. The ontology's initial aim was the representation of the biomes, environmental features, and environmental materials pertinent to genomic and microbiome-related investigations. However, the need for environmental semantics is common to a multitude of fields, and ENVO's use has steadily grown since its initial description. We have thus expanded, enhanced, and generalised the ontology to support its increasingly diverse applications. Methods: We have updated our development suite to promote expressivity, consistency, and speed: we now develop ENVO in the Web Ontology Language (OWL) and employ templating methods to accelerate class creation. We have also taken steps to better align ENVO with the Open Biological and Biomedical Ontologies (OBO) Foundry principles and interoperate with existing OBO ontologies. Further, we applied text-mining approaches to extract habitat information from the Encyclopedia of Life and automatically create experimental habitat classes within ENVO. Results: Relative to its state in 2013, ENVO's content, scope, and implementation have been enhanced and much of its existing content revised for improved semantic representation. ENVO now offers representations of habitats, environmental processes, anthropogenic environments, and entities relevant to environmental health initiatives and the global Sustainable Development Agenda for 2030. Several branches of ENVO have been used to incubate and seed new ontologies in previously unrepresented domains such as food and agronomy. The current release version of the ontology, in OWL format, is available at http://purl.obolibrary.org/obo/envo.owl. Conclusions: ENVO has been shaped into an ontology which bridges multiple domains including biomedicine, natural and anthropogenic ecology, ‘omics, and socioeconomic development. Through continued interactions with our users and partners, particularly those performing data archiving and sythesis, we anticipate that ENVO’s growth will accelerate in 2017. As always, we invite further contributions and collaboration to advance the semantic representation of the environment, ranging from geographic features and environmental materials, across habitats and ecosystems, to everyday objects in household settings.

[1]  D. Keith,et al.  IUCN Red List of Ecosystems: Implications for Public Policy , 2018 .

[2]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[3]  Richard Gibson,et al.  Value, but high costs in post-deposition data curation , 2016, Database J. Biol. Databases Curation.

[4]  Chengquan Huang,et al.  Conservation policy and the measurement of forests , 2016 .

[5]  Un Desa Transforming our world : The 2030 Agenda for Sustainable Development , 2016 .

[6]  Nathan Wilson,et al.  TraitBank: Practical semantics for organism attribute data , 2016, Semantic Web.

[7]  Nico M. Franz,et al.  Emerging semantics to link phenotype and environment , 2015, PeerJ.

[8]  James F. Meadow,et al.  Microbiota of the indoor environment: a meta-analysis , 2015, Microbiome.

[9]  Andreas Henschel,et al.  Comprehensive Meta-analysis of Ontology Annotated 16S rRNA Profiles Identifies Beta Diversity Clusters of Environmental Bacterial Communities , 2015, PLoS Comput. Biol..

[10]  William Colglazier,et al.  Sustainable development agenda: 2030 , 2015, Science.

[11]  Rachelle M. Jensen,et al.  The ocean sampling day consortium , 2015, GigaScience.

[12]  G. Blöschl,et al.  Bacterial diversity along a 2600 km river continuum , 2015, Environmental microbiology.

[13]  Christian von Mering,et al.  Limits to robustness and reproducibility in the demarcation of operational taxonomic units. , 2015, Environmental microbiology.

[14]  Peer Bork,et al.  Open science resources for the discovery and analysis of Tara Oceans data , 2015, Scientific Data.

[15]  Egon L. Willighagen,et al.  eNanoMapper: harnessing ontologies to enable data integration for nanomaterial risk assessment , 2015, Journal of Biomedical Semantics.

[16]  Janos X. Binder,et al.  DISEASES: Text mining and data integration of disease–gene associations , 2014, bioRxiv.

[17]  Nigel Collier,et al.  Automatic concept recognition using the Human Phenotype Ontology reference and test suite corpora , 2015, Database J. Biol. Databases Curation.

[18]  Christian Bottomley,et al.  Mind the Gap: House Structure and the Risk of Malaria in Uganda , 2015, PloS one.

[19]  Quentin J. Groom,et al.  Piecing together the biogeographic history of Chenopodium vulvaria L. using botanical literature and collections , 2015, PeerJ.

[20]  M. Sogin,et al.  Minimum entropy decomposition: Unsupervised oligotyping for sensitive partitioning of high-throughput marker gene sequences , 2014, The ISME Journal.

[21]  Chris Mungall,et al.  ROBOT: A command-line tool for ontology development , 2015, ICBO.

[22]  Tanya Z. Berardini,et al.  TermGenie – a web-application for pattern-based ontology class generation , 2014, J. Biomed. Semant..

[23]  Jie Zheng,et al.  Meeting report: advancing practical applications of biodiversity ontologies , 2014, Standards in Genomic Sciences.

[24]  Lars Juhl Jensen,et al.  ENVIRONMENTS and EOL: identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life , 2014, bioRxiv.

[25]  Chris Mungall,et al.  Global biotic interactions: An open infrastructure to share and analyze species-interaction datasets , 2014, Ecol. Informatics.

[26]  Frédéric Mahé,et al.  Swarm: robust and fast clustering method for amplicon-based studies , 2014, PeerJ.

[27]  Patrick K. H. Lee,et al.  Indoor-Air Microbiome in an Urban Subway Network: Diversity and Dynamics , 2014, Applied and Environmental Microbiology.

[28]  Patrick R. Leary,et al.  The Encyclopedia of Life v2: Providing Global Access to Knowledge About Life on Earth , 2014, Biodiversity data journal.

[29]  H. Lund,et al.  WHAT IS A FOREST? DEFINITIONS DO MAKE A DIFFERENCE AN EXAMPLE FROM TURKEY , 2014 .

[30]  Christian von Mering,et al.  Ecological Consistency of SSU rRNA-Based Operational Taxonomic Units at a Global Scale , 2014, PLoS Comput. Biol..

[31]  Barry Smith,et al.  Semantics in Support of Biodiversity Knowledge Discovery: An Introduction to the Biological Collections Ontology and Related Ontologies , 2014, PloS one.

[32]  Pelin Yilmaz,et al.  Meeting Report: GBIF hackathon-workshop on Darwin Core and sample data (22-24 May 2013) , 2014, Standards in Genomic Sciences.

[33]  Salvador Capella-Gutiérrez,et al.  PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome , 2013, Nucleic Acids Res..

[34]  Andreas Wilke,et al.  MIxS-BE: a MIxS extension defining a minimum information standard for sequence data from the built environment , 2013, The ISME Journal.

[35]  Kessy Abarenkov,et al.  Resistance and resilience of the forest soil microbiome to logging-associated compaction , 2013, The ISME Journal.

[36]  Barry Smith,et al.  The environment ontology: contextualising biological and biomedical entities , 2013, Journal of Biomedical Semantics.

[37]  Sharon L. Grim,et al.  Oligotyping: differentiating between closely related microbial taxa using 16S rRNA gene data , 2013, Methods in ecology and evolution.

[38]  Olaf Boebel,et al.  FRAM - FRontiers in Arctic marine Monitoring Visions for permanent observations in a gateway to the Arctic Ocean , 2013, 2013 MTS/IEEE OCEANS - Bergen.

[39]  Reed Beaman,et al.  Clarifying Concepts and Terms in Biodiversity Informatics , 2013, Standards in genomic sciences.

[40]  Noah Fierer,et al.  Home Life: Factors Structuring the Bacterial Diversity Found within and between Homes , 2013, PloS one.

[41]  A. Hardisty,et al.  A decadal view of biodiversity informatics: challenges and priorities , 2013, BMC Ecology.

[42]  Alex Hardisty,et al.  UvA-DARE ( Digital Academic Repository ) A decadal view of biodiversity informatics : challenges and priorities , 2013 .

[43]  William W. Nazaroff,et al.  Human Occupancy as a Source of Indoor Airborne Bacteria , 2012, PloS one.

[44]  Grant Dorsey,et al.  Malaria in Uganda: challenges to control on the long road to elimination. II. The path forward. , 2012, Acta tropica.

[45]  John Wieczorek,et al.  Darwin Core: An Evolving Community-Developed Biodiversity Data Standard , 2012, PloS one.

[46]  S. Lewis,et al.  Uberon, an integrative multi-species anatomy ontology , 2012, Genome Biology.

[47]  Rob Knight,et al.  Microbial Biogeography of Public Restroom Surfaces , 2011, PloS one.

[48]  G. Cochrane,et al.  The Genomic Standards Consortium , 2011, PLoS biology.

[49]  Emily S. Charlson,et al.  Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications , 2011, Nature Biotechnology.

[50]  Alan Ruttenberg,et al.  MIREOT: The minimum information to reference an external ontology term , 2009, Appl. Ontology.

[51]  Tanya Z. Berardini,et al.  Cross-product extensions of the Gene Ontology , 2009, J. Biomed. Informatics.

[52]  Alan Ruttenberg,et al.  Ontobee: A Linked Data Server and Browser for Ontology Terms , 2011, ICBO.

[53]  Oliver Hofmann,et al.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level , 2010, Bioinform..

[54]  Francis E. Putz,et al.  Critical need for new definitions of “forest” and “forest degradation” in global climate change agreements , 2009 .

[55]  K. Weathers,et al.  Effects of Air Pollution on Ecosystems and Biological Diversity in the Eastern United States , 2009, Annals of the New York Academy of Sciences.

[56]  J. Elith,et al.  Species Distribution Models: Ecological Explanation and Prediction Across Space and Time , 2009 .

[57]  Erle C. Ellis,et al.  Putting people in the map: anthropogenic biomes of the world , 2008 .

[58]  Andreas Wilke,et al.  phylogenetic and functional analysis of metagenomes , 2022 .

[59]  Tao Zhang,et al.  The Airborne Metagenome in an Indoor Urban Environment , 2008, PloS one.

[60]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[61]  Alexander Belokurov,et al.  Global Ecological Forest Classification and Forest Protected Area Gap Analysis. Analyses and recommendations in view of the 10% target for forest protection under the Convention on Biological Diversity (CBD) , 2008 .

[62]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[63]  Midori A. Harris,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm112 Databases and ontologies OBO-Edit—an ontology editor for biologists , 2007 .

[64]  M. Kearney,et al.  Habitat, environment and niche: what are we modelling? , 2006 .

[65]  Michael L. Morrison,et al.  The habitat concept and a plea for standard terminology , 1997 .