Measuring the level of activity in community built bio-ontologies

In this paper we explore the measurement of activity in ontology projects as an aspect of community ontology building. When choosing whether to use an ontology or whether to participate in its development, having some knowledge of how actively that ontology is developed is an important issue. Our knowledge of biology grows and changes and an ontology must adapt to keep pace with those changes and also adapt with respect to other ontologies and organisational principles. In essence, we need to know if there is an 'active' community involved with a project or whether a given ontology is inactive or moribund. We explore the use of additions, deletions and changes to ontology files, the regularity and frequency of releases, and the number of ontology repository updates to an ontology as the basis for measuring activity in an ontology. We present our results of this study, which show a dramatic range of activity across some of the more prominent community ontologies, illustrating very active and mature efforts through to those which appear to have become dormant for a number of possible reasons. We show that global activity within the community has remained at a similar level over the last 2 years. Measuring additions, deletions and changes, together with release frequency, appear to be useful metrics of activity and useful pointers towards future behaviour. Measuring who is making edits to ontologies is harder to capture; this raises issues of record keeping in ontology projects and in micro-credit, although we have identified one ontologist that appears influential across many community efforts; a Super-Ontologist. We also discuss confounding factors in our activity metric and discuss how it can be improved and adopted as an assessment criterion for community ontology development. Overall, we show that it is possible to objectively measure the activity in an ontology and to make some prediction about future activity.

[1]  Asunción Gómez-Pérez,et al.  Ontology Evaluation , 2004, Handbook on Ontologies.

[2]  Francisco J. García-Peñalvo,et al.  A Survey on Ontology Metrics , 2010, WSKS.

[3]  Mark C. Paulk,et al.  Capability Maturity Model for Software , 2001 .

[4]  Chris F. Taylor,et al.  Survey-based naming conventions for use in OBO Foundry ontology development , 2009, BMC Bioinformatics.

[5]  Nicola Guarino,et al.  An Overview of OntoClean , 2004, Handbook on Ontologies.

[6]  Jesualdo Tomás Fernández-Breis,et al.  A Quality Evaluation Framework for Bio-Ontologies , 2009 .

[7]  Natalya F. Noy,et al.  BioPortal: Ontologies and Integrated Data Resources at the Click of a Mouse , 2009 .

[8]  Yorick Wilks,et al.  Data Driven Ontology Evaluation , 2004, LREC.

[9]  Tania Tudorache,et al.  An analysis of collaborative patterns in large-scale ontology development projects , 2011, K-CAP '11.

[10]  Nathalie Aussenac-Gilles,et al.  OQuaRE: A SQuaRE-based Approach for Evaluating the Quality of Ontologies , 2011, J. Res. Pract. Inf. Technol..

[11]  Kent A. Spackman,et al.  Rates of Change in a Large Clinical Terminology: Three Years Experience with SNOMED Clinical Terms , 2005, AMIA.

[12]  Michael Gruninger,et al.  Methodology for the Design and Evaluation of Ontologies , 1995, IJCAI 1995.

[13]  Mark C. Paulk,et al.  Capability Maturity Model for Software, Version 1.1 , 1993 .

[14]  Sean Bechhofer,et al.  The OWL API: A Java API for Working with OWL 2 Ontologies , 2009, OWLED.

[15]  R. Porzel,et al.  A Task-based Approach for Ontology Evaluation , 2022 .

[16]  Shari Lawrence Pfleeger,et al.  Software Metrics: Progress after 25 Years? , 2008, IEEE Software.

[17]  Dennis Lee,et al.  Implications of SNOMED CT versioning , 2011, Int. J. Medical Informatics.

[18]  Kei-Hoi Cheung,et al.  Using semantic web rules to reason on an ontology of pseudogenes , 2010, Bioinform..

[19]  Martin Boeker,et al.  Unintended consequences of existential quantifications in biomedical ontologies , 2011, BMC Bioinformatics.

[20]  Barry Smith,et al.  On the Application of Formal Principles to Life Science Data: a Case Study in the Gene Ontology , 2004, DILS.

[21]  Marko Grobelnik,et al.  A SURVEY OF ONTOLOGY EVALUATION TECHNIQUES , 2005 .

[22]  William R. Hogan,et al.  Natural Language Processing methods and systems for biomedical ontology learning , 2011, J. Biomed. Informatics.

[23]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[24]  Steffen Staab,et al.  Handbook on Ontologies in Information Systems , 2003 .

[25]  L. Ohno-Machado Journal of Biomedical Informatics , 2001 .

[26]  Barry Smith,et al.  Biodynamic ontology: applying BFO in the biomedical domain. , 2004, Studies in health technology and informatics.