Impact of ontology evolution on functional analyses

MOTIVATION Ontologies are used in the annotation and analysis of biological data. As knowledge accumulates, ontologies and annotation undergo constant modifications to reflect this new knowledge. These modifications may influence the results of statistical applications such as functional enrichment analyses that describe experimental data in terms of ontological groupings. Here, we investigate to what degree modifications of the Gene Ontology (GO) impact these statistical analyses for both experimental and simulated data. The analysis is based on new measures for the stability of result sets and considers different ontology and annotation changes. RESULTS Our results show that past changes in the GO are non-uniformly distributed over different branches of the ontology. Considering the semantic relatedness of significant categories in analysis results allows a more realistic stability assessment for functional enrichment studies. We observe that the results of term-enrichment analyses tend to be surprisingly stable despite changes in ontology and annotation.

[1]  Erhard Rahm,et al.  Analyzing the Evolution of Life Science Ontologies and Mappings , 2008, DILS.

[2]  Erhard Rahm,et al.  FUNC: a package for detecting significant associations between gene sets and ontological annotations , 2007, BMC Bioinformatics.

[3]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[4]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[5]  R. Nielsen,et al.  Patterns of Positive Selection in Six Mammalian Genomes , 2008, PLoS genetics.

[6]  Andreas Wilke,et al.  Functional analysis of metagenomes and metatranscriptomes using SEED and KEGG , 2011, BMC Bioinformatics.

[7]  Catia Pesquita,et al.  Where GO is Going and What it Means for Ontology Extension , 2011, ICBO.

[8]  Chris Mungall,et al.  AmiGO: online access to ontology and annotation data , 2008, Bioinform..

[9]  Mark A. Musen,et al.  Promptdiff: a fixed-point algorithm for comparing ontology versions , 2002, AAAI/IAAI.

[10]  Olivier Bodenreider,et al.  Bio-ontologies: current trends and future directions , 2006, Briefings Bioinform..

[11]  Rachael P. Huntley,et al.  The GOA database in 2009—an integrated Gene Ontology Annotation resource , 2008, Nucleic Acids Res..

[12]  Alessandro Guffanti,et al.  Splicy: a web-based tool for the prediction of possible alternative splicing events from Affymetrix probeset data , 2007, BMC Bioinformatics.

[13]  Erhard Rahm,et al.  CODEX: exploration of semantic changes between ontology versions , 2012, Bioinform..

[14]  Vipin Kumar,et al.  Incorporating functional inter-relationships into protein function prediction algorithms , 2009, BMC Bioinformatics.

[15]  Fan Zhang,et al.  HPD: an online integrated human pathway database enabling systems biology studies , 2009, BMC Bioinformatics.

[16]  Erhard Rahm,et al.  Discovering Evolving Regions in Life Science Ontologies , 2010, DILS.

[17]  Phillip W. Lord,et al.  Semantic Similarity in Biomedical Ontologies , 2009, PLoS Comput. Biol..

[18]  Li Ni,et al.  A procedure for assessing GO annotation consistency , 2005, ISMB.

[19]  Huaiyu Mi,et al.  Ontology annotation: mapping genomic regions to biological function. , 2007, Current opinion in chemical biology.

[20]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[21]  Erhard Rahm,et al.  Estimating the Quality of Ontology-Based Annotations by Considering Evolutionary Changes , 2009, DILS.

[22]  Erhard Rahm,et al.  COnto-Diff: generation of complex evolution mappings for life science ontologies , 2013, J. Biomed. Informatics.

[23]  Charles A Tilford,et al.  Gene set enrichment analysis. , 2009, Methods in molecular biology.

[24]  Sabina Leonelli,et al.  How the gene ontology evolves , 2011, BMC Bioinformatics.

[25]  Giorgio Valle,et al.  The Gene Ontology project in 2008 , 2007, Nucleic Acids Res..

[26]  Erhard Rahm,et al.  Efficient Management of Biomedical Ontology Versions , 2009, OTM Workshops.

[27]  Jinah Park,et al.  Monitoring the evolutionary aspect of the Gene Ontology to enhance predictability and usability , 2008, BMC Bioinformatics.