Genome-wide modeling of complex phenotypes in Caenorhabditis elegans and Drosophila melanogaster

BackgroundThe genetic and molecular basis for many intermediate and end stage phenotypes in model systems such as C. elegans and D. melanogaster has long been known to involve pleiotropic effects and complex multigenic interactions. Gene sets are groups of genes that contribute to multiple biological or molecular phenomena. They have been used in the analysis of large molecular datasets such as microarray data, Next Generation sequencing, and other genomic datasets to reveal pleiotropic and multigenic contributions to phenotypic outcomes. Many model systems lack species specific organized phenotype based gene sets to enable high throughput analysis of large molecular datasets.Results and discussionHere, we describe two novel collections of gene sets in C. elegans and D. melanogaster that are based exclusively on genetically determined phenotypes and use a controlled phenotypic ontology. We use these collections to build genome-wide models of thousands of defined phenotypes in both model species. In addition, we demonstrate the utility of these gene sets in systems analysis and in analysis of gene expression-based molecular datasets and show how they are useful in analysis of genomic datasets connecting multigenic gene inputs to complex phenotypes.ConclusionsPhenotypic based gene sets in both C. elegans and D. melanogaster are developed, characterized, and shown to be useful in the analysis of large scale species-specific genomic datasets. These phenotypic gene set collections will contribute to the understanding of complex phenotypic outcomes in these model systems.

[1]  J. Felsenstein An alternating least squares approach to inferring phylogenies from pairwise distances. , 1997, Systematic biology.

[2]  Jim Thurmond,et al.  FlyBase 101 – the basics of navigating FlyBase , 2011, Nucleic Acids Res..

[3]  Peilin Jia,et al.  Gene set analysis of genome-wide association studies: methodological issues and perspectives. , 2011, Genomics.

[4]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[5]  Paul W. Sternberg,et al.  Worm Phenotype Ontology: Integrating phenotype data within and beyond the C. elegans community , 2011, BMC Bioinformatics.

[6]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[7]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[8]  Supriyo De,et al.  Systematic analysis, comparison, and integration of disease based human genetic association data and mouse genetic phenotypic information , 2010, BMC Medical Genomics.

[9]  J. Wood,et al.  Chromatin remodeling in the aging genome of Drosophila , 2010, Aging cell.

[10]  Dennis H Kim,et al.  A Decline in p38 MAPK Signaling Underlies Immunosenescence in Caenorhabditis elegans , 2011, PLoS genetics.

[11]  K. Becker,et al.  Disease and phenotype gene set analysis of disease-based gene expression in mouse and human. , 2010, Physiological genomics.

[12]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[13]  Alan Hubbard,et al.  Age‐related behaviors have distinct transcriptional profiles in Caenorhabditis elegans , 2008, Aging cell.

[14]  Kimberly Van Auken,et al.  WormBase 2012: more genomes, more data, new website , 2011, Nucleic Acids Res..

[15]  Kimberly Van Auken,et al.  WormBase: a comprehensive resource for nematode research , 2009, Nucleic Acids Res..

[16]  Sangsoo Kim,et al.  GSA-SNP: a general approach for gene set analysis of polymorphisms , 2010, Nucleic Acids Res..

[17]  Seon-Young Kim,et al.  Gene-set approach for expression pattern analysis , 2008, Briefings Bioinform..

[18]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Seon-Young Kim,et al.  PAGE: Parametric Analysis of Gene Set Enrichment , 2005, BMC Bioinform..