The pea aphid phylome: a complete catalogue of evolutionary histories and arthropod orthology and paralogy relationships for Acyrthosiphon pisum genes

Phylogenetic analyses serve many purposes, including the establishment of orthology relationships, the prediction of protein function and the detection of important evolutionary events. Within the context of the sequencing of the genome of the pea aphid, Acyrthosiphon pisum, we undertook a phylogenetic analysis for every protein of this species. The resulting phylome includes the evolutionary relationships of all predicted aphid proteins and their homologues among 13 other fully‐sequenced arthropods and three out‐group species. Subsequent analyses have revealed multiple gene expansions that are specific to aphids and have served to transfer functional annotations to 4058 pea aphid genes that display one‐to‐one orthology relationships with Drosophila melanogaster annotated genes. All phylogenies and alignments are accessible through the PhylomeDB database. Here we provide a description of this dataset and provide some examples on how can it be exploited.

[1]  O. Collin,et al.  AphidBase: a centralized bioinformatic resource for annotation of the pea aphid genome , 2010, Insect molecular biology.

[2]  G. K. Davis,et al.  Genome Sequence of the Pea Aphid Acyrthosiphon pisum , 2010, PLoS biology.

[3]  Seth M. Barribeau,et al.  Immunity and other defenses in pea aphids, Acyrthosiphon pisum , 2010, Genome Biology.

[4]  Toni Gabaldón,et al.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses , 2009, Bioinform..

[5]  Toni Gabaldón,et al.  The Tree versus the Forest: The Fungal Tree of Life and the Topological Diversity within the Yeast Phylome , 2009, PloS one.

[6]  Kenneth H. Wolfe,et al.  Turning a hobby into a job: How duplicated genes find new functions , 2008, Nature Reviews Genetics.

[7]  T. Gabaldón Large-scale assignment of orthology: back to phylogenetics? , 2008, Genome Biology.

[8]  Peer Bork,et al.  The Genome of the Model Beetle and Pest Tribolium Castaneum Vertebrate-specific Orthologues Insect-specific Orthologues Homology Undetectable Similarity , 2022 .

[9]  Joaquín Dopazo,et al.  PhylomeDB: a database for genome-wide collections of gene phylogenies , 2007, Nucleic Acids Res..

[10]  K. Kjer,et al.  Ancient rapid radiations of insects: challenges for phylogenetic analysis. , 2008, Annual review of entomology.

[11]  G. Pyka-Fościak,et al.  Germ cell cluster formation and ovariole structure in viviparous and oviparous generations of the aphid Stomaphis quercus. , 2008, The International journal of developmental biology.

[12]  J. Johnston,et al.  Body Lice and Head Lice (Anoplura: Pediculidae) Have the Smallest Genomes of Any Hemimetabolous Insect Reported to Date , 2007, Journal of medical entomology.

[13]  P. Mayhew Why are there so many insect species? Perspectives from fossils and phylogenies , 2007, Biological reviews of the Cambridge Philosophical Society.

[14]  Joaquín Dopazo,et al.  Phylemon: a suite of web tools for molecular evolution, phylogenetics and phylogenomics , 2007, Nucleic Acids Res..

[15]  J. Dopazo,et al.  The human phylome , 2007, Genome Biology.

[16]  Ute Baumann,et al.  Estimating the annotation error rate of curated GO database sequence annotations , 2007, BMC Bioinformatics.

[17]  O. Gascuel,et al.  Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. , 2006, Systematic biology.

[18]  A. Douglas Phloem-sap feeding by animals: problems and solutions. , 2006, Journal of experimental botany.

[19]  Robert C. Edgar,et al.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity , 2004, BMC Bioinformatics.

[20]  Joaquín Dopazo,et al.  FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes , 2004, Bioinform..

[21]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[22]  B. Snel,et al.  Genome phylogeny based on gene content , 1999, Nature Genetics.

[23]  O Gascuel,et al.  BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. , 1997, Molecular biology and evolution.

[24]  L. Cooley,et al.  Kelch encodes a component of intercellular bridges in Drosophila egg chambers , 1993, Cell.

[25]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[26]  W. Fitch Distinguishing homologous from analogous proteins. , 1970, Systematic zoology.