The Dynamics and Evolutionary Potential of Domain Loss and Emergence

The wealth of available genomic data presents an unrivaled opportunity to study the molecular basis of evolution. Studies on gene family expansions and site-dependent analyses have already helped establish important insights into how proteins facilitate adaptation. However, efforts to conduct full-scale cross-genomic comparisons between species are challenged by both growing amounts of data and the inherent difficulty in accurately inferring homology between deeply rooted species. Proteins, in comparison, evolve by means of domain rearrangements, a process more amenable to study given the strength of profile-based homology inference and the lower rates with which rearrangements occur. However, adapting to a constantly changing environment can require molecular modulations beyond reach of rearrangement alone. Here, we explore rates and functional implications of novel domain emergence in contrast to domain gain and loss in 20 arthropod species of the pancrustacean clade. Emerging domains are more likely disordered in structure and spread more rapidly within their genomes than established domains. Furthermore, although domain turnover occurs at lower rates than gene family turnover, we find strong evidence that the emergence of novel domains is foremost associated with environmental adaptation such as abiotic stress response. The results presented here illustrate the simplicity with which domain-based analyses can unravel key players of nature's adaptational machinery, complementing the classical site-based analyses of adaptation.

[1]  Arne Elofsson,et al.  A comparison of sequence and structure protein domain families as a basis for structural genomics , 1999, Bioinform..

[2]  Christian Schaefer,et al.  Protein secondary structure appears to be robust under in silico evolution while protein disorder appears not to be , 2010, Bioinform..

[3]  Jessica H. Fong,et al.  Modeling the evolution of protein domain architectures using maximum parsimony. , 2007, Journal of molecular biology.

[4]  Alex Bateman,et al.  Quantifying the mechanisms of domain gain in animal proteins , 2010, Genome Biology.

[5]  Zoran Obradovic,et al.  Length-dependent prediction of protein intrinsic disorder , 2006, BMC Bioinformatics.

[6]  Yun Ding,et al.  On the origin of new genes in Drosophila. , 2008, Genome research.

[7]  G. Caetano-Anollés,et al.  Global phylogeny determined by the combination of protein domains in proteomes. , 2006, Molecular biology and evolution.

[8]  F. Conlon,et al.  The T-box family , 2002, Genome Biology.

[9]  M. Levitt Nature of the protein universe , 2009, Proceedings of the National Academy of Sciences.

[10]  A. Elofsson,et al.  Quantification of the elevated rate of domain rearrangements in metazoa. , 2007, Journal of molecular biology.

[11]  R. Doolittle,et al.  Phylogeny determined by protein domain content. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[12]  J. Farris Phylogenetic Analysis Under Dollo's Law , 1977 .

[13]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[14]  C. Chothia,et al.  Evolution of the Protein Repertoire , 2003, Science.

[15]  A. Elofsson,et al.  Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions. , 2005, Journal of molecular biology.

[16]  T. Bosch,et al.  More than just orphans: are taxonomically-restricted genes important in evolution? , 2009, Trends in genetics : TIG.

[17]  Stefan Lutz,et al.  Circular permutation: a different way to engineer enzyme structure and function. , 2011, Trends in biotechnology.

[18]  Andrew G. Clark,et al.  Comparative Genomics on the Drosophila Phylogenetic Tree , 2009 .

[19]  L. Tjoelker,et al.  Structural and Functional Definition of the Human Chitinase Chitin-binding Domain* , 2000, The Journal of Biological Chemistry.

[20]  G. Pflugfelder omb and circumstance , 2009, Journal of neurogenetics.

[21]  Sarah A Teichmann,et al.  Relative rates of gene fusion and fission in multi-domain proteins. , 2005, Trends in genetics : TIG.

[22]  Adam Godzik,et al.  Strong functional patterns in the evolution of eukaryotic genomes revealed by the reconstruction of ancestral protein domain repertoires , 2011, Genome Biology.

[23]  S. Teichmann,et al.  The relationship between domain duplication and recombination. , 2005, Journal of molecular biology.

[24]  A. von Haeseler,et al.  A phylogenomic approach to resolve the arthropod tree of life. , 2010, Molecular biology and evolution.

[25]  T. Markow,et al.  Drosophila Biology in the Genomic Age , 2007, Genetics.

[26]  A. Elofsson,et al.  Domain rearrangements in protein evolution. , 2005, Journal of molecular biology.

[27]  Ying Wang,et al.  Insights into social insects from the genome of the honeybee Apis mellifera , 2006, Nature.

[28]  Erich Bornberg-Bauer,et al.  Functional and Evolutionary Insights from the Genomes of Three Parasitoid Nasonia Species , 2010, Science.

[29]  J. Farine,et al.  Volatile components of ripe fruits of Morinda citrifolia and their effects on Drosophila , 1996 .

[30]  Markus Affolter,et al.  Receptor serine/threonine kinases implicated in the control of Drosophila body pattern by decapentaplegic , 1994, Cell.

[31]  D. Hartl,et al.  Effects of X-linkage and sex-biased gene expression on the rate of adaptive protein evolution in Drosophila. , 2008, Molecular biology and evolution.

[32]  Gustavo Caetano-Anollés,et al.  The evolutionary mechanics of domain organization in proteomes and the rise of modularity in the protein world. , 2009, Structure.

[33]  Chao Qian,et al.  Population , 1940, State Rankings 2020: A Statistical View of America.

[34]  Andrew D. Moore,et al.  Arrangements in the modular evolution of proteins. , 2008, Trends in biochemical sciences.

[35]  Stefan Götz,et al.  Blast2GO: A Comprehensive Suite for Functional Analysis in Plant Genomics , 2007, International journal of plant genomics.

[36]  Todd H. Oakley,et al.  The Ecoresponsive Genome of Daphnia pulex , 2011, Science.

[37]  F. Díaz-Benjumea,et al.  The role of the T-box gene optomotor-blind in patterning the Drosophila wing. , 2004, Developmental biology.

[38]  S. Koide Generation of new protein functions by nonhomologous combinations and rearrangements of domains and modules. , 2009, Current opinion in biotechnology.

[39]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[40]  Dannie Durand,et al.  Sequence Similarity Network Reveals Common Ancestry of Multidomain Proteins , 2008, PLoS Comput. Biol..

[41]  Joel Dudley,et al.  TimeTree: a public knowledge-base of divergence times among organisms , 2006, Bioinform..

[42]  Ingmar Reuter,et al.  Integr8 and Genome Reviews: integrated views of complete genomes and proteomes , 2004, Nucleic Acids Res..

[43]  Brian R Johnson,et al.  Taxonomically restricted genes are associated with the evolution of sociality in the honey bee , 2011, BMC Genomics.

[44]  Thomas Lengauer,et al.  Improved scoring of functional groups from gene expression data by decorrelating GO graph structure , 2006, Bioinform..

[45]  M. Kanehisa,et al.  Evolutionary history and functional implications of protein domains and their combinations in eukaryotes , 2007, Genome Biology.

[46]  Andrew D Kern,et al.  Novel genes derived from noncoding DNA in Drosophila melanogaster are frequently X-linked and exhibit testis-biased expression. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Chittibabu Guda,et al.  Tracing the origin of functional and conserved domains in the human proteome: implications for protein evolution at the modular level , 2006, BMC Evolutionary Biology.

[48]  E. Bornberg-Bauer,et al.  Domain deletions and substitutions in the modular protein evolution , 2006, The FEBS journal.

[49]  Li Ni,et al.  The Gene Ontology's Reference Genome Project: A Unified Framework for Functional Annotation across Species , 2009, PLoS Comput. Biol..

[50]  E. Bornberg-Bauer,et al.  How do new proteins arise? , 2010, Current opinion in structural biology.

[51]  J. Risler,et al.  Identification of genomic features using microsyntenies of domains: domain teams. , 2005, Genome research.

[52]  Mira V. Han,et al.  Gene Family Evolution across 12 Drosophila Genomes , 2007, PLoS genetics.

[53]  S. Teichmann,et al.  Domain combinations in archaeal, eubacterial and eukaryotic proteomes. , 2001, Journal of molecular biology.

[54]  Casey M. Bergman,et al.  The Evolution of tRNA Genes in Drosophila , 2010, Genome biology and evolution.

[55]  A. Templeton,et al.  Population Genetics of the Developmental Gene optomotor-blind (omb) in Drosophila polymorpha , 2004, Genetics.

[56]  D. Yamamoto,et al.  A Database of Wing Diversity in the Hawaiian Drosophila , 2007, PloS one.

[57]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..