Cocos: Constructing multi-domain protein phylogenies

Phylogenies of multi-domain proteins have to incorporate macro-evolutionary events, which dramatically increases the complexity of their construction. We present an application that infers ancestral multi-domain proteins from a species tree and domain phylogenies. Because the individual domain phylogenies are often incongruent, we provide diagnostics for identifying and reconciling implausible topologies. We implement and extend the algorithmic approach suggested by Behzadi and Vingron (2006).

[1] Oliver Eulenstein, et al. The Plexus Model for the Inference of Ancestral Multidomain Proteins, 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2] L. Holm, et al. The Pfam protein families database, 2005, Nucleic Acids Res.

[3] Sean R. Eddy, et al. A new generation of homology search tools based on probabilistic inference, 2009, Genome informatics. International Conference on Genome Informatics.

[4] The UniProt Consortium, et al. The Universal Protein Resource (UniProt) 2009, 2008, Nucleic Acids Res.

[5] E. Koonin, et al. Evolution of protein domain promiscuity in eukaryotes, 2008, Genome research.

[6] E. Sonnhammer, et al. Domain tree-based analysis of protein architecture evolution, 2008, Molecular biology and evolution.

[7] Dannie Durand, et al. Domain Architecture Comparison for Multidomain Homology Identification, 2007, J. Comput. Biol.

[8] Martin Vingron, et al. Reconstructing Domain Compositions of Ancestral Multi-domain Proteins, 2006, Comparative Genomics.

[9] Cathy H. Wu, et al. The Universal Protein Resource (UniProt), 2004, Nucleic Acids Res.

[10] O. Gascuel, et al. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood, 2003, Systematic biology.

[11] Dan Gusfield, et al. Partition-distance: A problem and class of perfect graphs arising in clustering, 2002, Inf. Process. Lett.

[12] Sarah A. Teichmann, et al. An insight into domain combinations, 2001, ISMB.

[13] Dannie Durand, et al. NOTUNG: A Program for Dating Gene Duplications and Optimizing Gene Family Trees, 2000, J. Comput. Biol.

[14] D. Balciunas, et al. Evidence of domain swapping within the jumonji family of transcription factors, 2000, Trends in biochemical sciences.

[15] R. Doolittle. The origins and evolution of eukaryotic proteins, 1995, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[16] Huang, et al. An Efficient General Cooling Schedule for Simulated Annealing, 1986.