Fun : predicting enzyme function from structure

The ‘omics’ revolution is causing a flurry of data that all needs to be annotated for it to become useful. Sequences of proteins of unknown function can be annotated with a putative function by comparing them with proteins of known function. This form of annotation is typically performed with BLAST or similar software. Structural genomics is nowadays also bringing us three dimensional structures of proteins with unknown function. We present here software that can be used when sequence comparisons fail to determine the function of a protein with known structure but unknown function. The software, called 3D-Fun, is implemented as a server that runs at several European institutes and is freely available for everybody at all these sites. The 3D-Fun servers accept protein coordinates in the standard PDB format and compare them with all known protein structures by 3D structural superposition using the 3D-Hit software. If structural hits are found with proteins with known function, these are listed together with their function and some vital comparison statistics. This is conceptually very similar in 3D to what BLAST does in 1D. Additionally, the superposition results are displayed using interactive graphics facilities. Currently, the 3D-Fun system only predicts enzyme function but an expanded version with Gene Ontology predictions will be available soon. The server can be accessed at http://3dfun.bioinfo.pl/ or at http://3dfun.cmbi.ru.nl/.

[1]  Timothy B. Stockwell,et al.  The Diploid Genome Sequence of an Individual Human , 2007, PLoS biology.

[2]  A. Herráez Biomolecules in the computer: Jmol to the rescue , 2006, Biochemistry and molecular biology education : a bimonthly publication of the International Union of Biochemistry and Molecular Biology.

[3]  H. Sugimoto,et al.  Crystal structure of human indoleamine 2,3-dioxygenase: catalytic mechanism of O2 incorporation by a heme-containing dioxygenase. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Dariusz Plewczynski,et al.  PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics , 2006, BMC Bioinformatics.

[5]  J. Thornton,et al.  Predicting protein function from sequence and structural data. , 2005, Current opinion in structural biology.

[6]  Adam Godzik,et al.  Clustering of highly homologous sequences to reduce the size of large protein databases , 2001, Bioinform..

[7]  Fionn Murtagh,et al.  A Survey of Recent Advances in Hierarchical Clustering Algorithms , 1983, Comput. J..

[8]  Tadhg P Begley,et al.  Crystal structure and mechanism of tryptophan 2,3-dioxygenase, a heme enzyme involved in tryptophan catabolism and in quinolinate biosynthesis. , 2007, Biochemistry.

[9]  R. Durbin,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[10]  Jack A. M. Leunissen,et al.  EXProt: a database for proteins with an experimentally verified function , 2002, Nucleic Acids Res..

[11]  Leszek Rychlewski,et al.  3D-Hit: fast structural comparison of proteins. , 2002, Applied bioinformatics.

[12]  The International HapMap Consortium,et al.  A physical map of the human genome , 2001 .

[13]  D. Defays,et al.  An Efficient Algorithm for a Complete Link Method , 1977, Comput. J..