JavaProtein Dossier: a novel web-based data visualization tool for comprehensive analysis of protein structure

JavaProtein Dossier ((J)PD) is a new concept, database and visualization tool providing one of the largest collections of the physicochemical parameters describing proteins' structure, stability, function and interaction with other macromolecules. By collecting as many descriptors/parameters as possible within a single database, we can achieve a better use of the available data and information. Furthermore, data grouping allows us to generate different parameters with the potential to provide new insights into the sequence-structure-function relationship. In (J)PD, residue selection can be performed according to multiple criteria. (J)PD can simultaneously display and analyze all the physicochemical parameters of any pair of structures, using precalculated structural alignments, allowing direct parameter comparison at corresponding amino acid positions among homologous structures. In order to focus on the physicochemical (and consequently pharmacological) profile of proteins, visualization tools (showing the structure and structural parameters) also had to be optimized. Our response to this challenge was the use of Java technology with its exceptional level of interactivity. (J)PD is freely accessible (within the Gold Sting Suite) at http://sms.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS, http://trantor.bioc.columbia.edu/SMS and http://www.es.embnet.org/SMS/ (Option: (Java)Protein Dossier).

[1]  C W Hogue,et al.  Cn3D: a new generation of three-dimensional molecular structure viewer. , 1997, Trends in biochemical sciences.

[2]  Chris Sander,et al.  The double cubic lattice method: Efficient approaches to numerical integration of surface area and volume and to dot surface contouring of molecular assemblies , 1995, J. Comput. Chem..

[3]  M. L. Jones,et al.  PDBsum: a Web-based database of summaries and analyses of all PDB structures. , 1997, Trends in biochemical sciences.

[4]  P H Patel,et al.  DNA polymerase active site is highly mutable: evolutionary consequences. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Oleg V. Tsodikov,et al.  Novel computer program for fast exact calculation of accessible and molecular surface areas and average surface curvature , 2002, J. Comput. Chem..

[6]  Amos Bairoch,et al.  ScanProsite: a reference implementation of a PROSITE scanning tool. , 2002, Applied bioinformatics.

[7]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[8]  Edward Rolf Tufte,et al.  The visual display of quantitative information , 1985 .

[9]  Philip E. Bourne,et al.  The Protein Data Bank: A Case Study in Management of Community Data , 2004 .

[10]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[11]  J. Thorner,et al.  The kindest cuts of all: crystal structures of Kex2 and furin reveal secrets of precursor processing. , 2004, Trends in biochemical sciences.

[12]  Amos Bairoch,et al.  A Generalized Profile Syntax for Biomolecular Sequence Motifs and its Function in Automatic Sequence Interpretation , 1994, ISMB.

[13]  Richard Wolfenden,et al.  Comparing the polarities of the amino acids: side-chain distribution coefficients between the vapor phase, cyclohexane, 1-octanol, and neutral aqueous solution , 1988 .

[14]  Itay Mayrose,et al.  Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues , 2002, ISMB.

[15]  Chris Sander,et al.  The HSSP database of protein structure-sequence alignments , 1993, Nucleic Acids Res..

[16]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[17]  Alexandre Alvaro,et al.  STING Millennium: a web-based suite of programs for comprehensive and simultaneous analysis of protein structure and sequence , 2003, Nucleic Acids Res..

[18]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[19]  K. Sharp,et al.  Protein folding and association: Insights from the interfacial and thermodynamic properties of hydrocarbons , 1991, Proteins.

[20]  C. Sander,et al.  Database of homology‐derived protein structures and the structural meaning of sequence alignment , 1991, Proteins.

[21]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.