Ontological Foundation for Protein Data Models

In this paper we propose a Protein Ontology that integrates protein data and information from various protein data sources. The Protein Ontology provides the technical and scientific infrastructure and knowledge needed to describe and analyze relationships between proteins. It draws on established protein data sources such as PDB, SCOP, and OMIM. The Protein Ontology describes protein sequence and structure information, the protein folding process, cellular functions of proteins, molecular bindings internal and external to proteins, and constraints affecting the final protein conformation. We also created a database of 10 major prion proteins available in various protein data sources, based on the vocabulary provided by the Protein Ontology. Details about the Protein Ontology are available online at http://www.proteinontology.info/.
