Bioinformatics in Proteomics: A Review on Methods and Algorithms

ABSTRACT It is often said that bioinformatics is a knowledge based discipline. This means that many of the search and prediction methods that have been used to greatest effect in bioinformatics exploit information that has already been accumulated about the problem of interest, rather than working from first principles. Most of the methods and algorithms discussed in this paper adopt these knowledge-based approaches for protein studies. Typically we have some given examples i.e. data of a given class or function, and we try to identify patterns in that data which characterize these sequences or structures and distinguish them from others that are not in this class. The purpose of this paper is to describe the basic conceptual methods and adjacent algorithms and applications that are used to obtain better and more reliable information of the studied characteristic patterns.

[1]  P. Higgs,et al.  Bioinformatics and Molecular Evolution , 2004, Comparative and Functional Genomics.

[2]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[3]  P. Argos,et al.  Incorporation of non-local interactions in protein secondary structure prediction from the amino acid sequence. , 1996, Protein engineering.

[4]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[5]  Christian Cole,et al.  The Jpred 3 secondary structure prediction server , 2008, Nucleic Acids Res..

[6]  Anirvan M. Sengupta,et al.  Better estimation of protein-DNA interaction parameters improve prediction of functional sites , 2008, BMC biotechnology.

[7]  Damian Counsell,et al.  Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/cfg.486 Book Review , 2022 .

[8]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[9]  Dinesh Gupta,et al.  Molecular Modeling Studies of the Interaction Between Plasmodium falciparum HslU and HslV Subunits , 2009, Journal of biomolecular structure & dynamics.

[10]  Thomas D. Otto,et al.  AnEnPi: identification and annotation of analogous enzymes , 2008, BMC Bioinformatics.

[11]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[12]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[13]  Lukasz A. Kurgan,et al.  PFRES: protein fold classification by using evolutionary information and predicted secondary structure , 2007, Bioinform..

[14]  Richard M. Jackson,et al.  An evaluation of automated homology modelling methods at low target-template sequence similarity , 2007, Bioinform..

[15]  Burkhard Rost,et al.  The PredictProtein server , 2003, Nucleic Acids Res..

[16]  R. Podlipná,et al.  Understanding Bioinformatics , 2009, Biologia Plantarum.

[17]  Liam J. McGuffin,et al.  Protein structure prediction servers at University College London , 2005, Nucleic Acids Res..

[18]  Simon Parsons,et al.  Bioinformatics: The Machine Learning Approach by P. Baldi and S. Brunak, 2nd edn, MIT Press, 452 pp., $60.00, ISBN 0-262-02506-X , 2004, The Knowledge Engineering Review.

[19]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[20]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[21]  Yan-Hong Zhou,et al.  A Contact Energy Function Considering Residue Hydrophobic Environment and Its Application in Protein Fold Recognition , 2016, Genomics, proteomics & bioinformatics.

[22]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[23]  A. Laio,et al.  Predicting the effect of a point mutation on a protein fold: the villin and advillin headpieces and their Pro62Ala mutants. , 2008, Journal of molecular biology.

[24]  Ning Zhang,et al.  Corra: Computational framework and tools for LC-MS discovery and targeted mass spectrometry-based proteomics , 2008, BMC Bioinformatics.

[25]  Tatsuya Yoshikawa,et al.  Development of an Affinity Evaluation and Prediction System by Using the Shape complementarity Characteristic between proteins , 2008, J. Bioinform. Comput. Biol..

[26]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.