Comparative protein structure modeling of genes and genomes.

Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.

[1]  D. Phillips,et al.  A possible three-dimensional structure of bovine alpha-lactalbumin based on that of hen's egg-white lysozyme. , 1969, Journal of molecular biology.

[2]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[3]  A. Lesk,et al.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. , 1980, Journal of molecular biology.

[4]  J. Greer,et al.  Model for haptoglobin heavy chain based upon structural homology. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[5]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[6]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[7]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[8]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[9]  J. Moult,et al.  An algorithm for determining the conformation of polypeptide segments in proteins by systematic search , 1986, Proteins.

[10]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[11]  C. Levinthal,et al.  Predicting antibody hypervariable loop conformations II: Minimization and molecular dynamics studies of MCPC603 from many randomly generated loop conformations , 1986, Proteins.

[12]  T. Blundell,et al.  Knowledge based modelling of homologous proteins, Part I: Three-dimensional frameworks derived from the simultaneous superposition of multiple structures. , 1987, Protein engineering.

[13]  S H Bryant,et al.  Correctly folded proteins make twice as many hydrophobic contacts. , 1987, International journal of peptide and protein research.

[14]  T. L. Blundell,et al.  Knowledge-based prediction of protein structures and the design of novel molecules , 1987, Nature.

[15]  M. Sternberg,et al.  Analysis of the relationship between side-chain conformation and secondary structure in globular proteins. , 1987, Journal of molecular biology.

[16]  C. Levinthal,et al.  Predicting antibody hypervariable loop conformation. I. Ensembles of random conformations for ringlike structures , 1987, Biopolymers.

[17]  M. Sternberg,et al.  A strategy for the rapid multiple alignment of protein sequences. Confidence levels from tertiary structure comparisons. , 1987, Journal of molecular biology.

[18]  A. Lesk,et al.  Canonical structures for the hypervariable regions of immunoglobulins. , 1987, Journal of molecular biology.

[19]  M. Karplus,et al.  Prediction of the folding of short polypeptide segments by uniform conformational sampling , 1987, Biopolymers.

[20]  F. Corpet Multiple sequence alignment with hierarchical clustering. , 1988, Nucleic acids research.

[21]  H. Scheraga,et al.  Pattern recognition in the prediction of protein structure. I. Tripeptide conformational probabilities calculated from the amino acid sequence , 1989 .

[22]  Michael J. Sutcliffe,et al.  Knowledge-based protein modelling , 1989 .

[23]  J L Sussman,et al.  A 3D building blocks approach to analyzing and predicting structure of proteins , 1989, Proteins.

[24]  A C Martin,et al.  Modeling antibody hypervariable loops: a combined algorithm. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[25]  S. Wodak,et al.  Modelling the polypeptide backbone with 'spare parts' from known protein structures. , 1989, Protein engineering.

[26]  A. Lesk,et al.  Conformations of immunoglobulin hypervariable regions , 1989, Nature.

[27]  B. L. Sibanda,et al.  Conformation of beta-hairpins in protein structures. A systematic classification with applications to modelling by homology, electron density fitting and protein engineering. , 1989, Journal of molecular biology.

[28]  M. Gribskov,et al.  [9] Profile analysis , 1990 .

[29]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[30]  M Karplus,et al.  Modeling of globular proteins. A distance-based data search procedure for the construction of insertion/deletion regions and Pro----non-Pro mutations. , 1990, Journal of molecular biology.

[31]  J. Greer Comparative modeling methods: Application to the family of the mammalian serine proteases , 1990, Proteins.

[32]  M. Karplus,et al.  Conformational sampling using high‐temperature molecular dynamics , 1990, Biopolymers.

[33]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[34]  R M Stroud,et al.  Prediction of homologous protein structures based on conformational searches and energetics , 1990, Proteins.

[35]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[36]  C. DeLisi,et al.  Determining minimum energy conformations of polypeptides by dynamic programming , 1990, Biopolymers.

[37]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[38]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[39]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[40]  F E Cohen,et al.  Protein folding. Effect of packing density on chain conformation. , 1991, Journal of molecular biology.

[41]  Timothy F. Havel,et al.  A new method for building protein conformations from sequence alignments with homologues of known structure. , 1991, Journal of molecular biology.

[42]  S. Bryant,et al.  The frequency of ion‐pair substructures in proteins is quantitatively related to electrostatic potential: A statistical model for nonbonded interactions , 1991, Proteins.

[43]  R. Bruccoleri,et al.  Application of a directed conformational search for generating 3‐D coordinates for protein structures from α‐carbon coordinates , 1992, Proteins.

[44]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[45]  T J Oldfield,et al.  SQUID: a program for the analysis and display of data from crystallography and molecular dynamics. , 1992, Journal of molecular graphics.

[46]  G A Petsko,et al.  Structure determination of turkey egg-white lysozyme using Laue diffraction data. , 1992, Acta crystallographica. Section B, Structural science.

[47]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[48]  M. Levitt Accurate modeling of protein conformation by automatic segment matching. , 1992, Journal of molecular biology.

[49]  A. Godzik,et al.  Topology fingerprint approach to the inverse protein folding problem. , 1992, Journal of molecular biology.

[50]  A V Finkelstein,et al.  Search for the stable state of a short chain in a molecular field. , 1992, Protein engineering.

[51]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[52]  M C Peitsch,et al.  A 3-D model for the CD40 ligand predicts that it is a compact trimer similar to the tumor necrosis factors. , 1993, International immunology.

[53]  R. Perham,et al.  Prediction of the three‐dimensional structures of the biotinylated domain from yeast pyruvate carboxylase and of the lipoylated H‐protein from the pea leaf glycine cleavage system: A new automated method for the prediction of protein tertiary structure , 1993, Protein science : a publication of the Protein Society.

[54]  Robert E. Bruccoleri,et al.  Application of Systematic Conformational Search to Protein Modeling , 1993 .

[55]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[56]  F E Cohen,et al.  Structure-based inhibitor design by using protein models for the development of antiparasitic agents. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[57]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[58]  D A Agard,et al.  Modeling side-chain conformation for homologous proteins using an energy-based rotamer search. , 1993, Journal of molecular biology.

[59]  M Karplus,et al.  Three-dimensional models of four mouse mast cell chymases. Identification of proteoglycan binding regions and protease-specific antigenic epitopes. , 1993, The Journal of biological chemistry.

[60]  S. Sudarsanam,et al.  An automated method for modeling proteins on known templates using distance geometry , 1993, Protein science : a publication of the Protein Society.

[61]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[62]  T. Yeates,et al.  Verification of protein structures: Patterns of nonbonded atomic interactions , 1993, Protein science : a publication of the Protein Society.

[63]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[64]  Scott R. Presnell,et al.  Erythropoietin structure-function relationships. Mutant proteins that test a model of tertiary structure. , 1993, The Journal of biological chemistry.

[65]  T L Blundell,et al.  An evaluation of the performance of an automated procedure for comparative modelling of protein tertiary structure. , 1993, Protein engineering.

[66]  J. Garnier,et al.  Modeling of protein loops by simulated annealing , 1993, Protein science : a publication of the Protein Society.

[67]  J Bajorath,et al.  Knowledge‐based model building of proteins: Concepts and examples , 1993, Protein science : a publication of the Protein Society.

[68]  P. Argos,et al.  Rotamers: to be or not to be? An analysis of amino acid side-chain conformations in globular proteins. , 1993, Journal of molecular biology.

[69]  Richard S. Judson,et al.  Analysis of the genetic algorithm method of molecular conformation determination , 1993, J. Comput. Chem..

[70]  R C Brower,et al.  Exhaustive conformational search and simulated annealing for models of lattice peptides , 1993, Biopolymers.

[71]  John P. Overington,et al.  A structural basis for sequence comparisons. An evaluation of scoring methodologies. , 1993, Journal of molecular biology.

[72]  P. Koehl,et al.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. , 1994, Journal of molecular biology.

[73]  K. Fidelis,et al.  Comparison of systematic search and database methods for constructing segments of protein structure. , 1994, Protein engineering.

[74]  G Vriend,et al.  Predicting local structural changes that result from point mutations. , 1994, Protein engineering.

[75]  G Vriend,et al.  A novel search method for protein sequence--structure relations using property profiles. , 1994, Protein engineering.

[76]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[77]  T. Blundell,et al.  Knowledge-based protein modeling. , 1994, Critical reviews in biochemistry and molecular biology.

[78]  John P. Overington,et al.  Comparative modelling of major house dust mite allergen Der p I: structure validation using an extended environmental amino acid propensity table. , 1994, Protein engineering.

[79]  L Chiche,et al.  Homology modelling of annexin I: implicit solvation improves side-chain prediction and combination of evaluation criteria allows recognition of different types of conformational error. , 1994, Protein engineering.

[80]  G J Barton,et al.  Structural features can be unconserved in proteins with similar folds. An analysis of side-chain to side-chain contacts secondary structure and accessibility. , 1994, Journal of molecular biology.

[81]  David T. Jones,et al.  Protein superfamilles and domain superfolds , 1994, Nature.

[82]  G Vasmatzis,et al.  Predicting immunoglobulin‐like hypervariable loops , 1994, Biopolymers.

[83]  Fred E. Cohen,et al.  Conformational Sampling of Loop Structures Using Genetic Algorithms , 1994 .

[84]  S. Altschul,et al.  Issues in searching molecular sequence databases , 1994, Nature Genetics.

[85]  M Karplus,et al.  Analysis of two-residue turns in proteins. , 1994, Journal of molecular biology.

[86]  I. Pastan,et al.  Design of interchain disulfide bonds in the framework region of the Fv fragment of the monoclonal antibody B3 , 1994, Proteins.

[87]  Celia W G van Gelder,et al.  A molecular dynamics approach for the generation of complete protein structures from limited coordinate data , 1994, Proteins.

[88]  S. Henikoff,et al.  Protein family classification based on searching a database of blocks. , 1994, Genomics.

[89]  Ruben Abagyan,et al.  Detailed ab initio prediction of lysozyme–antibody complex with 1.6 Å accuracy , 1994, Nature Structural Biology.

[90]  P. Koehl,et al.  Polar and nonpolar atomic environments in the protein core: Implications for folding and binding , 1994, Proteins.

[91]  T. P. Flores,et al.  Multiple protein structure alignment , 1994, Protein science : a publication of the Protein Society.

[92]  John P. Overington,et al.  Derivation of rules for comparative protein modeling from a database of protein structure alignments , 1994, Protein science : a publication of the Protein Society.

[93]  A. Kidera,et al.  Enhanced conformational sampling in Monte Carlo simulations of proteins: application to a constrained peptide. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[94]  A. Mathiowetz,et al.  De novo prediction of polypeptide conformations using dihedral probability grid Monte Carlo methodology , 1995, Protein Science.

[95]  A Sali,et al.  Comparative protein modeling by satisfaction of spatial restraints. , 1996, Molecular medicine today.

[96]  A Sali,et al.  Modeling mutations and homologous proteins. , 1995, Current opinion in biotechnology.

[97]  M J Sippl,et al.  Progress in fold recognition , 1995, Proteins.

[98]  P S Kim,et al.  Repacking protein cores with backbone freedom: structure prediction for coiled coils. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[99]  Rakefet Rosenfeld,et al.  Simultaneous modeling of multiple loops in proteins , 1995, Protein science : a publication of the Protein Society.

[100]  Richard L. Stevens,et al.  Packaging of Proteases and Proteoglycans in the Granules of Mast Cells and Other Hematopoietic Cells , 1995, The Journal of Biological Chemistry.

[101]  W. Pearson Comparison of methods for searching protein sequence databases , 1995, Protein science : a publication of the Protein Society.

[102]  M. Karplus,et al.  Evaluation of comparative protein modeling by MODELLER , 1995, Proteins.

[103]  S. Henikoff,et al.  Automated construction and graphical presentation of protein blocks from unaligned sequences. , 1995, Gene.

[104]  P. Koehl,et al.  A self consistent mean field approach to simultaneous gap closure and side-chain positioning in homology modelling , 1995, Nature Structural Biology.

[105]  Lee Testing homology modeling on mutant proteins: predicting structural and thermodynamic effects in the Ala98-->Val mutants of T4 lysozyme. , 1995, Folding & design.

[106]  W. Goddard,et al.  Prediction of polyelectrolyte polypeptide structures using Monte Carlo conformational search methods with implicit solvation modeling , 1995, Protein science : a publication of the Protein Society.

[107]  Andrej ⩽ali,et al.  Comparative protein modeling by satisfaction of spatial restraints , 1995 .

[108]  Burkhard Rost,et al.  TOPITS: Threading One-Dimensional Predictions Into Three-Dimensional Structures , 1995, ISMB.

[109]  S. Sudarsanam,et al.  Modeling protein loops using a ϕi+1, Ψi dimer database , 1995, Protein science : a publication of the Protein Society.

[110]  Loop Problem in Proteins: Developments on the Monte Carlo , 1996 .

[111]  R. F. Smith,et al.  BCM Search Launcher--an integrated interface to molecular biology data base search and analysis services available on the World Wide Web. , 1996, Genome research.

[112]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[113]  M C Peitsch,et al.  ProMod and Swiss-Model: Internet-based tools for automated comparative protein modelling. , 1996, Biochemical Society transactions.

[114]  D J Kyle,et al.  Accuracy and reliability of the scaling‐relaxation method for loop closure: An evaluation based on extensive and multiple copy conformational samplings , 1996, Proteins.

[115]  R Nussinov,et al.  Fast protein fold recognition via sequence to structure alignment and contact capacity potentials. , 1996, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[116]  J. Thornton,et al.  AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR , 1996, Journal of biomolecular NMR.

[117]  A Sali,et al.  Site-directed mutagenesis of recombinant human beta 2-glycoprotein I identifies a cluster of lysine residues that are critical for phospholipid binding and anti-cardiolipin antibody activity. , 1996, Journal of immunology.

[118]  D. Fischer,et al.  Protein fold recognition using sequence‐derived predictions , 1996, Protein science : a publication of the Protein Society.

[119]  W R Taylor,et al.  Multiple protein sequence alignment: algorithms and gap insertion. , 1996, Methods in enzymology.

[120]  W U Primrose,et al.  A model for human cytochrome P450 2D6 based on homology modeling and NMR studies of substrate binding. , 1996, Biochemistry.

[121]  Andrej Sali,et al.  Ligand Specificity of Brain Lipid-binding Protein* , 1996, The Journal of Biological Chemistry.

[122]  C. Sander,et al.  Verification of protein structures : Side-chain planarity , 1996 .

[123]  S Subbiah,et al.  A structural explanation for the twilight zone of protein sequence homology. , 1996, Structure.

[124]  S. Wodak,et al.  Deviations from standard atomic volumes as a quality measure for protein crystal structures. , 1996, Journal of molecular biology.

[125]  A Godzik,et al.  Structural diversity in a family of homologous proteins. , 1996, Journal of molecular biology.

[126]  P. Wolynes,et al.  Self‐consistently optimized statistical mechanical energy functions for sequence structure alignment , 1996, Protein science : a publication of the Protein Society.

[127]  C. Sander,et al.  Errors in protein structures , 1996, Nature.

[128]  W R Taylor,et al.  Homology modelling by distance geometry. , 1996, Folding & design.

[129]  Louis Carlacci,et al.  Loop problem in proteins: Developments on Monte Carlo simulated annealing approach , 1996 .

[130]  M. Vásquez,et al.  Modeling side-chain conformation. , 1996, Current opinion in structural biology.

[131]  Tim J. P. Hubbard,et al.  SCOP: a structural classification of proteins database , 1998, Nucleic Acids Res..

[132]  D. Fischer,et al.  Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[133]  A. Godzik,et al.  Similarities and differences between nonhomologous proteins with similar folds: evaluation of threading strategies. , 1997, Folding & design.

[134]  Andrew C. R. Martin,et al.  Assessment of comparative modeling in CASP2 , 1997, Proteins.

[135]  R D Appel,et al.  Large‐scale protein modelling and integration with the SWISS‐PROT and SWISS‐2DPAGE databases: The example of Escherichia coli , 1997, Electrophoresis.

[136]  I D Kuntz,et al.  Structure-based design and combinatorial chemistry yield low nanomolar inhibitors of cathepsin D. , 1997, Chemistry & biology.

[137]  David C. Jones,et al.  Progress in protein structure prediction. , 1997, Current opinion in structural biology.

[138]  Roland L. Dunbrack,et al.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool. , 1997, Journal of molecular biology.

[139]  I. Vakser,et al.  Evaluation of GRAMM low‐resolution docking methodology on the hemagglutinin‐antibody complex , 1997, Proteins.

[140]  O. Lund,et al.  Protein distance constraints predicted by neural networks and probability density functions. , 1997, Protein engineering.

[141]  S. Bryant,et al.  Critical assessment of methods of protein structure prediction (CASP): Round II , 1997, Proteins.

[142]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[143]  Andrej Sali,et al.  Crystal Structure of the δ′ Subunit of the Clamp-Loader Complex of E. coli DNA Polymerase III , 1997, Cell.

[144]  Richard H. Lathrop,et al.  Current Limitations to Protein Threading Approaches , 1997, J. Comput. Biol..

[145]  R. Abagyan,et al.  Protein engineering with monomeric triosephosphate isomerase (monoTIM): the modelling and structure verification of a seven-residue loop. , 1997, Protein engineering.

[146]  C. Zhang,et al.  Relations of the numbers of protein sequences, families and folds. , 1997, Protein engineering.

[147]  S. Jones,et al.  Prediction of protein-protein interaction sites using patch analysis. , 1997, Journal of molecular biology.

[148]  T. Blundell,et al.  Predicting the conformational class of short and medium size loops connecting regular secondary structures: application to comparative modelling. , 1997, Journal of molecular biology.

[149]  R Sánchez,et al.  Advances in comparative protein-structure modelling. , 1997, Current opinion in structural biology.

[150]  M. Levitt,et al.  A structural census of the current population of protein sequences. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[151]  M. Karplus,et al.  PDB-based protein loop prediction: parameters for selection and methods for optimization. , 1997, Journal of molecular biology.

[152]  M Levitt,et al.  Competitive assessment of protein fold recognition and alignment accuracy , 1997, Proteins.

[153]  Baldomero Oliva,et al.  An automated classification of the structure of protein loops. , 1997, Journal of molecular biology.

[154]  R Sánchez,et al.  Evaluation of comparative protein structure modeling by MODELLER‐3 , 1997, Proteins.

[155]  A E Torda,et al.  Perspectives in protein-fold recognition. , 1997, Current opinion in structural biology.

[156]  J Skolnick,et al.  Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity. , 1998, Journal of molecular biology.

[157]  A. Sali 100,000 protein structures for the biologist , 1998, Nature Structural Biology.

[158]  S. Kim,et al.  Structure-based assignment of the biochemical function of a hypothetical protein: a test case of structural genomics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[159]  J M Thornton,et al.  Validation of protein models derived from experiment. , 1998, Current opinion in structural biology.

[160]  J. Thompson,et al.  Multiple sequence alignment with Clustal X. , 1998, Trends in biochemical sciences.

[161]  Stephen K. Burley,et al.  Crystal Structure of a GCN5-Related N-acetyltransferase Serratia marcescens Aminoglycoside 3-N-acetyltransferase , 1998, Cell.

[162]  A. Godzik,et al.  Fold and function predictions for Mycoplasma genitalium proteins. , 1998, Folding & design.

[163]  Jay W. Ponder,et al.  Protein structure prediction using a combination of sequence homology and global energy minimization: II. Energy functions , 1998, J. Comput. Chem..

[164]  M. Levitt,et al.  Accuracy of side‐chain prediction upon near‐native protein backbones generated by ab initio folding methods , 1998, Proteins.

[165]  C. Chothia,et al.  Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[166]  A. Sali,et al.  Large-scale protein structure modeling of the Saccharomyces cerevisiae genome. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[167]  P Bork,et al.  Homology-based fold predictions for Mycoplasma genitalium proteins. , 1998, Journal of molecular biology.

[168]  M. Levitt,et al.  A unified statistical framework for sequence comparison and structure comparison. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[169]  D. Haussler,et al.  Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. , 1998, Journal of molecular biology.

[170]  W. Pearson Empirical statistical estimates for sequence similarity searches. , 1998, Journal of molecular biology.

[171]  M Karplus,et al.  Protein sidechain conformer prediction: a test of the energy function. , 1998, Folding & design.

[172]  R Samudrala,et al.  A graph-theoretic algorithm for comparative modeling of protein structure. , 1998, Journal of molecular biology.

[173]  J. Newman,et al.  Class‐directed structure determination: Foundation for a protein structure initiative , 1998, Protein science : a publication of the Protein Society.

[174]  Chris Sander,et al.  Who checks the checkers? Four validation tools applied to eight atomic resolution structures. EU 3-D Validation Network. , 1998, Journal of molecular biology.

[175]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[176]  Christophe G. Lambert,et al.  Comparative analysis of seven multiple protein sequence alignment servers: clues to enhance reliability of predictions , 1998, Bioinform..

[177]  M J Sternberg,et al.  Misleading local sequence alignments: implications for comparative protein modelling. , 1998, Protein engineering.

[178]  A D Baxevanis,et al.  Practical aspects of multiple sequence alignment. , 1998, Methods of biochemical analysis.

[179]  Raffaele Giancarlo,et al.  Sequence alignment in molecular biology , 1998, Mathematical Support for Molecular Biology.

[180]  M Mezei,et al.  Chameleon sequences in the PDB. , 1998, Protein engineering.

[181]  F. Melo,et al.  Assessing protein structures with a non-local atomic interaction energy. , 1998, Journal of molecular biology.

[182]  A. Fiser,et al.  Convergent evolution of Trichomonas vaginalis lactate dehydrogenase from malate dehydrogenase. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[183]  J. Wójcik,et al.  New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification. , 1999, Journal of molecular biology.

[184]  T. Alwyn Jones,et al.  CASP3 comparative modeling evaluation , 1999, Proteins.

[185]  A. Bairoch,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999 , 1999, Nucleic Acids Res..

[186]  James E. Bray,et al.  The CATH Database provides insights into protein structure/function relationships , 1999, Nucleic Acids Res..

[187]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[188]  D Gorse,et al.  Prediction of the location and type of β‐turns in proteins using neural networks , 1999, Protein science : a publication of the Protein Society.

[189]  R A Friesner,et al.  Prediction of loop geometries using a generalized born model of solvation effects , 1999, Proteins.

[190]  A. Sali,et al.  Structural genomics: beyond the Human Genome Project , 1999, Nature Genetics.

[191]  M. Karplus,et al.  Discrimination of the native from misfolded protein models with an energy function including implicit solvation. , 1999, Journal of molecular biology.

[192]  Chris Sander,et al.  Protein folds and families: sequence and structure alignments , 1999, Nucleic Acids Res..

[193]  Roberto Sánchez,et al.  ModBase: A database of comparative protein structure models , 1999, Bioinform..

[194]  M J Sternberg,et al.  Progress in protein structure prediction: assessment of CASP3. , 1999, Current opinion in structural biology.

[195]  Michael Levitt,et al.  A brighter future for protein structure prediction , 1999, Nature Structural Biology.

[196]  M. Sternberg,et al.  Benchmarking PSI-BLAST in genome annotation. , 1999, Journal of molecular biology.

[197]  E V Koonin,et al.  A phylogenetic approach to target selection for structural genomics: solution structure of YciH. , 1999, Nucleic acids research.

[198]  David C. Jones,et al.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. , 1999, Journal of molecular biology.

[199]  Andrej Sali,et al.  Comparative Protein Structure Modeling in Genomics , 1999 .

[200]  D Fischer,et al.  CAFASP‐1: Critical assessment of fully automated structure prediction methods , 1999, Proteins.

[201]  G. Montelione,et al.  A banner year for membranes , 1999, Nature Structural Biology.

[202]  Steven E. Brenner,et al.  The PRESAGE database for structural genomics , 1999, Nucleic Acids Res..

[203]  T F Smith,et al.  The art of matchmaking: sequence alignment methods and their structural implications. , 1999, Structure.

[204]  A. Sali,et al.  lynx1, an Endogenous Toxin-like Modulator of Nicotinic Acetylcholine Receptors in the Mammalian CNS , 1999, Neuron.

[205]  José M. Mas,et al.  Refinement of modelled structures by knowledge-based energy profiles and secondary structure prediction: Application to the human procarboxypeptidase A2 , 2000, J. Comput. Aided Mol. Des..

[206]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[207]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[208]  A. Sali,et al.  Modeling of loops in protein structures , 2000, Protein science : a publication of the Protein Society.

[209]  Geoffrey J. Barton,et al.  Protein Sequence Alignment and Database Scanning , 2001 .

[210]  G. Schuler,et al.  Sequence alignment and database searching. , 2001, Methods of biochemical analysis.

[211]  Narayanan Eswar,et al.  MODBASE, a database of annotated comparative protein structure models , 2002, Nucleic Acids Res..

[212]  Helen R. Saibil,et al.  Challenges at the frontiers of structural biology , 2002, Nature Structural Biology.

[213]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .