Machine Learning Algorithms for Protein Structure Prediction

OF THE DISSERTATION xviii

[1]  Arne Elofsson,et al.  3D-Jury: A Simple Approach to Improve Protein Structure Predictions , 2003, Bioinform..

[2]  David T. Jones,et al.  Rapid protein domain assignment from amino acid sequence using predicted secondary structure , 2002, Protein science : a publication of the Protein Society.

[3]  S. Betz Disulfide bonds and the stability of globular proteins , 1993, Protein science : a publication of the Protein Society.

[4]  B. Honig,et al.  Protein structure prediction: inroads to biology. , 2005, Molecular cell.

[5]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[6]  A. Fersht,et al.  Engineered disulfide bonds as probes of the folding pathway of barnase: increasing the stability of proteins against the rate of denaturation. , 1993, Biochemistry.

[7]  M. Levitt Accurate modeling of protein conformation by automatic segment matching. , 1992, Journal of molecular biology.

[8]  B Honig,et al.  Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[10]  Obradovic,et al.  Predicting Protein Disorder for N-, C-, and Internal Regions. , 1999, Genome informatics. Workshop on Genome Informatics.

[11]  Brian G. Jones,et al.  Ichnology of the Pleistocene Ironshore Formation, Grand Cayman Island, British West Indies , 1988 .

[12]  I W Hunter,et al.  3D-1D threading methods for protein fold recognition. , 2000, Pharmacogenomics.

[13]  Hongyi Zhou,et al.  Quantifying the effect of burial of amino acid residues on protein stability , 2003, Proteins.

[14]  John D. Westbrook,et al.  The PDB Format, mmCIF Formats, and Other Data Formats , 2005 .

[15]  Ralf Zimmer,et al.  SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles , 2006, Bioinform..

[16]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[17]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[18]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[19]  Pierre Baldi,et al.  ICBS: a database of interactions between protein chains mediated by ?-sheet formation , 2004, Bioinform..

[20]  Pierre Baldi,et al.  SCRATCH: a protein structure and structural feature prediction server , 2005, Nucleic Acids Res..

[21]  Tatsuya Akutsu,et al.  Protein homology detection using string alignment kernels , 2004, Bioinform..

[22]  Frances M. G. Pearl,et al.  The CATH protein family database: A resource for structural and functional annotation of genomes , 2002, Proteomics.

[23]  T L Blundell,et al.  Prediction of the stability of protein mutants based on structural environment-dependent amino acid substitution and propensity tables. , 1997, Protein engineering.

[24]  Li Liao,et al.  Combining pairwise sequence similarity and support vector machines for remote protein homology detection , 2002, RECOMB '02.

[25]  P. T. Szymanski,et al.  Adaptive mixtures of local experts are source coding solutions , 1993, IEEE International Conference on Neural Networks.

[26]  M J Sternberg,et al.  Enhancement of protein modeling by human intervention in applying the automatic programs 3D‐JIGSAW and 3D‐PSSM , 2001, Proteins.

[27]  Yoshua Bengio,et al.  Input-output HMMs for sequence processing , 1996, IEEE Trans. Neural Networks.

[28]  Dong Xu,et al.  PROSPECT II: protein structure prediction program for genome-scale applications. , 2003, Protein engineering.

[29]  M. Karplus,et al.  Effective energy functions for protein structure prediction. , 2000, Current opinion in structural biology.

[30]  L Regan,et al.  Modulating Protein Folding Rates in Vivo and in Vitro by Side-chain Interactions between the Parallel β Strands of Green Fluorescent Protein* , 2000, The Journal of Biological Chemistry.

[31]  Janet M. Thornton,et al.  From protein structure to biochemical function? , 2004, Journal of Structural and Functional Genomics.

[32]  M. Sternberg,et al.  Enhanced genome annotation using structural profiles in the program 3D-PSSM. , 2000, Journal of molecular biology.

[33]  H. Scheraga,et al.  Disulfide bonds and protein folding. , 2000, Biochemistry.

[34]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[35]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[36]  L Serrano,et al.  Development of the multiple sequence approximation within the AGADIR model of alpha-helix formation: comparison with Zimm-Bragg and Lifson-Roig formalisms. , 1997, Biopolymers.

[37]  D. Fischer,et al.  Protein fold recognition using sequence‐derived predictions , 1996, Protein science : a publication of the Protein Society.

[38]  S J Wodak,et al.  Contribution of the hydrophobic effect to protein stability: analysis based on simulations of the Ile-96----Ala mutation in barnase. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[39]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[40]  A. Godzik,et al.  Topology fingerprint approach to the inverse protein folding problem. , 1992, Journal of molecular biology.

[41]  C. Chothia,et al.  Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. , 2001, Journal of molecular biology.

[42]  Anders Krogh,et al.  Hidden Markov models for sequence analysis: extension and analysis of the basic method , 1996, Comput. Appl. Biosci..

[43]  J. Skolnick,et al.  Ab initio folding of proteins using restraints derived from evolutionary information , 1999, Proteins.

[44]  T. L. Blundell,et al.  Knowledge-based prediction of protein structures and the design of novel molecules , 1987, Nature.

[45]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[46]  E. Lindahl,et al.  Identification of related proteins on family, superfamily and fold level. , 2000, Journal of molecular biology.

[47]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[48]  Piero Fariselli,et al.  Prediction of the disulfide‐bonding state of cysteines in proteins at 88% accuracy , 2002, Protein science : a publication of the Protein Society.

[49]  R. A. George,et al.  Snapdragon: a Method to Delineate Protein Structural Domains from Sequence Data , 2022 .

[50]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[51]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[52]  K. Ginalski Comparative modeling for protein structure prediction. , 2006, Current opinion in structural biology.

[53]  Lei Xie,et al.  Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling , 2003, Proteins.

[54]  Lars Malmström,et al.  Automated prediction of CASP‐5 structures using the Robetta server , 2003, Proteins.

[55]  Liam J. McGuffin,et al.  Protein structure prediction servers at University College London , 2005, Nucleic Acids Res..

[56]  B. Rost,et al.  Sequence-based prediction of protein domains. , 2004, Nucleic acids research.

[57]  A. Elofsson,et al.  Hidden Markov models that use predicted secondary structures for fold recognition , 1999, Proteins.

[58]  David C. Jones,et al.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. , 1999, Journal of molecular biology.

[59]  M. Perutz,et al.  Structure of haemoglobin: a three-dimensional Fourier synthesis at 5.5-A. resolution, obtained by X-ray analysis. , 1960, Nature.

[60]  S. Bryant,et al.  An empirical energy function for threading protein sequence through the folding motif , 1993, Proteins.

[61]  D Fischer,et al.  CAFASP‐1: Critical assessment of fully automated structure prediction methods , 1999, Proteins.

[62]  Harpreet Kaur Saini,et al.  BIOINFORMATICS APPLICATIONS NOTE Structural bioinformatics Meta-DP: domain prediction meta-server , 2022 .

[63]  C. Sander,et al.  Database of homology‐derived protein structures and the structural meaning of sequence alignment , 1991, Proteins.

[64]  F. Sanger,et al.  The amino-acid sequence in the glycyl chain of insulin. I. The identification of lower peptides from partial hydrolysates. , 1953, The Biochemical journal.

[65]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[66]  Leszek Rychlewski,et al.  Fold prediction by a hierarchy of sequence, threading, and modeling methods , 1998, Protein science : a publication of the Protein Society.

[67]  Robert B. Russell,et al.  GlobPlot: exploring protein sequences for globularity and disorder , 2003, Nucleic Acids Res..

[68]  Paolo Frasconi,et al.  A recursive connectionist approach for predicting disulfide connectivity in proteins , 2003, SAC '03.

[69]  J Skolnick,et al.  Defrosting the frozen approximation: PROSPECTOR— A new approach to threading , 2001, Proteins.

[70]  R B Russell,et al.  Fold recognition from sequence comparisons , 2001, Proteins.

[71]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[72]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[73]  Bernhard Schölkopf,et al.  Support Vector Machine Applications in Computational Biology , 2004 .

[74]  B. Honig Protein folding: from the levinthal paradox to structure prediction. , 1999, Journal of molecular biology.

[75]  Pierre Baldi,et al.  Three-stage prediction of protein ?-sheets by neural networks, alignments and graph algorithms , 2005, ISMB.

[76]  David A. Lee,et al.  Progress towards mapping the universe of protein folds , 2004, Genome Biology.

[77]  L. Pauling,et al.  Configuration of Polypeptide Chains , 1951, Nature.

[78]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[79]  A G Murzin,et al.  Distant homology recognition using structural classification of proteins , 1997, Proteins.

[80]  Marianne Rooman,et al.  PoPMuSiC, rationally designing point mutations in protein structures , 2002, Bioinform..

[81]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[82]  Paolo Frasconi,et al.  A two-stage SVM architecture for predicting the disulfide bonding state of cysteines , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[83]  A. Tropsha,et al.  Four-body potentials reveal protein-specific correlations to stability changes caused by hydrophobic core mutations. , 2001, Journal of molecular biology.

[84]  Cédric Notredame,et al.  3DCoffee: combining protein sequences and structures within multiple sequence alignments. , 2004, Journal of molecular biology.

[85]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction—Round VII , 2007, Proteins.

[86]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[87]  Bruce E. Shapiro,et al.  Cellerator: extending a computer algebra system to include biochemical arrows for signal transduction simulations , 2003, Bioinform..

[88]  A. Sali,et al.  Alignment of protein sequences by their profiles , 2004, Protein science : a publication of the Protein Society.

[89]  Pierre Baldi,et al.  Prediction of contact maps by GIOHMMs and recurrent neural networks using lateral propagation from all four cardinal corners , 2002, ISMB.

[90]  Andrej Sali,et al.  Comparative Protein Structure Modeling and its Applications to Drug Discovery , 2004 .

[91]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[92]  Marianne Rooman,et al.  Prediction of stability changes upon single-site mutations using database-derived potentials , 1999 .

[93]  D. Baker,et al.  Prospects for ab initio protein structural genomics. , 2001, Journal of molecular biology.

[94]  J. Thornton,et al.  Determinants of strand register in antiparallel β‐sheets of proteins , 1998, Protein science : a publication of the Protein Society.

[95]  M. A. Wouters,et al.  An analysis of side chain interactions and pair correlations within antiparallel β‐sheets: The differences between backbone hydrogen‐bonded and non‐hydrogen‐bonded residue pairs , 1995, Proteins.

[96]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[97]  W R Taylor,et al.  Towards protein tertiary fold prediction using distance and motif constraints. , 1991, Protein engineering.

[98]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[99]  Pierre Baldi,et al.  Large-Scale Prediction of Disulphide Bond Connectivity , 2004, NIPS.

[100]  H Oschkinat,et al.  Improving the refolding yield of interleukin-4 through the optimization of local interactions. , 2000, Journal of biotechnology.

[101]  C Sander,et al.  Specific recognition in the tertiary structure of beta-sheets of proteins. , 1980, Journal of molecular biology.

[102]  M. Sternberg,et al.  Analysis and classification of disulphide connectivity in proteins. The entropic effect of cross-linkage. , 1994, Journal of molecular biology.

[103]  M. Madera,et al.  A comparison of profile hidden Markov model procedures for remote homology detection. , 2002, Nucleic acids research.

[104]  P S Kim,et al.  Context is a major determinant of beta-sheet propensity. , 1994, Nature.

[105]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[106]  L. Holm,et al.  Exhaustive enumeration of protein domain families. , 2003, Journal of molecular biology.

[107]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[108]  Pierre Baldi,et al.  The Principled Design of Large-Scale Recursive Neural Network Architectures--DAG-RNNs and the Protein Structure Prediction Problem , 2003, J. Mach. Learn. Res..

[109]  L. Regan,et al.  Guidelines for Protein Design: The Energetics of β Sheet Side Chain Interactions , 1995, Science.

[110]  R. L. Baldwin The nature of protein folding pathways: The classical versus the new view , 1995, Journal of biomolecular NMR.

[111]  John L. Klepeis,et al.  Prediction of β‐sheet topology and disulfide bridges in polypeptides , 2003, J. Comput. Chem..

[112]  Jeffery G Saven,et al.  Combinatorial protein design. , 2002, Current opinion in structural biology.

[113]  András Fiser,et al.  Predicting the oxidation state of cysteines by multiple sequence alignment , 2000, Bioinform..

[114]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[115]  Eleazar Eskin,et al.  The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.

[116]  Pierre Baldi,et al.  Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles , 2002, Proteins.

[117]  M J Sternberg,et al.  Side‐chain conformational entropy in protein folding , 1995, Protein science : a publication of the Protein Society.

[118]  B. Rost,et al.  Alignments grow, secondary structure prediction improves , 2002, Proteins.

[119]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[120]  Stephen L Mayo,et al.  Prudent modeling of core polar residues in computational protein design. , 2003, Journal of molecular biology.

[121]  Bernhard Schölkopf,et al.  Kernel-Based Integration of Genomic Data Using Semidefinite Programming , 2004 .

[122]  Peter Clote,et al.  Disulfide connectivity prediction using secondary structure information and diresidue frequencies , 2005, Bioinform..

[123]  M. A. McClure,et al.  Hidden Markov models of biological primary sequence information. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[124]  Michael Gribskov,et al.  Score Distributions for Simultaneous Matching to Multiple Motifs , 1997, J. Comput. Biol..

[125]  D. Haussler,et al.  Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. , 1998, Journal of molecular biology.

[126]  A. Elofsson,et al.  Local moves: An efficient algorithm for simulation of protein folding , 1995, Proteins.

[127]  A. Godzik,et al.  Comparison of sequence profiles. Strategies for structural predictions using sequence information , 2008, Protein science : a publication of the Protein Society.

[128]  Arlo Z. Randall,et al.  Prediction of protein stability changes for single‐site mutations using support vector machines , 2005, Proteins.

[129]  Burkhard Rost,et al.  UniqueProt: creating representative protein sequence sets , 2003, Nucleic Acids Res..

[130]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[131]  R. G. Hart,et al.  Structure of Myoglobin: A Three-Dimensional Fourier Synthesis at 2 Å. Resolution , 1960, Nature.

[132]  Ralf Zimmer,et al.  Profile-Profile Alignment: A Powerful Tool for Protein Structure Prediction , 2002, Pacific Symposium on Biocomputing.

[133]  Jason Weston,et al.  Mismatch string kernels for discriminative protein classification , 2004, Bioinform..

[134]  S. L. Mayo,et al.  Computational protein design. , 1999, Structure.

[135]  Manuel C. Peitsch,et al.  SWISS-MODEL: an automated protein homology-modeling server , 2003, Nucleic Acids Res..

[136]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[137]  Chris Sander,et al.  Touring protein fold space with Dali/FSSP , 1998, Nucleic Acids Res..

[138]  Minoru Asogawa,et al.  Beta-Sheet Prediction Using Inter-Strand Residue Pairs and Refinement with Hopfield Neural Network , 1997, ISMB.

[139]  Christopher Bystroff,et al.  Predicting interresidue contacts using templates and pathways , 2003, Proteins.

[140]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[141]  A. Travers,et al.  DNA conformation and protein binding. , 1989, Annual review of biochemistry.

[142]  M. Levitt,et al.  Accurate prediction of the stability and activity effects of site-directed mutagenesis on a protein core , 1991, Nature.

[143]  N. Grishin,et al.  COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. , 2003, Journal of molecular biology.

[144]  Daniel Fischer,et al.  3D‐SHOTGUN: A novel, cooperative, fold‐recognition meta‐predictor , 2003, Proteins.

[145]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[146]  T. Joachims Support Vector Machines , 2002 .

[147]  Y. Freund,et al.  Profile-based string kernels for remote homology detection and motif extraction. , 2005, Journal of bioinformatics and computational biology.

[148]  Burkhard Rost,et al.  Sisyphus and prediction of protein structure , 1997, Comput. Appl. Biosci..

[149]  A. Panchenko,et al.  Combination of threading potentials and sequence profiles improves fold recognition. , 2000, Journal of molecular biology.

[150]  Y Shan,et al.  Fold recognition and accurate query‐template alignment by a combination of PSI‐BLAST and threading , 2001, Proteins.

[151]  B. Rost,et al.  Protein fold recognition by prediction-based threading. , 1997, Journal of molecular biology.

[152]  M Vendruscolo,et al.  Recovery of protein structure from contact maps. , 1997, Folding & design.

[153]  R. Abagyan,et al.  Recognition of distantly related proteins through energy calculations , 1994, Proteins.

[154]  Richard Bonneau,et al.  Distributions of beta sheets in proteins with application to structure prediction , 2002, Proteins.

[155]  A Kolinski,et al.  Dynamic Monte Carlo simulations of a new lattice model of globular protein folding, structure and dynamics. , 1991, Journal of molecular biology.

[156]  M Karplus,et al.  Simulation analysis of the stability mutant R96H of T4 lysozyme. , 1991, Biochemistry.

[157]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[158]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[159]  Steven E Brenner,et al.  The Impact of Structural Genomics: Expectations and Outcomes , 2005, Science.

[160]  C Sander,et al.  Dictionary of recurrent domains in protein structures , 1998, Proteins.

[161]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[162]  Roland L Dunbrack,et al.  Scoring profile‐to‐profile sequence alignments , 2004, Protein science : a publication of the Protein Society.

[163]  T L Blundell,et al.  FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. , 2001, Journal of molecular biology.

[164]  M S Waterman,et al.  Sequence alignment and penalty choice. Review of concepts, case studies and implications. , 1994, Journal of molecular biology.

[165]  B. Honig,et al.  On the role of structural information in remote homology detection and sequence alignment: new methods using hybrid sequence profiles. , 2003, Journal of molecular biology.

[166]  C. Sander,et al.  Parser for protein folding units , 1994, Proteins.

[167]  Pierre Baldi,et al.  A machine learning information retrieval approach to protein fold recognition. , 2006, Bioinformatics.

[168]  W. Braun,et al.  Sequence specificity, statistical potentials, and three‐dimensional structure prediction with self‐correcting distance geometry calculations of β‐sheet formation in proteins , 2008 .

[169]  P. Baldi,et al.  Prediction of coordination number and relative solvent accessibility in proteins , 2002, Proteins.

[170]  L Serrano,et al.  Elucidating the folding problem of alpha-helices: local motifs, long-range electrostatics, ionic-strength dependence and prediction of NMR parameters. , 1998, Journal of molecular biology.

[171]  M. O. Dayhoff,et al.  Establishing homologies in protein sequences. , 1983, Methods in enzymology.

[172]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[173]  A. Gronenborn,et al.  Crystal structure of interleukin 8: symbiosis of NMR and crystallography. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[174]  Christopher M. Summa,et al.  De novo design and structural characterization of proteins and metalloproteins. , 1999, Annual review of biochemistry.

[175]  B. Rost,et al.  Improved prediction of protein secondary structure by use of sequence profiles and neural networks. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[176]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[177]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[178]  R L Jernigan,et al.  Protein stability for single substitution mutants and the extent of local compactness in the denatured state. , 1994, Protein engineering.

[179]  Nick V. Grishin,et al.  Probabilistic scoring measures for profile-profile comparison yield more accurate short seed alignments , 2003, Bioinform..

[180]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[181]  John C. Wootton,et al.  Non-globular Domains in Protein Sequences: Automated Segmentation Using Complexity Measures , 1994, Comput. Chem..

[182]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[183]  Alessandro Sperduti,et al.  A general framework for adaptive processing of data structures , 1998, IEEE Trans. Neural Networks.

[184]  Richard Hughey,et al.  Hidden Markov models for detecting remote protein homologies , 1998, Bioinform..

[185]  P Fariselli,et al.  Prediction of contact maps with neural networks and correlated mutations. , 2001, Protein engineering.

[186]  R. Abagyan,et al.  Do aligned sequences share the same fold? , 1997, Journal of molecular biology.

[187]  P. Koehl,et al.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. , 1994, Journal of molecular biology.

[188]  Peter A. Kollman,et al.  Free energy calculations on protein stability: Thr-157 .fwdarw. Val-157 mutation of T4 lysozyme , 1989 .

[189]  Thomas Lengauer,et al.  Arby: automatic protein structure prediction using profile-profile alignment and confidence measures , 2004, Bioinform..

[190]  David Haussler,et al.  A Discriminative Framework for Detecting Remote Protein Homologies , 2000, J. Comput. Biol..

[191]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[192]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[193]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[194]  Hongyi Zhou,et al.  Fold recognition by combining sequence profiles derived from evolution and from depth‐dependent structural alignment of fragments , 2004, Proteins.

[195]  C. Frenz,et al.  Neural network‐based prediction of mutation‐induced protein stability changes in Staphylococcal nuclease at 20 residue positions , 2005, Proteins.

[196]  J. Thornton,et al.  Prediction of strand pairing in antiparallel and parallel β‐sheets using information theory , 2002, Proteins.

[197]  R. Raines,et al.  Contribution of disulfide bonds to the conformational stability and catalytic activity of ribonuclease A. , 2000, European journal of biochemistry.

[198]  Arne Elofsson,et al.  Profile–profile methods provide improved fold‐recognition: A study of different profile–profile alignment methods , 2004, Proteins.

[199]  L. Looger,et al.  Computational design of receptor and sensor proteins with novel functions , 2003, Nature.

[200]  R. Abagyan,et al.  Large‐scale prediction of protein geometry and stability changes for arbitrary single point mutations , 2004, Proteins.

[201]  Paolo Frasconi,et al.  Disulfide connectivity prediction using recursive neural networks and evolutionary information , 2004, Bioinform..

[202]  C. Levinthal Are there pathways for protein folding , 1968 .

[203]  Lee Testing homology modeling on mutant proteins: predicting structural and thermodynamic effects in the Ala98-->Val mutants of T4 lysozyme. , 1995, Folding & design.

[204]  Golan Yona,et al.  Automatic prediction of protein domains from sequence information using a hybrid learning system , 2004, Bioinform..

[205]  Raphael Guerois,et al.  Energy estimation in protein design. , 2002, Current opinion in structural biology.

[206]  Pierre Baldi,et al.  On the relationship between deterministic and probabilistic directed Graphical models: From Bayesian networks to recursive neural networks , 2005, Neural Networks.

[207]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[208]  William Lawrence Bragg The Rutherford Memorial Lecture, 1960 - The development of X-ray analysis , 1961, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[209]  Y. Mandel-Gutfreund,et al.  Contributions of residue pairing to beta-sheet formation: conservation and covariation of amino acid residue pairs on antiparallel beta-strands. , 2001, Journal of molecular biology.

[210]  L. Demetrius Thermodynamics and evolution. , 2000, Journal of theoretical biology.

[211]  Akinori Sarai,et al.  ProTherm, version 2.0: thermodynamic database for proteins and mutants , 2000, Nucleic Acids Res..

[212]  Liam J. McGuffin,et al.  Improving sequence-based fold recognition by using 3D model quality assessment , 2005, Bioinform..

[213]  D Gilis,et al.  Stability changes upon mutation of solvent-accessible residues in proteins evaluated by database-derived potentials. , 1996, Journal of molecular biology.

[214]  T. Gibson,et al.  Protein disorder prediction: implications for structural proteomics. , 2003, Structure.

[215]  B. Rost,et al.  Conservation and prediction of solvent accessibility in protein families , 1994, Proteins.

[216]  Pierre Baldi,et al.  DOMpro: Protein Domain Prediction Using Profiles, Secondary Structure, Relative Solvent Accessibility, and Recursive Neural Networks , 2006, Data Mining and Knowledge Discovery.

[217]  P Fariselli,et al.  Role of evolutionary information in predicting the disulfide‐bonding state of cysteine in proteins , 1999, Proteins.

[218]  Sam Griffiths-Jones,et al.  The use of structure information to increase alignment accuracy does not aid homologue detection with profile HMMs , 2002, Bioinform..

[219]  P Parham,et al.  Structure, function, and diversity of class I major histocompatibility complex molecules. , 1990, Annual review of biochemistry.

[220]  Pierre Baldi,et al.  Large‐scale prediction of disulphide bridges using kernel methods, two‐dimensional recursive neural networks, and weighted graph matching , 2005, Proteins.

[221]  D Fischer,et al.  Hybrid fold recognition: combining sequence derived properties with evolutionary information. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[222]  Stephen H. Bryant,et al.  Domain size distributions can predict domain boundaries , 2000, Bioinform..

[223]  L. Regan,et al.  Construction and Design of β-Sheets , 1997 .

[224]  K Nishikawa,et al.  Experimental verification of the 'stability profile of mutant protein' (SPMP) data using mutant human lysozymes. , 1999, Protein engineering.

[225]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[226]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[227]  Robert M. MacCallum,et al.  Striped sheets and protein contact prediction , 2004, ISMB/ECCB.

[228]  B. Dahiyat,et al.  In silico design for protein stabilization. , 1999, Current opinion in biotechnology.

[229]  P. Kollman,et al.  Exhaustive mutagenesis in silico: Multicoordinate free energy calculations on proteins and peptides , 2000, Proteins.

[230]  Pierre Baldi,et al.  Sigmoid: a software infrastructure for pathway bioinformatics and systems biology , 2005, IEEE Intelligent Systems.

[231]  K. Takano,et al.  Are the parameters of various stabilization factors estimated from mutant human lysozymes compatible with other proteins? , 2001, Protein engineering.

[232]  A Valencia,et al.  A neural network approach to evaluate fold recognition results , 2003, Proteins.

[233]  Piero Fariselli,et al.  Predicting Free Energy Contribution to the Conformational Stability of Folded Proteins From the Residue Sequence with Radial Basis Function Networks , 1995, ISMB.

[234]  F. Sanger,et al.  The amino-acid sequence in the glycyl chain of insulin. II. The investigation of peptides from enzymic hydrolysates. , 1951, The Biochemical journal.

[235]  C. Branden,et al.  Introduction to protein structure , 1991 .

[236]  Marcin von Grotthuss,et al.  ORFeus: detection of distant homology using sequence profiles and predicted secondary structure , 2003, Nucleic Acids Res..

[237]  Piero Fariselli,et al.  A neural-network-based method for predicting protein stability changes upon single point mutations , 2004, ISMB/ECCB.