Sixty-five years of the long march in protein secondary structure prediction: the final stretch?

Abstract Protein secondary structure prediction began in 1951 when Pauling and Corey predicted helical and sheet conformations for protein polypeptide backbone even before the first protein structure was determined. Sixty-five years later, powerful new methods breathe new life into this field. The highest three-state accuracy without relying on structure templates is now at 82–84%, a number unthinkable just a few years ago. These improvements came from increasingly larger databases of protein sequences and structures for training, the use of template secondary structure information and more powerful deep learning techniques. As we are approaching to the theoretical limit of three-state prediction (88–90%), alternative to secondary structure prediction (prediction of backbone torsion angles and Cα-atom-based angles and torsion angles) not only has more room for further improvement but also allows direct prediction of three-dimensional fragment structures with constantly improved accuracy. About 20% of all 40-residue fragments in a database of 1199 non-redundant proteins have <6 Å root-mean-squared distance from the native conformations by SPIDER2. More powerful deep learning methods with improved capability of capturing long-range interactions begin to emerge as the next generation of techniques for secondary structure prediction. The time has come to finish off the final stretch of the long march towards protein secondary structure prediction.

[1]  Xin-Qiu Yao,et al.  A dynamic Bayesian network approach to protein secondary structure prediction , 2008, BMC Bioinformatics.

[2]  G J Kleywegt,et al.  Validation of protein models from Calpha coordinates alone. , 1997, Journal of molecular biology.

[3]  Thomas A. Hopf,et al.  Protein 3D Structure Computed from Evolutionary Sequence Variation , 2011, PloS one.

[4]  Dapeng Li,et al.  A novel structural position-specific scoring matrix for the prediction of protein secondary structures , 2012, Bioinform..

[5]  B. Rost,et al.  Protein flexibility and rigidity predicted from sequence , 2005, Proteins.

[6]  D. Kihara The effect of long‐range interactions on the secondary structure formation of proteins , 2005, Protein science : a publication of the Protein Society.

[7]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[8]  Ron Unger,et al.  Codon-level information improves predictions of inter-residue contacts in proteins by correlated mutation analysis , 2015, eLife.

[9]  Jiangning Song,et al.  Prediction of cis/trans isomerization in proteins using PSI-BLAST profiles and secondary structure information , 2006, BMC Bioinformatics.

[10]  Zarrin Minuchehr,et al.  Chameleon sequences in neurodegenerative diseases , 2016, Biochemical and Biophysical Research Communications.

[11]  M. Gromiha,et al.  Real value prediction of solvent accessibility from amino acid sequence , 2003, Proteins.

[12]  K. Dill,et al.  Protein folding by zipping and assembly , 2007, Proceedings of the National Academy of Sciences.

[13]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[14]  J. Skolnick,et al.  What is the probability of a chance prediction of a protein structure with an rmsd of 6 A? , 1998, Folding & design.

[15]  Gavin Giovannoni,et al.  Inactive or moderately active human promoters are enriched for inter-individual epialleles , 2013, Genome Biology.

[16]  David A. Lee,et al.  CATH: comprehensive structural and functional annotations for genome sequences , 2014, Nucleic Acids Res..

[17]  E. Kabat,et al.  The influence of nearest-neighbor amino acids on the conformation of the middle amino acid in proteins: comparison of predicted and experimental determination of -sheets in concanavalin A. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Albert Y. Zomaya,et al.  Machine Learning Techniques for Protein Secondary Structure Prediction:An Overview and Evaluation , 2008 .

[19]  Pierre Baldi,et al.  SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity , 2014, Bioinform..

[20]  Bernard F. Buxton,et al.  Secondary structure prediction with support vector machines , 2003, Bioinform..

[21]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[22]  V A Simossis,et al.  Integrating protein secondary structure prediction and multiple sequence alignment. , 2004, Current protein & peptide science.

[23]  Alan Wee-Chung Liew,et al.  Sequence-Based Prediction of Protein-Carbohydrate Binding Sites Using Support Vector Machines , 2016, J. Chem. Inf. Model..

[24]  Linus Pauling,et al.  The Structure of Proteins , 1939 .

[25]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[26]  Wayne A Hendrickson,et al.  A force field for virtual atom molecular mechanics of proteins , 2009, Proceedings of the National Academy of Sciences.

[27]  Elaine R Mardis,et al.  Anticipating the $1,000 genome , 2006, Genome Biology.

[28]  Yaoqi Zhou,et al.  Prediction of One‐Dimensional Structural Properties Of Proteins by Integrated Neural Networks , 2010 .

[29]  Yücel Altunbasak,et al.  Protein secondary structure prediction for a single-sequence using hidden semi-Markov models , 2006, BMC Bioinformatics.

[30]  M. Karplus,et al.  Protein secondary structure prediction with a neural network. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Yaoqi Zhou,et al.  Achieving 80% ten‐fold cross‐validated accuracy for secondary structure prediction by large‐scale training , 2006, Proteins.

[32]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[33]  Dong Xu,et al.  MUPRED: A tool for bridging the gap between template based methods and sequence profile based methods for protein secondary structure prediction , 2006, Proteins.

[34]  Geoffrey E. Hinton Learning multiple layers of representation , 2007, Trends in Cognitive Sciences.

[35]  R. Casadio,et al.  Predictions of protein segments with the same aminoacid sequence and different secondary structure: A benchmark for predictive methods , 2000, Proteins.

[36]  Alessio Ceroni,et al.  Learning protein secondary structure from sequential and relational data , 2005, Neural Networks.

[37]  D. Fischer,et al.  Protein fold recognition using sequence‐derived predictions , 1996, Protein science : a publication of the Protein Society.

[38]  S. Hua,et al.  A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach. , 2001, Journal of molecular biology.

[39]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[40]  Alessandro Vullo,et al.  Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information , 2007, BMC Bioinformatics.

[41]  Yuxing Liao,et al.  CASP9 assessment of free modeling target predictions , 2011, Proteins.

[42]  W. Caughey,et al.  Protein secondary structures in water from second-derivative amide I infrared spectra. , 1990, Biochemistry.

[43]  Gert Vriend,et al.  Detection of trans–cis flips and peptide-plane flips in protein structures , 2015, Acta crystallographica. Section D, Biological crystallography.

[44]  James G. Lyons,et al.  Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning , 2015, Scientific Reports.

[45]  Y. Duan,et al.  Trends in template/fragment-free protein structure prediction , 2010, Theoretical chemistry accounts.

[46]  Jaap Heringa,et al.  Protein secondary structure prediction. , 2010, Methods in molecular biology.

[47]  S Brunak,et al.  Protein secondary structure and homology by neural networks. The alpha-helices in rhodopsin. , 1988, FEBS letters.

[48]  David S. Wishart,et al.  Improving the accuracy of protein secondary structure prediction using structural alignment , 2006, BMC Bioinformatics.

[49]  Xin Deng,et al.  MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts , 2011, BMC Bioinformatics.

[50]  Luciano A. Abriata,et al.  Structural database resources for biological macromolecules , 2016, Briefings Bioinform..

[51]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[52]  Sofia Khan,et al.  Spectrum of disease-causing mutations in protein secondary structures , 2007, BMC Structural Biology.

[53]  Lukasz A. Kurgan,et al.  MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins , 2012, Bioinform..

[54]  K Ravi Acharya,et al.  The advantages and limitations of protein crystal structures. , 2005, Trends in pharmacological sciences.

[55]  J. Skolnick,et al.  Ab initio modeling of small proteins by iterative TASSER simulations , 2007, BMC Biology.

[56]  Christophe Dugave,et al.  Cis-trans isomerization of organic molecules and biomolecules: implications and applications. , 2003, Chemical reviews.

[57]  Yunlong Liu,et al.  DDIG-in: detecting disease-causing genetic variations due to frameshifting indels and nonsense mutations employing sequence and structural properties at nucleotide and protein levels , 2015, Bioinform..

[58]  T. Stevens,et al.  Are membrane proteins “inside‐out” proteins? , 1999, Proteins.

[59]  Guoli Wang,et al.  PISCES: recent improvements to a PDB sequence culling server , 2005, Nucleic Acids Res..

[60]  James E. Fitzgerald,et al.  Mimicking the folding pathway to improve homology-free protein structure prediction , 2009, Proceedings of the National Academy of Sciences.

[61]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[62]  Yaohang Li,et al.  Template-based C8-SCORPION: a protein 8-state secondary structure prediction method using structural information and context-based features , 2014, BMC Bioinformatics.

[63]  David S. Wishart,et al.  CSI 3.0: a web server for identifying secondary and super-secondary structure in proteins using NMR chemical shifts , 2015, Nucleic Acids Res..

[64]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[65]  Robert Preissner,et al.  Prediction of prolyl residues in cis‐conformation in protein structures on the basis of the amino acid sequence , 1990, FEBS letters.

[66]  Rafael Brüschweiler,et al.  PPM_One: a static protein structure based chemical shift predictor , 2015, Journal of biomolecular NMR.

[67]  B. Rost,et al.  Redefining the goals of protein secondary structure prediction. , 1994, Journal of molecular biology.

[68]  M. Karplus,et al.  Folding thermodynamics of a model three-helix-bundle protein. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[69]  Wen-Lian Hsu,et al.  HYPROSP II-A knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence , 2005, Bioinform..

[70]  Zoran Obradovic,et al.  Improving Protein Secondary-Structure Prediction by Predicting Ends of Secondary-Structure Segments , 2005, IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[71]  Thomas C Terwilliger,et al.  Lessons from structural genomics. , 2009, Annual review of biophysics.

[72]  L. Pauling,et al.  Configurations of Polypeptide Chains With Favored Orientations Around Single Bonds: Two New Pleated Sheets. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[73]  N. Greenfield Using circular dichroism spectra to estimate protein secondary structure , 2007, Nature Protocols.

[74]  Yaoqi Zhou,et al.  Intrinsically Semi-disordered State and Its Role in Induced Folding and Protein Aggregation , 2013, Cell Biochemistry and Biophysics.

[75]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[76]  G. Crooks,et al.  Protein secondary structure: entropy, correlations and prediction. , 2003, Bioinformatics.

[77]  J. Kong,et al.  Obtaining information about protein secondary structures in aqueous solution using Fourier transform IR spectroscopy , 2015, Nature Protocols.

[78]  S L Mowbray,et al.  Cα‐based torsion angles: A simple tool to analyze protein conformational changes , 1995, Protein science : a publication of the Protein Society.

[79]  Zhiyong Wang,et al.  Protein 8-class secondary structure prediction using Conditional Neural Fields , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[80]  Jaime G. Carbonell,et al.  Comparison of probabilistic combination methods for protein secondary structure prediction , 2004, Bioinform..

[81]  Adam Liwo,et al.  Coarse-grained force field: general folding theory. , 2011, Physical chemistry chemical physics : PCCP.

[82]  D. Pal,et al.  Cis peptide bonds in proteins: residues involved, their conformations, interactions and locations. , 1999, Journal of molecular biology.

[83]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[84]  Predrag Radivojac,et al.  Automated inference of molecular mechanisms of disease from amino acid substitutions , 2009, Bioinform..

[85]  Jun-tao Guo,et al.  Analysis of chameleon sequences and their implications in biological processes , 2007, Proteins.

[86]  Yaoqi Zhou,et al.  Predicting the errors of predicted local backbone angles and non-local solvent- accessibilities of proteins by deep neural networks , 2016, Bioinform..

[87]  O. Ptitsyn,et al.  Statistical analysis of the correlation among amino acid residues in helical, beta-structural and non-regular regions of globular proteins. , 1971, Journal of molecular biology.

[88]  V. Lim Algorithms for prediction of alpha-helical and beta-structural regions in globular proteins. , 1974, Journal of molecular biology.

[89]  Yaoqi Zhou,et al.  Improving protein disorder prediction by deep bidirectional long short‐term memory recurrent neural networks , 2016, Bioinform..

[90]  Dimitrios I. Fotiadis,et al.  Detection of discriminative sequence patterns in the neighborhood of proline cis peptide bonds and their functional annotation , 2009, BMC Bioinformatics.

[91]  E. Lander,et al.  Protein secondary structure prediction using nearest-neighbor methods. , 1993, Journal of molecular biology.

[92]  Yanjun Qi,et al.  A Unified Multitask Architecture for Predicting Local Protein Properties , 2012, PloS one.

[93]  J. Skolnick,et al.  Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm , 2004, Proteins.

[94]  J. Garnier,et al.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. , 1978, Journal of molecular biology.

[95]  P. Frasconi,et al.  On the role of long-range dependencies in learning protein secondary structure , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[96]  P Willett,et al.  Use of techniques derived from graph theory to compare secondary structure motifs in proteins. , 1990, Journal of molecular biology.

[97]  M A Mortin,et al.  Molecular modeling of RNA polymerase II mutations onto DNA polymerase I. , 1994, Journal of molecular biology.

[98]  Tim J. P. Hubbard,et al.  Data growth and its impact on the SCOP database: new developments , 2007, Nucleic Acids Res..

[99]  Christian Cole,et al.  The Jpred 3 secondary structure prediction server , 2008, Nucleic Acids Res..

[100]  Thomas A. Hopf,et al.  Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing , 2012, Cell.

[101]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[102]  Oliviero Carugo,et al.  Criteria to Extract High-Quality Protein Data Bank Subsets for Structure Users. , 2016, Methods in molecular biology.

[103]  M. Karplus,et al.  Interpreting the folding kinetics of helical proteins , 1999, Nature.

[104]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[105]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[106]  Thomas A. Hopf,et al.  Structured States of Disordered Proteins from Genomic Sequences , 2016, Cell.

[107]  Kuldip K. Paliwal,et al.  Highly accurate sequence-based prediction of half-sphere exposures of amino acid residues in proteins , 2016, Bioinform..

[108]  Yaoqi Zhou,et al.  Characterizing the existing and potential structural space of proteins by large-scale multiple loop permutations. , 2011, Journal of molecular biology.

[109]  J. Skolnick,et al.  TOUCHSTONE: An ab initio protein structure prediction method that uses threading-based tertiary restraints , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[110]  Yu Xue,et al.  MBA: a literature mining system for extracting biomedical abbreviations , 2009, BMC Bioinformatics.

[111]  G J Barton,et al.  Application of multiple sequence alignment profiles to improve protein secondary structure prediction , 2000, Proteins.

[112]  Frank Wien,et al.  Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy , 2015, Proceedings of the National Academy of Sciences.

[113]  Gianluca Pollastri,et al.  Porter, PaleAle 4.0: high-accuracy prediction of protein secondary structure and relative solvent accessibility , 2013, Bioinform..

[114]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[115]  A. Dunker,et al.  Use of conditional probabilities for determining relationships between amino acid sequence and protein secondary structure , 1992, Proteins.

[116]  R. L. Jernigan,et al.  Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction , 2012, Journal of Molecular Modeling.

[117]  Haitao Cheng,et al.  Consensus Data Mining (CDM) Protein Secondary Structure Prediction Server: Combining GOR V and Fragment Database Mining (FDM) , 2007, Bioinform..

[118]  Gajendra P. S. Raghava,et al.  Evaluation of Protein Dihedral Angle Prediction Methods , 2014, PloS one.

[119]  A. Tramontano,et al.  New encouraging developments in contact prediction: Assessment of the CASP11 results , 2016, Proteins.

[120]  M Mezei,et al.  Chameleon sequences in the PDB. , 1998, Protein engineering.

[121]  J Heringa Computational methods for protein secondary structure prediction using multiple sequence alignments. , 2000, Current protein & peptide science.

[122]  Simon J. Hubbard,et al.  Improved prediction for N‐termini of α‐helices using empirical information , 2004 .

[123]  Samy Bengio,et al.  Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[124]  Christian Cole,et al.  JPred4: a protein secondary structure prediction server , 2015, Nucleic Acids Res..

[125]  Yuedong Yang,et al.  Discriminating between disease-causing and neutral non-frameshifting micro-INDELs by support vector machines by means of integrated sequence- and structure-based features , 2013 .

[126]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[127]  Pierre Baldi,et al.  Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles , 2002, Proteins.

[128]  H. Scheraga,et al.  Status of empirical methods for the prediction of protein backbone topography. , 1976, Biochemistry.

[129]  S. Muggleton,et al.  Protein secondary structure prediction using logic-based machine learning. , 1992, Protein engineering.

[130]  Daniel W. A. Buchan,et al.  Protein annotation and modelling servers at University College London , 2010, Nucleic Acids Res..

[131]  Christopher J. Oldfield,et al.  Intrinsic disorder and functional proteomics. , 2007, Biophysical journal.

[132]  Wei Zheng,et al.  A large-scale comparative assessment of methods for residue–residue contact prediction , 2016, Briefings Bioinform..

[133]  Simon J Hubbard,et al.  Improved prediction for N-termini of alpha-helices using empirical information. , 2004, Proteins.

[134]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[135]  A. Godzik,et al.  Computational protein function prediction: Are we making progress? , 2007, Cellular and Molecular Life Sciences.

[136]  Qiang Guo,et al.  Advances in protein contact map prediction based on machine learning. , 2015, Medicinal chemistry (Shariqah (United Arab Emirates)).

[137]  Hongjun Bai,et al.  Assessment of template‐free modeling in CASP10 and ROLL , 2014, Proteins.

[138]  Wei Chu,et al.  Bayesian segmental models with multiple sequence alignment profiles for protein secondary structure and contact map prediction , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[139]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[140]  D. Baker,et al.  Coupled prediction of protein secondary and tertiary structure , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[141]  Jian Peng,et al.  Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields , 2015, Scientific Reports.

[142]  Ming Zhang,et al.  Comparing sequences without using alignments: application to HIV/SIV subtyping , 2007, BMC Bioinformatics.

[143]  Lisa N Kinch,et al.  ChSeq: A database of chameleon sequences , 2015, Protein science : a publication of the Protein Society.

[144]  Jianlin Cheng,et al.  A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[145]  B. Rost,et al.  Improved prediction of protein secondary structure by use of sequence profiles and neural networks. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[146]  Piero Fariselli,et al.  The s2D method: simultaneous sequence-based prediction of the statistical populations of ordered and disordered regions in proteins. , 2015, Journal of molecular biology.

[147]  M. Sternberg,et al.  Prediction of protein secondary structure and active sites using the alignment of homologous sequences. , 1987, Journal of molecular biology.

[148]  Bart van Beusekom,et al.  Data Mining of Macromolecular Structures. , 2016, Methods in molecular biology.

[149]  Lukasz A. Kurgan,et al.  SPINE X: Improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles , 2012, J. Comput. Chem..

[150]  Bin Xue,et al.  Real‐value prediction of backbone torsion angles , 2008, Proteins.

[151]  Yaoqi Zhou,et al.  SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures. , 2005, Bioinformatics.

[152]  Dimitrios I. Fotiadis,et al.  Prediction of cis/trans isomerization using feature selection and support vector machines , 2009, J. Biomed. Informatics.

[153]  Kuldip K. Paliwal,et al.  Predicting backbone Cα angles and dihedrals from protein sequences by stacked sparse auto‐encoder deep neural network , 2014, J. Comput. Chem..

[154]  Yaoqi Zhou,et al.  Improving the prediction accuracy of residue solvent accessibility and real‐value backbone torsion angles of proteins by guided‐learning through a two‐layer neural network , 2009, Proteins.

[155]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[156]  Aleksey A. Porollo,et al.  Accurate prediction of solvent accessibility using neural networks–based regression , 2004, Proteins.

[157]  Alexandre G de Brevern,et al.  Analysis of protein chameleon sequence characteristics , 2009, Bioinformation.

[158]  Debora S. Marks,et al.  Estimation of Residue-Residue Coevolution using Direct Coupling Analysis Identifies Many Native Contacts Across a Large Number of Domain Families , 2012 .

[159]  B. Rost Review: protein secondary structure prediction continues to rise. , 2001, Journal of structural biology.

[160]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[161]  D C Rees,et al.  Turning a reference inside‐out: Commentary on an article by Stevens and Arkin entitled: “Are membrane proteins ‘inside‐out’ proteins?” (Proteins 1999;36:135–143) , 2000, Proteins.

[162]  Yaohang Li,et al.  Context-Based Features Enhance Protein Secondary Structure Prediction Accuracy , 2014, J. Chem. Inf. Model..

[163]  A Keith Dunker,et al.  Assessing secondary structure assignment of protein structures by using pairwise sequence‐alignment benchmarks , 2008, Proteins.

[164]  Dimitrios I. Fotiadis,et al.  PBOND: Web Server for the Prediction of Proline and Non-Proline cis/trans Isomerization , 2009, Genom. Proteom. Bioinform..

[165]  Harold A. Scheraga,et al.  Structural Studies of Ribonuclease. III. A Model for the Secondary and Tertiary Structure1,2 , 1960 .

[166]  J. Moult,et al.  Loss of protein structure stability as a major causative factor in monogenic disease. , 2005, Journal of molecular biology.

[167]  S. Brunak,et al.  Protein secondary structure and homology by neural networks The α‐helices in rhodopsin , 1988 .