Predicting protein secondary structure is an important step toward predicting protein tertiary structure. A new secondary structure prediction method, SVMpsi, was developed to improve on the current level of prediction accuracy by combining new tertiary classifiers with a jury decision system and PSI-BLAST PSSM profiles. In addition, efficient methods for handling unbalanced data and a new optimization strategy for maximizing the Q(3) measure were developed. SVMpsi produces the highest published Q(3) and SOV94 scores to date on both the RS126 and CB513 data sets. On a new KP480 set, SVMpsi achieved Q(3) = 78.5% and SOV94 = 82.8%. Moreover, in a blind test on 136 non-redundant protein sequences that contain no homologues of the training data, it achieved Q(3) = 77.2% and SOV94 = 81.8%. The SVMpsi results in CASP5 show that it is a competitive method for predicting protein secondary structure.
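For orientation, Q(3) is conventionally the percentage of residues whose three-state label (helix H, strand E, coil C) is predicted correctly. The following is a minimal sketch under that standard definition, not the SVMpsi implementation; the function name `q3_score` and the example strings are illustrative assumptions.

```python
def q3_score(observed: str, predicted: str) -> float:
    """Return the percentage of residues whose three-state label matches.

    Both arguments are per-residue label strings over the alphabet
    H (helix), E (strand), C (coil). Names and inputs are hypothetical.
    """
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if not observed:
        raise ValueError("sequences must be non-empty")
    # Count positions where the predicted state equals the observed state.
    correct = sum(o == p for o, p in zip(observed, predicted))
    return 100.0 * correct / len(observed)


# Made-up example labels, not data from the paper.
observed = "CCHHHHHHCCEEEEECC"
predicted = "CCHHHHHCCCEEEECCC"
print(f"Q3 = {q3_score(observed, predicted):.1f}%")
```

Because Q(3) weights every residue equally, a predictor can score well by favoring the most common state, which may be why the abstract pairs it with methods for unbalanced data and with the segment-based SOV94 score.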