Diseases classification using support vector machine (SVM)

The paper proposed a new method: disease classification based on protein sequence. Support vector machine was used for this problem and a new encoding for the multicode protein sequence was suggested. Two extracted features were selected for classifying, the results showed the capability of SVM for such bioinformatics problems and the goodness of the system of protein sequence based disease classification. It gave error around 4-5%, which presented that although it is good for such problems, improvement of the algorithm also should be made.