AN EFFECTIVE DATA MINING TECHNIQUE FOR THE MULTI-CLASS PROTEIN SEQUENCE CLASSIFICATION

Sub cellular localization or solubility and various other properties can be predicted from the features and sequences extracted from Amino acid sequences, using classifier algorithms. Various packages are needed to be installed and data needs to be converted into different formats even though there are a lot of feature extraction and classifier construction software tools available because the application is not straight forward. The objective of this project is to make the sequence based classification techniques quick and explorative for biologists. ProtS is a software application run for finding sequence based properties of proteins in predetermined groups. In singleton integrated, interactive environment it gives data importation, sequence-based property calculation and solution along with data classifier construction with testing.