An Approach for Data Selection of Protein Function Prediction