APPLICATION OF SPARSE LINEAR DISCRIMINANT ANALYSIS FOR PREDICTION OF PROTEIN-PROTEIN INTERACTIONS

To understand the complex cellular mechanisms involved in a biological system, it is necessary to study protein-protein interactions (PPIs) at the molecular level, in which prediction of PPIs plays a significant role. In this paper we propose a new classification approach based on the sparse discriminant analysis [10] to predict obligate (permanent) and non-obligate (transient) protein-protein interactions. The sparse discriminant analysis [10] circumvents the limitations of the classical discriminant analysis [4, 9] in the high dimensional low sample size settings by incorporating inherently the feature selection into the optimization procedure. To characterize properties of protein interaction, we proposed to use the binding free energies. The performance of our proposed classifier is 75% ± 5%.