Predicting human immunodeficiency virus protease cleavage sites in proteins by a discriminant function method

Based on the sequence‐coupled (Markov chain) model and vector‐projection principle, a discriminant function method is proposed to predict sites in protein substrates that should be susceptible to cleavage by the HIV‐1 protease. The discriminant function is defined by Δ = ϕ+ – ϕ−, where ϕ+ and ϕ− are the cleavable and noncleavable attributes for a given peptide, and they can be derived from two complementary sets of peptides, S+ and S−, known to be cleavable and noncleavable, respectively, by the enzyme. The rate of correct prediction by the method for the 62 cleavable peptides and 239 noncleavable peptides in the training set are 100 and 96.7%, respectively. Application of the method to the 55 sequences which are outside the training set and known to be cleaved by the HIV‐1 protease accurately predicted 100% of the peptides as substrates of the enzyme. The method also predicted all but one of the sites hydrolyzed by the protease in native HIV‐1 and HIV‐2 reverse transcriptases, where the HIV‐1 protease discriminates between nearly identical sequences in a very subtle fashion. Finally, the algorithm predicts correctly all of the HIV‐1 protease processing sites in the native gag and gag/pol HIV‐1 polyproteins, and all of the cleavage sites identified in denatured protease and reverse transcriptase. The new predictive algorithm provides a novel route toward understanding the specificity of this important therapeutic target.

[1]  A. Berger,et al.  On the size of the active site in proteases. I. Papain. , 1967, Biochemical and biophysical research communications.

[2]  Gregory K. Miller,et al.  Elements of Applied Stochastic Processes , 1972 .

[3]  D. Veber,et al.  HIV-1 protease specificity of peptide cleavage is sufficient for processing of gag and pol polyproteins. , 1988, Biochemical and biophysical research communications.

[4]  T. Copeland,et al.  Molecular characterization of gag proteins from simian immunodeficiency virus (SIVMne) , 1988, Journal of virology.

[5]  C. Debouck,et al.  Characterization and autoprocessing of precursor and mature forms of human immunodeficiency virus type 1 (HIV 1) protease purified from Escherichia coli , 1989, Proteins.

[6]  V. Kostka,et al.  Sub‐site preferences of the aspartic proteinase from the human immunodeficiency virus, HIV‐1 , 1990, FEBS letters.

[7]  E. Wimmer,et al.  Mutational analysis of a native substrate of the human immunodeficiency virus type 1 proteinase , 1990, Journal of virology.

[8]  C. Vlahos,et al.  Substitutions at the P2' site of gag p17-p24 affect cleavage efficiency by HIV-1 protease. , 1990, Biochemical and biophysical research communications.

[9]  V. Kostka,et al.  Hydrolysis of synthetic chromogenic substrates by HIV-1 and HIV-2 proteinases. , 1990, Biochemical and biophysical research communications.

[10]  M. Oswald,et al.  Fibronectin is a non‐viral substrate for the HIV proteinase , 1991, FEBS letters.

[11]  A. Israël,et al.  Processing of the precursor of NF-κB by the HIV-1 protease during acute infection , 1991, Nature.

[12]  D. Decamp,et al.  Calcium‐free calmodulin is a substrate of proteases from human immunodeficiency viruses 1 and 2 , 1991, Proteins.

[13]  Synthesis of homologous peptides using fragment condensation: analogs of an HIV proteinase substrate. , 2009, International journal of peptide and protein research.

[14]  A. Tomasselli,et al.  Actin, troponin C, Alzheimer amyloid precursor protein and pro-interleukin 1 beta as substrates of the protease from human immunodeficiency virus. , 1991, The Journal of biological chemistry.

[15]  A. Tomasselli,et al.  A cumulative specificity model for proteases from human immunodeficiency virus types 1 and 2, inferred from statistical analysis of an extended substrate data base. , 1991, The Journal of biological chemistry.

[16]  K C Chou,et al.  A correlation-coefficient method to predicting protein-structural classes from amino acid compositions. , 1992, European journal of biochemistry.

[17]  A. Tomasselli,et al.  Purification and characterization of heterodimeric human immunodeficiency virus type 1 (HIV-1) reverse transcriptase produced by in vitro processing of p66 with recombinant HIV-1 protease. , 1992, The Journal of biological chemistry.

[18]  C. Debouck,et al.  Proteolysis of an active site peptide of lactate dehydrogenase by human immunodeficiency virus type 1 protease. , 1992, Biochemistry.

[19]  J. Louis,et al.  Kinetic and modeling studies of S3-S3' subsites of HIV proteinases. , 1992, Biochemistry.

[20]  A. Wlodawer,et al.  Different requirements for productive interaction between the active site of HIV-1 proteinase and substrates containing -hydrophobic*hydrophobic- or -aromatic*pro- cleavage sites. , 1992, Biochemistry.

[21]  D B Evans,et al.  Human immunodeficiency virus type‐1 reverse transcriptase and ribonuclease h as substrates of the viral protease , 1993, Protein science : a publication of the Protein Society.

[22]  K. Chou,et al.  A vectorized sequence-coupling model for predicting HIV protease cleavage sites in proteins. , 1993, The Journal of biological chemistry.

[23]  K. Chou,et al.  Studies on the specificity of HIV protease: An application of Markov chain theory , 1993, Journal of protein chemistry.

[24]  C. Craik,et al.  Regulation of autoproteolysis of the HIV-1 and HIV-2 proteases with engineered amino acid substitutions. , 1993, The Journal of biological chemistry.

[25]  D. P. Brunner,et al.  Large scale purification and refolding of HIV-1 protease fromEscherichia coli inclusion bodies , 1993, Journal of protein chemistry.

[26]  K. Chou,et al.  A vector projection approach to predicting HIV protease cleavage sites in proteins , 1993, Proteins.

[27]  J. Chou,et al.  Predicting cleavability of peptide sequences by HIV protease via correlation-angle approach , 1993, Journal of protein chemistry.

[28]  A. Wlodawer,et al.  [14]Subsite preferences of retroviral proteinases , 1994 .

[29]  A. Tomasselli,et al.  Specificity of retroviral proteases: an analysis of viral and nonviral protein substrates. , 1994, Methods in enzymology.

[30]  C. Zhang,et al.  An alternate-subsite-coupled model for predicting HIV protease cleavage sites in proteins. , 1994, Protein engineering.

[31]  W. Howe,et al.  The HIV-1 protease as enzyme and substrate: mutagenesis of autolysis sites and generation of a stable mutant with retained kinetic properties. , 1994, Biochemistry.

[32]  R. Heinrikson,et al.  The differential processing of homodimers of reverse transcriptases from human immunodeficiency viruses type 1 and 2 is a consequence of the distinct specificities of the viral proteases. , 1995, The Journal of biological chemistry.