Studies on the specificity of HIV protease: An application of Markov chain theory

A sequence-coupled (Markov chain) model is proposed to predict the cleavage sites in proteins by proteases with extended specificity subsites. In addition to the probability of an amino acid occurring at each of these subsites as observed from a training set of oligopeptides known cleavable by HIV protease, the conditional probabilities as reflected by the neighbor-coupled effect along the subsite sequence are also taken into account. These conditional probabilities are derived from an expanded training set consisting of sufficiently large peptide sequences generated by the Monte Carlo sampling process. Very high accuracy was obtained in predicting protein cleavage sites by both HIV-1 and HIV-2 proteases. The new method provides a rapid and accurate means for analyzing the specificity of HIV protease, and hence can be used to help find effective inhibitors of HIV protease as potential drugs against AIDS. The principle of this method can also be used to study the specificity of any multisubsite enzyme.

[1]  J Sninsky,et al.  HIV-1 isolates are rapidly evolving quasispecies: evidence for viral mixtures and preferred nucleotide substitutions. , 1989, Journal of acquired immune deficiency syndromes.

[2]  A Wlodawer,et al.  Structure of complex of synthetic HIV-1 protease with a substrate-based inhibitor at 2.3 A resolution. , 1989, Science.

[3]  P Martel,et al.  Biophysical aspects of neutron scattering from vibrational modes of proteins. , 1992, Progress in biophysics and molecular biology.

[4]  T. Copeland,et al.  Molecular characterization of gag proteins from simian immunodeficiency virus (SIVMne) , 1988, Journal of virology.

[5]  A. Meyerhans,et al.  Selection, recombination, and G----A hypermutation of human immunodeficiency virus type 1 genomes , 1991, Journal of virology.

[6]  J. Louis,et al.  Kinetic and modeling studies of S3-S3' subsites of HIV proteinases. , 1992, Biochemistry.

[7]  K. Chou,et al.  A vectorized sequence-coupling model for predicting HIV protease cleavage sites in proteins. , 1993, The Journal of biological chemistry.

[8]  John P. Overington,et al.  X-ray analysis of HIV-1 proteinase at 2.7 Å resolution confirms structural homology among retroviral enzymes , 1989, Nature.

[9]  M. Jaskólski,et al.  Conserved folding in retroviral proteases: crystal structure of a synthetic HIV-1 protease. , 1989, Science.

[10]  J. Chermann,et al.  Isolation of a T-lymphotropic retrovirus from a patient at risk for acquired immune deficiency syndrome (AIDS). , 1983, Science.

[11]  Gregory K. Miller,et al.  Elements of Applied Stochastic Processes , 1972 .

[12]  S. Wain-Hobson HIV genome variability in vivo , 1989, AIDS.

[13]  T. Miyata,et al.  Retroviral protease-like sequence in the yeast transposon Ty 1 , 1985, Nature.

[14]  B. Haynes,et al.  Frequent detection and isolation of cytopathic retroviruses (HTLV-III) from patients with AIDS and at risk for AIDS. , 1984, Science.

[15]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[16]  E. Wimmer,et al.  Proteolytic processing of polyproteins in the replication of RNA viruses. , 1989, Biochemistry.

[17]  A. Wlodawer,et al.  Different requirements for productive interaction between the active site of HIV-1 proteinase and substrates containing -hydrophobic*hydrophobic- or -aromatic*pro- cleavage sites. , 1992, Biochemistry.

[18]  L J Davis,et al.  Active human immunodeficiency virus protease is required for viral infectivity. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[19]  A. Berger,et al.  On the size of the active site in proteases. I. Papain. , 1967, Biochemical and biophysical research communications.

[20]  E. Wimmer,et al.  Mutational analysis of a native substrate of the human immunodeficiency virus type 1 proteinase , 1990, Journal of virology.

[21]  M. Navia,et al.  Three-dimensional structure of aspartyl protease from human immunodeficiency virus HIV-1 , 1989, Nature.

[22]  K. Chou,et al.  A correlation-coefficient method to predicting protein-structural classes from amino acid compositions. , 1992, European journal of biochemistry.

[23]  K. Chou,et al.  Monte Carlo simulation studies on the prediction of protein folding types from amino acid composition. , 1992, Biophysical journal.

[24]  C. Zhang,et al.  Diagrammatization of codon usage in 339 human immunodeficiency virus proteins and its biological implication. , 1992, AIDS research and human retroviruses.

[25]  S. Swaminathan,et al.  Molecular dynamics of HIV‐1 protease , 1992, Proteins.

[26]  J. Mrázek,et al.  Unusual codon usage of HIV , 1987, Nature.

[27]  William R. Taylor,et al.  A structural model for the retroviral proteases , 1987, Nature.

[28]  What can AIDS virus codon usage tell us? , 1986, Nature.

[29]  W. Kabsch,et al.  How good are predictions of protein secondary structure? , 1983, FEBS letters.