Comparative analysis of human and bovine papillomaviruses.

A method is presented for the analysis and comparison of nucleic acid and protein sequences utilizing all identity blocks (the term "identity block" refers to a set of consecutive matches between two sequences) above a prescribed length. Moreover, such identity blocks are determined for various groupings of amino acids according to chemical, functional, charge, and hydrophobic classifications. Alignment maps based on these classifications and containing all statistically significant identity blocks between two or more sequences are constructed. New theoretical results for determining the expected length of the longest identity block between sequences are also presented and are used, along with permutation procedures, to ascertain the significance of sequence identity blocks. As an example of the type of information that can be obtained, comparison has been made of the complete DNA sequences and the E1, E2, L1, and L2 genes of human and bovine papillomaviruses based on the classification schemes described above.

[1]  A. Mclachlan,et al.  Repeating sequences and gene duplication in proteins. , 1972, Journal of molecular biology.

[2]  W. Fitch An improved method of testing for evolutionary homology. , 1966, Journal of molecular biology.

[3]  E. Chen,et al.  Comparative analysis of the human type 1a and bovine type 1 papillomavirus genomes , 1983, Journal of virology.

[4]  R. Grantham Amino Acid Difference Formula to Help Explain Protein Evolution , 1974, Science.

[5]  J. Haber,et al.  An evaluation of the relatedness of proteins based on comparison of amino acid sequences. , 1970, Journal of molecular biology.

[6]  L. J. Korn,et al.  New approaches for computer analysis of nucleic acid sequences. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[7]  M. Yaniv,et al.  Human papillomavirus 1a complete DNA sequence: a novel type of genome organization among papovaviridae. , 1982, The EMBO journal.

[8]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[9]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[10]  T. Smith,et al.  Optimal sequence alignments. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[11]  D. Davoli,et al.  Molecular biology of papovaviruses. , 1977, Annual review of biochemistry.

[12]  M. I. Kanehisa,et al.  Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries , 1982, Nucleic Acids Res..

[13]  G. Orth,et al.  Chromatin-like structures obtained after alkaline disruption of bovine and human papillomaviruses , 1977, Journal of virology.

[14]  U. Pettersson,et al.  Sequences of bovine papillomavirus type 1 DNA--functional and evolutionary implications. , 1983, Nucleic acids research.

[15]  P. Sellers On the Theory and Computation of Evolutionary Distances , 1974 .

[16]  D. Lipman,et al.  Rapid similarity searches of nucleic acid and protein data banks. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[17]  M. Botchan,et al.  Bovine papilloma virus contains an activator of gene expression at the distal end of the early transcription unit , 1983, Molecular and cellular biology.

[18]  D. Lowy,et al.  Mouse cells transformed by bovine papillomavirus contain only extrachromosomal viral DNA sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[19]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[20]  C. Manwell Molecular palaeogenetics: amino acid sequence homology in ribonuclease and lysozyme. , 1967, Comparative biochemistry and physiology.

[21]  Peter H. Seeburg,et al.  The primary structure and genetic organization of the bovine papillomavirus type 1 genome , 1982, Nature.