Establishing homologies in protein sequences.

Computer-based statistical techniques used to determine homologies between proteins occurring in different species are reviewed. The technique is based on comparison of two protein sequences, either by relating all segments of a given length in one sequence to all segments of the second or by finding the best alignment of the two sequences. Approaches discussed include selection using printed tabulations, identification of very similar sequences, and computer searches of a database. The use of the SEARCH, RELATE, and ALIGN programs (Dayhoff, 1979) is explained; sample data are presented in graphs, diagrams, and tables and the construction of scoring matrices is considered.

[1]  H. Matsubara,et al.  Amino acid sequence of chromatium vinosum ferredoxin: revisions. , 1977, Journal of biochemistry.

[2]  A. Cornish-Bowden The random character of protein evolution and its effects on the reliability of phylogenetic information deduced from amino acid sequences and compositions. , 1980, The Biochemical journal.

[3]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[4]  M. O. Dayhoff,et al.  A surprising new protein superfamily containing ovalbumin, antithrombin-III, and alpha 1-proteinase inhibitor. , 1980, Biochemical and biophysical research communications.

[5]  A. Gibbs,et al.  The Diagram, a Method for Comparing Sequences , 1970 .

[6]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[7]  W. Fitch,et al.  An examination of the expected degree of sequence similarity that might arise in proteins that have converged to similar conformational states. The impact of such expectations on the search for homology between the structurally similar domains of rhodanese. , 1981, Journal of molecular biology.

[8]  R. G. Bartsch,et al.  The complete amino acid sequence of Chromatium high potential iron sulfur protein. , 1973, The Journal of biological chemistry.

[9]  D. Teller,et al.  Estimation of primary sequence homology from amino acid composition of evolutionary related proteins. , 1973, Journal of theoretical biology.

[10]  R. Laursen,et al.  The primary structure of an acidic protein from 50-S ribosomes of Escherichia coli which is involved in GTP hydrolysis dependent on elongation factors G and T. , 1973, European journal of biochemistry.

[11]  A. Mclachlan Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 . , 1971, Journal of molecular biology.

[12]  P. Sellers On the Theory and Computation of Evolutionary Distances , 1974 .

[13]  M. Tanaka,et al.  The amino acid sequence of Clostridium pasteurianum ferredoxin. , 1966, Biochemical and biophysical research communications.

[14]  Choh Hao Li,et al.  Human pituitary thyrotropin. The primary structure of the α and β subunits , 1977 .

[15]  J. Weltman,et al.  Relatedness among proteins: A new method of estimation and its application to immunoglobulins , 1971 .

[16]  Douglas R. Smith,et al.  Nucleotide sequence of the E. coli gene coding for dihydrofolate reductase , 2022 .

[17]  N. Yoshida,et al.  The amino acid sequence of ribonuclease St , 1976, FEBS letters.

[18]  R. Doolittle Similar amino acid sequences: chance or common ancestry? , 1981, Science.

[19]  W. W. Jong,et al.  The Amino-Acid Sequence of the αA2 Chain of Bovine α-Crystallin , 1973 .

[20]  A. Holmgren,et al.  Thioredoxin. 6. The amino acid sequence of the protein from escherichia coli B. , 1968, European journal of biochemistry.

[21]  J. Fox,et al.  The amino acid sequence of Desulfovibrio vulgaris flavodoxin. , 1977, Biochemical and biophysical research communications.

[22]  T. Takagi Complete amino acid sequence of human ?1-microglobulin , 1981 .

[23]  A. Stangl,et al.  Die Sequenzanalyse des β-Lactoglobulins , 1973 .

[24]  W. Fitch An improved method of testing for evolutionary homology. , 1966, Journal of molecular biology.

[25]  W. Fitch,et al.  Further improvements in the method of testing for evolutionary homology among proteins. , 1970, Journal of molecular biology.

[26]  H. Jörnvall Horse liver alcohol dehydrogenase. On the primary structures of the isoenzymes. , 1970, European journal of biochemistry.

[27]  Howard M. Goodman,et al.  Isolation, cloning and sequence analysis of the cDNA for the α-subunit of human chorionic gonadotropin , 1979, Nature.

[28]  T. Meyer,et al.  Primary structures of high potential, four-iron-sulfur ferredoxins from the pruple sulfur photosynthetic bacteria, Thiocapsa roseopersicina and chromatium gracile. , 1981, The Journal of biological chemistry.

[29]  H. Shapiro A multivariate statistical method for comparing protein amino acid compositions: studies of muscle actins and proteins derived from membranes and microtubular organelles. , 1971, Biochimica et biophysica acta.

[30]  D Coit,et al.  Human prolactin. cDNA structural analysis and evolutionary comparisons. , 1981, The Journal of biological chemistry.

[31]  F. Šorm,et al.  Covalent structure of bovine trypsinogen. The position of the remaining amides. , 1966, Biochemical and biophysical research communications.