Patterns of Amino Acids near Signal‐Sequence Cleavage Sites

According to the signal hypothesis, a signal sequence, once having initiated export of a growing protein chain across the rough endoplasmic reticulum, is cleaved from the mature protein at a specific site. It has long been known that some part of the cleavage specificity resides in the last residue of the signal sequence, which invariably is one with a small, uncharged side-chain, but no further specific patterns of amino acids near the point of cleavage have been discovered so far. In this paper, some such patterns, based on a sample of 78 eukaryotic signal sequences, are presented and discussed, and a first attempt at formulating rules for the prediction of cleavage sites is made.

[1]  N. Rosenthal,et al.  The structure and evolution of the two nonallelic rat preproinsulin genes , 1979, Cell.

[2]  J. Taylor,et al.  Nucleotide sequence of rat alpha 1-acid glycoprotein messenger RNA. , 1981, The Journal of biological chemistry.

[3]  P. Seeburg,et al.  The structure of eight distinct cloned human leukocyte interferon cDNAs , 1981, Nature.

[4]  Stanley N Cohen,et al.  Nucleotide sequence of cloned cDNA for bovine corticotropin-β-lipotropin precursor , 1979, Nature.

[5]  A. Strauss,et al.  Compartmentation of newly synthesized proteins. , 1982, CRC critical reviews in biochemistry.

[6]  H. Jörnvall,et al.  Multiple mRNA species for the precursor to an adenovirus-encoded glycoprotein: identification and structure of the signal sequence. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[7]  W. Rutter,et al.  Nucleotide sequence of a cDNA clone encoding human preproinsulin , 1979, Nature.

[8]  D. Goeddel,et al.  Structure of the human immune interferon gene , 1982, Nature.

[9]  U. Vilas,et al.  Cleavage of honeybee prepromelittin by an endoprotease from rat liver microsomes: identification of intact signal peptide. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[10]  T. Taniguchi,et al.  Human leukocyte and fibroblast interferons are structurally related , 1980, Nature.

[11]  J. Donelson,et al.  Point mutations during generation of expression-linked extra copy of trypanosome surface glycoprotein gene , 1982, Nature.

[12]  D. N. Ward,et al.  Progesterone-induced secretory protein. NH2-Terminal sequence of pre-uteroglobin. , 1979, The Journal of biological chemistry.

[13]  D. Huylebroeck,et al.  Complete structure of the hemagglutinin gene from the human influenza A/Victoria/3/75 (H3N2) strain as determined from cloned DNA , 1980, Cell.

[14]  A. Alberts,et al.  Rat liver pre-proalbumin: complete amino acid sequence of the pre-piece. Analysis of the direct translation product of albumin messenger RNA. , 1977, The Journal of biological chemistry.

[15]  D. Hue,et al.  Cell-free synthesis, proteolytic processing, core glycosylation, and amino terminal sequence of rabbit pre-α-lactalbumin , 1982 .

[16]  W. Rutter,et al.  Rat preprocarboxypeptidase A: cDNA sequence and preliminary characterization of the gene. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[17]  C. Slaughter,et al.  Somatic mutation in genes for the variable portion of the immunoglobulin heavy chain. , 1982, Science.

[18]  R. Goodman,et al.  Nucleotide sequence of a cloned structural gene coding for a precursor of pancreatic somatostatin. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[19]  R. Goodman,et al.  Calcitonin messenger RNA encodes multiple polypeptides in a single precursor. , 1981, Science.

[20]  W. A. Bradley,et al.  Amino acid sequence of the signal peptide of apoVLDL-II, a major apoprotein in avian very low density lipoproteins. , 1980, The Journal of biological chemistry.

[21]  Pierre Corvol,et al.  Complete amino acid sequence and maturation of the mouse submaxillary gland renin precursor , 1982, Nature.

[22]  E. Appella,et al.  Complete sequence analysis of cDNA clones encoding rat whey phosphoprotein: homology to a protease inhibitor. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[23]  K. Kurachi,et al.  Sequence homology and structural comparison between the chromosomal human α1-antitrypsin and chicken ovalbumin genes , 1982, Nature.

[24]  G. Scheele,et al.  Amino acid sequences of transport peptides associated with canine exocrine pancreatic proteins. , 1982, The Journal of biological chemistry.

[25]  Y. Burstein,et al.  Primary structure of the NH2-terminal extra piece of the precursor to human placental lactogen. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Peter H. Seeburg,et al.  Nucleotide sequence and amplification in bacteria of structural gene for rat growth hormone , 1977, Nature.

[27]  J. Shine,et al.  Molecular cloning and characterization of cDNA sequences coding for rat relaxin , 1981, Nature.

[28]  B. Austen,et al.  Predicted secondary structures of amino‐terminal extension sequences of secreted proteins , 1979, FEBS letters.

[29]  R. Goodman,et al.  Pancreatic preproglucagon cDNA contains two glucagon-related coding sequences arranged in tandem. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Howard M. Goodman,et al.  Isolation, cloning and sequence analysis of the cDNA for the α-subunit of human chorionic gonadotropin , 1979, Nature.

[31]  Shigetada Nakanishi,et al.  Cloning and sequence analysis of cDNA for porcine β-neo-endorphin/dynorphin precursor , 1982, Nature.

[32]  G. Blobel,et al.  Translocation of proteins across the endoplasmic reticulum. I. Signal recognition protein (SRP) binds to in-vitro-assembled polysomes synthesizing secretory protein , 1981, The Journal of cell biology.

[33]  W. Rutter,et al.  Comparison of the nucleic acid sequence of anglerfish and mammalian insulin mRNA's from cloned cDNA's. , 1980, Science.

[34]  D. Givol,et al.  Diversity of germ-line immunoglobulin VH genes , 1981, Nature.

[35]  P. Y. Chou,et al.  Empirical predictions of protein conformation. , 1978, Annual review of biochemistry.

[36]  G. Blobel,et al.  A signal sequence for the insertion of a transmembrane glycoprotein. Similarities to the signals of secretory proteins in primary structure and function. , 1978, The Journal of biological chemistry.

[37]  J. Martial,et al.  Molecular cloning of DNA complementary to bovine growth hormone mRNA. , 1980, The Journal of biological chemistry.

[38]  G. Blobel,et al.  Synthesis in vitro and translocation of apolipoprotein AI across microsomal vesicles. , 1981, European journal of biochemistry.

[39]  R. Canfield,et al.  The amino acid sequences of the prepeptides contained in the alpha and beta subunits of human choriogonadotropin. , 1981, The Journal of biological chemistry.

[40]  R. Deschenes,et al.  Sequence of a cDNA encoding pancreatic preprosomatostatin-22. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[41]  D. Steiner,et al.  Messenger RNA sequence and primary structure of preproinsulin in a primitive vertebrate, the Atlantic hagfish. , 1981, The Journal of biological chemistry.

[42]  J. Foster,et al.  Primary structure of the signal peptide of tropoelastin b. , 1981, The Journal of biological chemistry.

[43]  O. Yoo,et al.  Molecular cloning and nucleotide sequence of full-length of cDNA coding for porcine gastrin. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[44]  J. Martial,et al.  Human growth hormone: complementary DNA cloning and expression in bacteria. , 1979, Science.

[45]  G. Heijne Signal sequences are not uniformly hydrophobic , 1982 .

[46]  S. Law,et al.  Nucleotide sequence and the encoded amino acids of human serum albumin mRNA. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[47]  J. Bonner,et al.  Nucleotide sequence of cloned rat serum albumin messenger RNA. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[48]  W. Rutter,et al.  Cloning and sequence analysis of cDNAs encoding two distinct somatostatin precursors found in the endocrine pancreas of anglerfish , 1980, Nature.

[49]  Gunnar von Heijne,et al.  On the Hydrophobic Nature of Signal Sequences , 1981 .

[50]  Y. Burstein,et al.  IMMUNOGLOBULIN PRECURSORS: STRUCTURE, FUNCTION, GENE‐PROTEIN CORRELATION AND EVOLUTION * , 1980, Annals of the New York Academy of Sciences.

[51]  Peter H. Seeburg,et al.  Primary structure of the human Met- and Leu-enkephalin precursor and its mRNA , 1982, Nature.

[52]  M. Parker,et al.  Prostatic steroid binding protein: gene duplication and steroid binding , 1982, Nature.

[53]  M. Pincus,et al.  Prediction of the three-dimensional structure of the leader sequence of pre-kappa light chain, a hexadecapeptide. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[54]  A. Anilionis,et al.  Structure of the glycoprotein gene in rabies virus , 1981, Nature.

[55]  R. Palmiter,et al.  COTRANSLATIONAL SEQUESTRATION OF EGG WHITE PROTEINS AND PLACENTAL LACTOGEN INSIDE MEMBRANE VESICLES * , 1980, Annals of the New York Academy of Sciences.

[56]  R. Maurer,et al.  Complete amino acid sequence of the precursor region of rat prolactin. , 1978, Biochemistry.

[57]  H. Kronenberg,et al.  Nucleotide sequence of the mRNA encoding the pre-alpha-subunit of mouse thyrotropin. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[58]  P. Gaye,et al.  STUDY OF SECRETORY LACTOPROTEINS: PRIMARY STRUCTURES OF THE SIGNALS AND ENZYMATIC PROCESSING , 1980, Annals of the New York Academy of Sciences.

[59]  D. Goeddel,et al.  DNA sequence of two closely linked human leukocyte interferon genes. , 1981, Science.

[60]  J Nathans,et al.  Cloning and nucleotide sequence of DNA coding for bovine preproparathyroid hormone. , 1979, Proceedings of the National Academy of Sciences of the United States of America.