Phylogeny of the serpin superfamily: implications of patterns of amino acid conservation for structure and function.

We present a comprehensive alignment and phylogenetic analysis of the serpins, a superfamily of proteins with known members in higher animals, nematodes, insects, plants, and viruses. We analyze, compare, and classify 219 proteins representative of eight major and eight minor subfamilies, using a novel technique of consensus analysis. Patterns of sequence conservation characterize the family as a whole and relate clearly to the mechanism of function. Variations of these patterns within phylogenetically distinct groups can be correlated with the divergence of structure and function. The goals of this work are to provide a carefully curated alignment of serpin sequences, to describe patterns of conservation and divergence, and to derive a phylogenetic tree expressing the relationships among the members of this family. We extend earlier studies by Huber and Carrell and by Marshall; since their publication, the serpin family has grown functionally, taxonomically, and structurally. We used gene and protein sequence data, crystal structures, and chromosomal location where available. The results illuminate structure-function relationships in serpins, suggesting roles for conserved residues in the mechanism of conformational change. The phylogeny provides a rational evolutionary framework for classifying serpins and enables identification of conserved amino acids. Patterns of conservation also provide an initial point of comparison for genes identified by the various genome projects. New homologs emerging from sequencing projects can either take their place within the current classification or, if necessary, extend it.
