Multidomain Structure and Cellulosomal Localization of the Clostridium thermocellum Cellobiohydrolase CbhA

The nucleotide sequence of the Clostridium thermocellum F7 cbhA gene, coding for the cellobiohydrolase CbhA, has been determined. An open reading frame encoding a protein of 1,230 amino acids was identified. Removal of a putative signal peptide yields a mature protein of 1,203 amino acids with a molecular weight of 135,139. Sequence analysis of CbhA reveals a multidomain structure of unusual complexity consisting of an N-terminal cellulose binding domain (CBD) homologous to CBD family IV, an immunoglobulin-like b-barrel domain, a catalytic domain homologous to cellulase family E1, a duplicated domain similar to fibronectin type III (Fn3) modules, a CBD homologous to family III, a highly acidic linker region, and a C-terminal dockerin domain. The cellulosomal localization of CbhA was confirmed by Western blot analysis employing polyclonal antibodies raised against a truncated enzymatically active version of CbhA. CbhA was identified as cellulosomal subunit S3 by partial amino acid sequence analysis. Comparison of the multidomain structures indicates striking similarities between CbhA and a group of cellulases from actinomycetes. Average linkage cluster analysis suggests a coevolution of the N-terminal CBD and the catalytic domain and its spread by horizontal gene transfer among gram-positive cellulolytic bacteria.

[1]  K. Sakka,et al.  Thermocellum Cellulosome. Major Component of the Clostridium Sequence of Xync and Properties of Xync, A , 1997 .

[2]  V. Zverlov,et al.  Highly thermostable endo-1,3-beta-glucanase (laminarinase) LamA from Thermotoga neapolitana: nucleotide sequence of the gene and characterization of the recombinant gene product. , 1997, Microbiology.

[3]  E. Bayer,et al.  A cohesin domain from Clostridium thermocellum: the crystal structure provides new insights into cellulosome assembly. , 1997, Structure.

[4]  P. Argos,et al.  Seventy‐five percent accuracy in protein secondary structure prediction , 1997, Proteins.

[5]  M. D. Joshi,et al.  Structure of the N-terminal cellulose-binding domain of Cellulomonas fimi CenC determined by nuclear magnetic resonance spectroscopy. , 1996, Biochemistry.

[6]  C. Haynes,et al.  Interaction of polysaccharides with the N-terminal cellulose-binding domain of Cellulomonas fimi CenC. 1. Binding specificity and calorimetric analysis. , 1996, Biochemistry.

[7]  M. D. Joshi,et al.  Interaction of soluble cellooligosaccharides with the N-terminal cellulose-binding domain of Cellulomonas fimi CenC 2. NMR and ultraviolet absorption spectroscopy. , 1996, Biochemistry.

[8]  T. Steitz,et al.  Crystal structure of a bacterial family‐III cellulose‐binding domain: a general mechanism for attachment to cellulose. , 1996, The EMBO journal.

[9]  I D Campbell,et al.  Structure and function of fibronectin modules. , 1996, Matrix biology : journal of the International Society for Matrix Biology.

[10]  K. Sakka,et al.  Cloning, DNA sequencing, and expression of the gene encoding Clostridium thermocellum cellulase CelJ, the largest catalytic component of the cellulosome , 1996, Journal of bacteriology.

[11]  P. Béguin,et al.  The cellulosome: an exocellular, multiprotein complex specialized in cellulose degradation. , 1996, Critical reviews in biochemistry and molecular biology.

[12]  L. Ljungdahl,et al.  Dissociation of the cellulosome of Clostridium thermocellum in the presence of ethylenediaminetetraacetic acid occurs with the formation of trucated polypeptides. , 1996, Biochemistry.

[13]  J. Wu,et al.  Interactions of the CelS binding ligand with various receptor domains of the Clostridium thermocellum cellulosomal scaffolding protein, CipA , 1996, Journal of bacteriology.

[14]  J. Wu,et al.  Exoglucanase activities of the recombinant Clostridium thermocellum CelS, a major cellulosome component , 1995, Journal of bacteriology.

[15]  H. Hinssen,et al.  A gelsolin-related protein from lobster muscle: cloning, sequence analysis and expression. , 1995, The Biochemical journal.

[16]  E. Forano,et al.  Gene sequence and analysis of protein domains of EGB, a novel family E endoglucanase from Fibrobacter succinogenes S85. , 1994, FEMS microbiology letters.

[17]  I. Campbell,et al.  Building proteins with fibronectin type III modules. , 1994, Structure.

[18]  Raphael Lamed,et al.  Cellulase Ss (CelS) is synonymous with the major cellobiohydrolase (subunit S8) from the cellulosome ofClostridium thermocellum , 1993, Applied biochemistry and biotechnology.

[19]  D. Wilson,et al.  DNA sequences and expression in Streptomyces lividans of an exoglucanase gene and an endoglucanase gene from Thermomonospora fusca , 1993, Applied and environmental microbiology.

[20]  V. Akimenko,et al.  Isolation of a cellobiohydrolase of Clostridium thermocellum capable of degrading natural crystalline substrates. , 1993, Biochemical and Biophysical Research Communications - BBRC.

[21]  J. Aubert,et al.  Organization of a Clostridium thermocellum gene cluster encoding the cellulosomal scaffolding protein CipA and a protein possibly involved in attachment of the cellulosome to the cell surface , 1993, Journal of bacteriology.

[22]  J. Wu,et al.  Cloning and DNA sequence of the gene coding for Clostridium thermocellum cellulase Ss (CelS), a major cellulosome component , 1993, Journal of bacteriology.

[23]  H. Gilbert,et al.  Gene sequence and properties of CelI, a family E endoglucanase from Clostridium thermocellum. , 1993, Journal of general microbiology.

[24]  S. Walter,et al.  The gene encoding the cellulase (Avicelase) Cell from Streptomyces reticuli and analysis of protein domains , 1992, Molecular microbiology.

[25]  Jonathan Boyd,et al.  The three-dimensional structure of the tenth type III module of fibronectin: An insight into RGD-mediated interactions , 1992, Cell.

[26]  C. Gaudin,et al.  Sequence analysis of a gene cluster encoding cellulases from Clostridium cellulolyticum. , 1992, Gene.

[27]  C. K. Hansen,et al.  celA from Bacillus lautus PL236 encodes a novel cellulose-binding endo-beta-1,4-glucanase , 1992, Journal of bacteriology.

[28]  P. Alzari,et al.  Three-dimensional structure of a thermostable bacterial cellulase , 1992, Nature.

[29]  E. Bayer,et al.  Affinity digestion for the near-total recovery of purified cellulosome from Clostridium thermocellum , 1992 .

[30]  D. Kilburn,et al.  Nucleotide sequence of the endoglucanase C gene (cenC) of Cellulomonas fimi, its high‐level expression in Escherichia coli, and characterization of its products , 1991, Molecular microbiology.

[31]  R. A. Grayling,et al.  celB, a gene coding for a bifunctional cellulase from the extreme thermophile "Caldocellum saccharolyticum" , 1990, Applied and environmental microbiology.

[32]  B Henrissat,et al.  Hydrophobic cluster analysis: procedures to derive structural and functional information from 2-D-representation of protein sequences. , 1990, Biochimie.

[33]  V. Zverlov,et al.  Cloning and expression of Clostridium thermocellum genes coding for thermostable exoglucanases (cellobiohydrolases) in Escherichia coli cells. , 1990, Biochemical and biophysical research communications.

[34]  J. Risler,et al.  Amino acid substitutions in structurally related proteins. A pattern recognition approach. Determination of a new and efficient scoring matrix. , 1988, Journal of molecular biology.

[35]  B Henrissat,et al.  Hydrophobic-cluster analysis of plant protein sequences. A domain homology between storage and lipid-transfer proteins. , 1988, The Biochemical journal.

[36]  A. Demain,et al.  Two components of an extracellular protein aggregate of Clostridium thermocellum together degrade crystalline cellulose , 1988 .

[37]  Michael P. Coughlan,et al.  Macromolecular Organization of the Cellulolytic Enzyme Complex of Clostridium thermocellum as Revealed by Electron Microscopy , 1987, Applied and environmental microbiology.

[38]  J. Mornon,et al.  Hydrophobic cluster analysis: An efficient new way to compare and analyse amino acid sequences , 1987, FEBS letters.

[39]  J. Aubert,et al.  Nucleotide sequence of the cellulase gene celD encoding endoglucanase D of Clostridium thermocellum. , 1986, Nucleic acids research.

[40]  A. Demain,et al.  Chemically Defined Minimal Medium for Growth of the Anaerobic Cellulolytic Thermophile Clostridium thermocellum , 1981, Applied and environmental microbiology.

[41]  H. Ross Principles of Numerical Taxonomy , 1964 .

[42]  J. Thompson,et al.  Using CLUSTAL for multiple sequence alignments. , 1996, Methods in enzymology.

[43]  K. Katô [Immunoglobulin superfamily]. , 1996, Rinsho byori. The Japanese journal of clinical pathology.

[44]  R. Doolittle The multiplicity of domains in proteins. , 1995, Annual review of biochemistry.

[45]  N. Gilkes,et al.  Cellulose hydrolysis by bacteria and fungi. , 1995, Advances in microbial physiology.

[46]  M. Rabinovich,et al.  [Cellobiohydrolase from Clostridium thermocellum, synthesized by a recombinant E. coli strain]. , 1991, Биохимия.

[47]  G von Heijne,et al.  Signal sequences. The limits of variation. , 1985, Journal of molecular biology.

[48]  Kenneth M. Yamada,et al.  STRUCTURE AND FUNCTION OF FIBRONECTIN , 1982 .