Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra α-helical domain

The key enzyme in coronavirus polyprotein processing is the viral main proteinase, Mpro, a protein with extremely low sequence similarity to other viral and cellular proteinases. Here, the crystal structure of the 33.1 kDa transmissible gastroenteritis (corona)virus Mpro is reported. The structure was refined to 1.96 Å resolution and revealed three dimers in the asymmetric unit. The mutual arrangement of the protomers in each of the dimers suggests that Mpro self‐processing occurs in trans. The active site, comprised of Cys144 and His41, is part of a chymotrypsin‐like fold that is connected by a 16 residue loop to an extra domain featuring a novel α‐helical fold. Molecular modelling and mutagenesis data implicate the loop in substrate binding and elucidate S1 and S2 subsites suitable to accommodate the side chains of the P1 glutamine and P2 leucine residues of Mpro substrates. Interactions involving the N‐terminus and the α‐helical domain stabilize the loop in the orientation required for trans‐cleavage activity. The study illustrates that RNA viruses have evolved unprecedented variations of the classical chymotrypsin fold.

[1]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[2]  J. Ziebuhr,et al.  Mutational analysis of the active centre of coronavirus 3C-like proteases. , 2002, The Journal of general virology.

[3]  Z. Otwinowski,et al.  Processing of X-ray diffraction data collected in oscillation mode. , 1997, Methods in enzymology.

[4]  H. Klenk,et al.  The Coronaviridae , 1995, The Viruses.

[5]  W A Hendrickson,et al.  Selenomethionyl proteins produced for analysis by multiwavelength anomalous diffraction (MAD): a vehicle for direct determination of three‐dimensional structure. , 1990, The EMBO journal.

[6]  M. James,et al.  Refined X-ray crystallographic structure of the poliovirus 3C gene product. , 1997, Journal of molecular biology.

[7]  L. Polgár Mercaptide—imidazolium ion‐pair: The reactive nucleophile in papain catalysis , 1974, FEBS letters.

[8]  L. F. Ng,et al.  Further Characterization of the Coronavirus Infectious Bronchitis Virus 3C-like Proteinase and Determination of a New Cleavage Site , 2000, Virology.

[9]  I. G. Kamphuis,et al.  Structure of papain refined at 1.65 A resolution. , 1984, Journal of molecular biology.

[10]  C. Sánchez,et al.  Complete Genome Sequence of Transmissible Gastroenteritis Coronavirus PUR46-MAD Clone and Evolution of the Purdue Virus Cluster , 2004, Virus Genes.

[11]  H. Laude,et al.  Complete Sequence (20 Kilobases) of the Polyprotein-Encoding Gene 1 of Transmissible Gastroenteritis Virus , 1995, Virology.

[12]  J. Ziebuhr,et al.  Biosynthesis, purification, and characterization of the human coronavirus 229E 3C-like proteinase , 1997, Journal of virology.

[13]  Xiaotao Lu,et al.  Intracellular andin Vitro-Translated 27-kDa Proteins Contain the 3C-like Proteinase Activity of the Coronavirus MHV-A59 , 1996, Virology.

[14]  K. Sharp,et al.  Protein folding and association: Insights from the interfacial and thermodynamic properties of hydrocarbons , 1991, Proteins.

[15]  E. Wimmer,et al.  Interaction between the 5'-terminal cloverleaf and 3AB/3CDpro of poliovirus is essential for RNA replication , 1995, Journal of virology.

[16]  S. Baker,et al.  Processing of the Coronavirus MHV-JHM Polymerase Polyprotein: Identification of Precursors and Proteolytic Products Spanning 400 Kilodaltons of ORF1a☆ , 1998, Virology.

[17]  J. Ziebuhr,et al.  Conservation of substrate specificities among coronavirus main proteases. , 2002, The Journal of general virology.

[18]  Herbert A. Hauptman A minimal principle in the phase problem , 1992 .

[19]  Stuart G. Siddell,et al.  Processing of the Human Coronavirus 229E Replicase Polyproteins by the Virus-Encoded 3C-Like Proteinase: Identification of Proteolytic Products and Cleavage Sites Common to pp1a and pp1ab , 1999, Journal of Virology.

[20]  Randy J. Read,et al.  Crystal and molecular structures of the complex of α-chymotrypsin with its inhibitor Turkey ovomucoid third domain at 1.8 Å resolution , 1987 .

[21]  D. Blow,et al.  Structure of alpha-chymotrypsin refined at 1.68 A resolution. , 1985, Journal of molecular biology.

[22]  J. D. den Boon,et al.  Equine arteritis virus is not a togavirus but belongs to the coronaviruslike superfamily , 1991, Journal of virology.

[23]  Russ Miller,et al.  The design and implementation of SnB version 2.0 , 1999 .

[24]  J. Ziebuhr,et al.  Expression and characterization of a recombinant murine coronavirus 3C-like proteinase. , 1997, The Journal of general virology.

[25]  D. Matthews,et al.  Structure-assisted design of mechanism-based irreversible inhibitors of human rhinovirus 3C protease with potent antiviral activity against multiple rhinovirus serotypes. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Xiaotao Lu,et al.  Identification and characterization of a serine-like proteinase of the murine coronavirus MHV-A59 , 1995, Journal of virology.

[27]  L. Enjuanes,et al.  Molecular Basis of Transmissible Gastroenteritis Virus Epidemiology , 1995 .

[28]  D. Bacon,et al.  A fast algorithm for rendering space-filling molecule pictures , 1988 .

[29]  A. Brunger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. , 1992 .

[30]  K D Cowtan,et al.  Phase combination and cross validation in iterated density-modification calculations. , 1996, Acta crystallographica. Section D, Biological crystallography.

[31]  A. Brünger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures , 1992, Nature.

[32]  R. Andino,et al.  Poliovirus RNA synthesis utilizes an RNP complex formed around the 5′‐end of viral RNA. , 1993, The EMBO journal.

[33]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[34]  V. Blinov,et al.  Cysteine proteases of positive strand RNA viruses and chymotrypsin‐like serine proteases , 1989, FEBS letters.

[35]  T. Brown,et al.  Characterisation and Mutational Analysis of an ORF 1a-Encoding Proteinase Domain Responsible for Proteolytic Processing of the Infectious Bronchitis Virus 1a/1b Polyprotein , 1995, Virology.

[36]  A. Porter,et al.  Human rhinovirus-14 protease 3C (3Cpro) binds specifically to the 5'-noncoding region of the viral RNA. Evidence that 3Cpro has different domains for the RNA binding and proteolytic activities. , 1993, The Journal of biological chemistry.

[37]  A. Palmenberg Proteolytic processing of picornaviral polyprotein. , 1990, Annual review of microbiology.

[38]  B. Malcolm,et al.  The picornaviral 3C proteinases: Cysteine nucleophiles in serine proteinase folds , 1995, Protein science : a publication of the Protein Society.

[39]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[40]  D. Cavanagh Nidovirales: a new order comprising Coronaviridae and Arteriviridae. , 1997, Archives of virology.

[41]  M. Denison,et al.  Determinants of Mouse Hepatitis Virus 3C-like Proteinase Activity , 1997, Virology.

[42]  J. Thompson,et al.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. , 1997, Nucleic acids research.

[43]  D. Matthews,et al.  Structure of human rhinovirus 3C protease reveals a trypsin-like polypeptide fold, RNA-binding site, and means for cleaving precursor polyprotein , 1994, Cell.

[44]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[45]  David R. Gilbert,et al.  Motif-based searching in TOPS protein topology databases , 1999, Bioinform..

[46]  M. James,et al.  The structure of the 2A proteinase from a common cold virus: a proteinase responsible for the shut‐off of host‐cell protein synthesis , 1999, The EMBO journal.

[47]  M. James,et al.  The refined crystal structure of the 3C gene product from hepatitis A virus: specific proteinase activity and RNA recognition , 1997, Journal of virology.

[48]  S J Wodak,et al.  SFCHECK: a unified set of procedures for evaluating the quality of macromolecular structure-factor data and their agreement with the atomic model. , 1999, Acta crystallographica. Section D, Biological crystallography.

[49]  J. Zou,et al.  Improved methods for building protein models in electron density maps and the location of errors in these models. , 1991, Acta crystallographica. Section A, Foundations of crystallography.

[50]  L. Delbaere,et al.  Refined structure of alpha-lytic protease at 1.7 A resolution. Analysis of hydrogen bonding and solvent structure. , 1985, Journal of molecular biology.

[51]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[52]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[53]  Jean-Claude Thierry,et al.  Crystallographic computing 5: from chemistry to biology: papers presented at the International School in crystallographic computing held at Bischenberg, France, 29 July -- 5 August 1990 , 1992 .

[54]  V. Blinov,et al.  Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis. , 1989, Nucleic acids research.

[55]  Collaborative Computational,et al.  The CCP4 suite: programs for protein crystallography. , 1994, Acta crystallographica. Section D, Biological crystallography.

[56]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[57]  J. Ziebuhr,et al.  Virus-encoded proteinases and proteolytic processing in the Nidovirales. , 2000, The Journal of general virology.

[58]  M. Weiss,et al.  On the use of the merging R factor as a quality indicator for X-ray data , 1997 .

[59]  J. Ziebuhr,et al.  Characterization of a human coronavirus (strain 229E) 3C-like proteinase activity , 1995, Journal of virology.

[60]  L. Delbaere,et al.  Structures of product and inhibitor complexes of Streptomyces griseus protease A at 1.8 A resolution. A model for serine protease catalysis. , 1980, Journal of molecular biology.

[61]  R. Fletterick,et al.  Viral cysteine proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[62]  Marc Allaire,et al.  Picornaviral 3C cysteine proteinases have a fold similar to chymotrypsin-like serine proteinases , 1994, Nature.

[63]  C. Grose,et al.  Site-directed mutagenesis of herpesvirus glycoprotein phosphorylation sites by recombination polymerase chain reaction. , 1992, PCR methods and applications.

[64]  B. Matthews Solvent content of protein crystals. , 1968, Journal of molecular biology.