The Complete Genome Sequence of Severe Acute Respiratory Syndrome Coronavirus Strain HKU-39849 (HK-39)

The complete genomic nucleotide sequence (29.7 kb) of a Hong Kong severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV) strain, HK-39, has been determined. Phylogenetic analysis of the genomic sequence reveals it to be a distinct member of the Coronaviridae family. A 5′ RACE assay confirms the presence of at least six subgenomic transcripts, all containing the predicted intergenic sequences. Five open reading frames (ORFs), namely ORF1a, 1b, S, M, and N, are homologous to those of other CoV members, and three further unknown ORFs (X1, X2, and X3) have no counterpart in any other known CoV species. Optimal alignment and computer analysis of the homologous ORFs predict the characteristic structural and functional domains of the putative genes. The overall nucleotide conservation of the homologous ORFs is low (<5%) compared with other known CoVs, implying that HK-39 is a newly emerged SARS-CoV phylogenetically distant from other known members. SimPlot analysis supports this finding and also suggests that this novel virus is not the product of a recent recombination among any of the known characterized CoVs. Together, these results confirm that HK-39 is a novel and distinct member of the Coronaviridae family, of unknown origin. The completed genomic sequence will assist in tracing the origin of the virus.
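As an illustration of the kind of comparison behind a conservation figure, the following is a minimal sketch of pairwise nucleotide identity computed over two sequences that have already been aligned. It is not the authors' method: the function name `percent_identity`, the convention of skipping gap columns, and the toy sequences are all assumptions made for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned nucleotide sequences.

    Assumption (not from the paper): columns where either sequence
    has a gap ('-') are excluded from the comparison.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0  # columns with a base in both sequences
    matches = 0   # compared columns where the bases agree

    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == "-" or base_b == "-":
            continue  # skip gap columns
        compared += 1
        if base_a == base_b:
            matches += 1

    if compared == 0:
        return 0.0  # nothing comparable; report zero rather than divide by zero
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    print(percent_identity("ATGTTTAT-TTTC", "ATGTTCATATTAC"))
```

Applied per ORF to an alignment of HK-39 against another coronavirus, a function of this kind would give one identity value per homologous ORF; whether gap columns should count as mismatches is a design choice that changes the result.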
