Risk Assessment of Gastric Cancer Caused by Helicobacter pylori Using CagA Sequence Markers

Background As a marker of Helicobacter pylori, Cytotoxin-associated gene A (cagA) has been revealed to be the major virulence factor causing gastroduodenal diseases. However, the molecular mechanisms that underlie the development of different gastroduodenal diseases caused by cagA-positive H. pylori infection remain unknown. Current studies are limited to the evaluation of the correlation between diseases and the number of Glu-Pro-Ile-Tyr-Ala (EPIYA) motifs in the CagA strain. To further understand the relationship between CagA sequence and its virulence to gastric cancer, we proposed a systematic entropy-based approach to identify the cancer-related residues in the intervening regions of CagA and employed a supervised machine learning method for cancer and non-cancer cases classification. Methodology An entropy-based calculation was used to detect key residues of CagA intervening sequences as the gastric cancer biomarker. For each residue, both combinatorial entropy and background entropy were calculated, and the entropy difference was used as the criterion for feature residue selection. The feature values were then fed into Support Vector Machines (SVM) with the Radial Basis Function (RBF) kernel, and two parameters were tuned to obtain the optimal F value by using grid search. Two other popular sequence classification methods, the BLAST and HMMER, were also applied to the same data for comparison. Conclusion Our method achieved 76% and 71% classification accuracy for Western and East Asian subtypes, respectively, which performed significantly better than BLAST and HMMER. This research indicates that small variations of amino acids in those important residues might lead to the virulence variance of CagA strains resulting in different gastroduodenal diseases. This study provides not only a useful tool to predict the correlation between the novel CagA strain and diseases, but also a general new framework for detecting biological sequence biomarkers in population studies.

[1]  C. Mathers,et al.  Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008 , 2010, International journal of cancer.

[2]  L. Bravo,et al.  CagA C-terminal variations in Helicobacter pylori strains from Colombian patients with gastric precancerous lesions. , 2010, Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases.

[3]  A. Zuckerman,et al.  IARC Monographs on the Evaluation of Carcinogenic Risks to Humans , 1995, IARC monographs on the evaluation of carcinogenic risks to humans.

[4]  Xiaolian Gao,et al.  A Comprehensive Sequence and Disease Correlation Analyses for the C-Terminal Region of CagA Protein of Helicobacter pylori , 2009, PloS one.

[5]  Y. Liu,et al.  TGF-beta promotes invasion and metastasis of gastric cancer cells by increasing fascin1 expression via ERK and JNK signal pathways. , 2009, Acta biochimica et biophysica Sinica.

[6]  C. Olsen,et al.  Polymorphism in the CagA EPIYA Motif Impacts Development of Gastric Cancer , 2009, Journal of Clinical Microbiology.

[7]  M. S. McClain,et al.  Genome sequence analysis of Helicobacter pylori strains associated with gastric ulceration and gastric cancer , 2009, BMC Genomics.

[8]  Hong-Mei Zhang,et al.  Structural and functional diversity in the PAR1b/MARK2‐binding region of Helicobacter pylori CagA , 2008, Cancer science.

[9]  C. Sander,et al.  Determinants of protein function revealed by combinatorial entropy optimization , 2007, Genome Biology.

[10]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[11]  K. Chayama,et al.  Helicobacter pylori-Associated Gastritis Is Related to babA2 Expression without Heterogeneity of the 3′ Region of the cagA Genotype in Gastric Biopsy Specimens , 2007, Pathobiology.

[12]  Daniel Falush,et al.  An African origin for the intimate association between humans and Helicobacter pylori , 2007, Nature.

[13]  Zunwu Zhang The Risk of Gastric Cancer in Patients with Duodenal and Gastric Ulcer: Research Progresses and Clinical Implications , 2007, Journal of gastrointestinal cancer.

[14]  Takeshi Azuma,et al.  Structural Basis and Functional Consequence of Helicobacter pylori CagA Multimerization in Cells* , 2006, Journal of Biological Chemistry.

[15]  D. Kang,et al.  CagA-producing Helicobacter pylori and increased risk of gastric cancer: a nested case–control study in Korea , 2006, British Journal of Cancer.

[16]  T. Azuma,et al.  Relationship between the diversity of the cagA gene of Helicobacter pylori and gastric cancer in Okinawa, Japan , 2006, Journal of Gastroenterology.

[17]  T. Azuma,et al.  Influence of EPIYA-repeat polymorphism on the phosphorylation-dependent biological activity of Helicobacter pylori CagA. , 2006, Gastroenterology.

[18]  T. Azuma,et al.  Focal Adhesion Kinase Is a Substrate and Downstream Effector of SHP-2 Complexed with Helicobacter pylori CagA , 2006, Molecular and Cellular Biology.

[19]  里見 聡子 Relationship between the diversity of the cagA gene of Helicobacter pylori and gastric cancer in Okinawa, Japan , 2006 .

[20]  M. Kidd,et al.  Determinants and consequences of different levels of CagA phosphorylation for clinical isolates of Helicobacter pylori. , 2004, Gastroenterology.

[21]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[22]  Hyeyoung Kim,et al.  Helicobacter pylori in a Korean isolate activates mitogen-activated protein kinases, AP-1, and NF-κB and induces chemokine expression in gastric epithelial AGS cells , 2004, Laboratory Investigation.

[23]  Jaw-Town Lin,et al.  CagA Tyrosine Phosphorylation in Gastric Epithelial Cells Caused by Helicobacter pylori in Patients with Gastric Adenocarcinoma , 2003, Helicobacter.

[24]  S. Falkow,et al.  Disruption of the Epithelial Apical-Junctional Complex by Helicobacter pylori CagA , 2003, Science.

[25]  W. Birchmeier,et al.  Helicobacter pylori CagA protein targets the c-Met receptor and enhances the motogenic response , 2003, The Journal of cell biology.

[26]  Takeshi Azuma,et al.  Biological activity of the Helicobacter pylori virulence factor CagA is determined by variation in the tyrosine phosphorylation sites , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[27]  M. Selbach,et al.  Src Is the Kinase of the Helicobacter pylori CagA Protein in Vitro and in Vivo * , 2002, The Journal of Biological Chemistry.

[28]  R. Rappuoli,et al.  c‐Src/Lyn kinases activate Helicobacter pylori CagA through tyrosine phosphorylation of the EPIYA motifs , 2002, Molecular microbiology.

[29]  C. O'Morain,et al.  Helicobacter pylori Infection , 1994 .

[30]  R. P. Blankfield,et al.  Helicobacter pylori infection and the development of gastric cancer. , 2001, The New England journal of medicine.

[31]  M. Blaser,et al.  Helicobacter pylori and gastrointestinal tract adenocarcinomas , 2002, Nature Reviews Cancer.

[32]  Takeshi Azuma,et al.  SHP-2 Tyrosine Phosphatase as an Intracellular Target of Helicobacter pylori CagA Protein , 2001, Science.

[33]  R. Haas,et al.  Translocation of Helicobacter pylori CagA into gastric epithelial cells by type IV secretion. , 2000, Science.

[34]  B. Gold,et al.  The disease spectrum of Helicobacter pylori: the immunopathogenesis of gastroduodenal ulcer and gastric cancer. , 2000, Annual review of microbiology.

[35]  D. Graham,et al.  Relationship between the cagA 3' repeat region of Helicobacter pylori, gastric histology, and susceptibility to low pH. , 1999, Gastroenterology.

[36]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[37]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[38]  H. Adami,et al.  The risk of stomach cancer in patients with gastric or duodenal ulcer disease. , 1996, The New England journal of medicine.

[39]  M. Blaser,et al.  Infection with Helicobacter pylori strains possessing cagA is associated with an increased risk of developing adenocarcinoma of the stomach. , 1995, Cancer research.

[40]  H. Vainio,et al.  Working group report on schistosomes, liver flukes and Helicobacter pylori. Meeting held at IARC, LYON, 7–14 june 1994 , 1995, International journal of cancer.

[41]  Schistosomes, liver flukes and Helicobacter pylori. IARC Working Group on the Evaluation of Carcinogenic Risks to Humans. Lyon, 7-14 June 1994. , 1994, IARC monographs on the evaluation of carcinogenic risks to humans.

[42]  R. Rappuoli,et al.  Molecular characterization of the 128-kDa immunodominant antigen of Helicobacter pylori associated with cytotoxicity and duodenal ulcer. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[43]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.