Finding genes discriminating smokers from non-smokers by applying a growing self-organizing clustering method to large airway epithelium cell microarray data.

BACKGROUND Cigarette smoking is the major risk factor for development of lung cancer. Identification of effects of tobacco on airway gene expression may provide insight into the causes. This research aimed to compare gene expression of large airway epithelium cells in normal smokers (n=13) and non-smokers (n=9) in order to find genes which discriminate the two groups and assess cigarette smoking effects on large airway epithelium cells. MATERIALS AND METHODS Genes discriminating smokers from non-smokers were identified by applying a neural network clustering method, growing self-organizing maps (GSOM), to microarray data according to class discrimination scores. An index was computed based on differentiation between each mean of gene expression in the two groups. This clustering approach provided the possibility of comparing thousands of genes simultaneously. RESULTS The applied approach compared the mean of 7,129 genes in smokers and non-smokers simultaneously and classified the genes of large airway epithelium cells which had differently expressed in smokers comparing with non-smokers. Seven genes were identified which had the highest different expression in smokers compared with the non-smokers group: NQO1, H19, ALDH3A1, AKR1C1, ABHD2, GPX2 and ADH7. Most (NQO1, ALDH3A1, AKR1C1, H19 and GPX2) are known to be clinically notable in lung cancer studies. Furthermore, statistical discriminate analysis showed that these genes could classify samples in smokers and non-smokers correctly with 100% accuracy. With the performed GSOM map, other nodes with high average discriminate scores included genes with alterations strongly related to the lung cancer such as AKR1C3, CYP1B1, UCHL1 and AKR1B10. CONCLUSIONS This clustering by comparing expression of thousands of genes at the same time revealed alteration in normal smokers. Most of the identified genes were strongly relevant to lung cancer in the existing literature. The genes may be utilized to identify smokers with increased risk for lung cancer. A large sample study is now recommended to determine relations between the genes ABHD2 and ADH7 and smoking.

[1]  Peng Guan,et al.  Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method , 2009, Journal of experimental & clinical cancer research : CR.

[2]  J. Herman,et al.  Promoter Hypermethylation of Resected Bronchial Margins , 2004, Clinical Cancer Research.

[3]  Michael Ruogu Zhang,et al.  Molecular characteristics of non-small cell lung cancer , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[4]  D. Schwartz,et al.  CYP1A1 and CYP1B1 polymorphisms and risk of lung cancer among never smokers: a population-based study. , 2005, Carcinogenesis.

[5]  Lakshmaiah Sreerama,et al.  ALDH1A1 and ALDH3A1 expression in lung cancers: correlation with histologic type and potential precursors. , 2008, Lung cancer.

[6]  Gang Liu,et al.  Effects of cigarette smoke on the human airway epithelial cell transcriptome. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[7]  A. Hochberg,et al.  The oncofetal H19 RNA connection: hypoxia, p53 and cancer. , 2010, Biochimica et biophysica acta.

[8]  Avrum Spira,et al.  Reversible and permanent effects of tobacco smoke exposure on airway epithelial gene expression , 2007, Genome Biology.

[9]  A. Heguy,et al.  Monoallelic up-regulation of the imprinted H19 gene in airway epithelium of phenotypically normal cigarette smokers. , 2003, Cancer research.

[10]  G. Muzio,et al.  Aldehyde dehydrogenases and cell proliferation. , 2012, Free radical biology & medicine.

[11]  Valerie L. Miller,et al.  Aldo-keto reductase family 1 member C3 (AKR1C3) is expressed in adenocarcinoma and squamous cell carcinoma but not small cell carcinoma. , 2012, International journal of clinical and experimental pathology.

[12]  Saman K. Halgamuge,et al.  Unsupervised Class Discovery and Feature Selection using an Improved Hierarchical Dynamic Self-Organizing Map , 2004 .

[13]  A. Jaiswal,et al.  NAD(P)H:quinone oxidoreductase 1 reduces the mutagenicity of DNA caused by NADPH:P450 reductase-activated metabolites of benzo(a)pyrene quinones. , 1998, British Journal of Cancer.

[14]  A. Hartmann,et al.  Smoking and cancer‐related gene expression in bronchial epithelium and non‐small‐cell lung cancers , 2006, The Journal of pathology.

[15]  Seungyoon Nam,et al.  Clinical validity of the lung cancer biomarkers identified by bioinformatics analysis of public expression data. , 2007, Cancer research.

[16]  Xiaosheng Hang,et al.  Current evidence on the relationship between CYP1B1 polymorphisms and lung cancer risk: a meta-analysis , 2011, Molecular Biology Reports.

[17]  Saman K. Halgamuge,et al.  An unsupervised hierarchical dynamic self-organizing approach to cancer class discovery and marker gene identification in microarray data , 2003, Bioinform..

[18]  A. Jemal,et al.  Global Cancer Statistics , 2011 .

[19]  Saman K. Halgamuge,et al.  Enhancement of topology preservation and hierarchical dynamic self-organising maps for data visualisation , 2003, Int. J. Approx. Reason..

[20]  H. McLeod,et al.  The NQO1*2/*2 polymorphism is associated with poor overall survival in patients following resection of stages II and IIIa non-small cell lung cancer. , 2011, Oncology reports.

[21]  W. Niu,et al.  Lack of Association between NADPH Quinone Oxidoreductase 1 (NQO1) Gene C609T Polymorphism and Lung Cancer: A Case-Control Study and a Meta-Analysis , 2012, PloS one.

[22]  Zuo-Feng Zhang,et al.  NAD(P)H:Quinone Oxidoreductase 1 (NQO1) Pro187Ser Polymorphism and the Risk of Lung, Bladder, and Colorectal Cancers: a Meta-analysis , 2006, Cancer Epidemiology Biomarkers & Prevention.

[23]  E. Scott,et al.  Aldehyde dehydrogenase activity as a functional marker for lung cancer. , 2009, Chemico-biological interactions.

[24]  Trevor M Penning,et al.  AKR1B10: A New Diagnostic Marker of Non–Small Cell Lung Carcinoma in Smokers , 2005, Clinical Cancer Research.

[25]  Laura A. Sullivan,et al.  Aldehyde dehydrogenase activity selects for lung adenocarcinoma stem cells dependent on notch signaling. , 2010, Cancer research.

[26]  Y. Nakanishi,et al.  NQO1, MPO, and the risk of lung cancer: A HuGE review , 2005, Genetics in Medicine.

[27]  A. Heguy,et al.  Up-regulation of expression of the ubiquitin carboxyl-terminal hydrolase L1 gene in human airway epithelium of cigarette smokers. , 2006, Cancer research.

[28]  Yi Jin,et al.  Aldo-keto reductases and bioactivation/detoxication. , 2007, Annual review of pharmacology and toxicology.

[29]  Molecular Damage in the Bronchial Epithelium of Current and Former Smokers , 1997 .

[30]  J. Hurst-Kennedy,et al.  Ubiquitin C-Terminal Hydrolase L1 in Tumorigenesis , 2012, Biochemistry research international.

[31]  Vasilis Vasiliou,et al.  Analysis and update of the human aldehyde dehydrogenase (ALDH) gene family , 2005, Human Genomics.

[32]  A. Hochberg,et al.  The H19 Non-Coding RNA Is Essential for Human Tumor Growth , 2007, PloS one.

[33]  Yun-Chul Hong,et al.  Influence of NQO1, ALDH2, and CYP2E1 genetic polymorphisms, smoking, and alcohol drinking on the risk of lung cancer in Koreans , 2009, Cancer Causes & Control.

[34]  Guangji Wang,et al.  An NQO1-Initiated and p53-Independent Apoptotic Pathway Determines the Anti-Tumor Effect of Tanshinone IIA against Non-Small Cell Lung Cancer , 2012, PloS one.

[35]  C. Powell,et al.  Loss of heterozygosity in epithelial cells obtained by bronchial brushing: clinical utility in lung cancer. , 1999, Clinical cancer research : an official journal of the American Association for Cancer Research.

[36]  S. Land,et al.  Tobacco and estrogen metabolic polymorphisms and risk of non-small cell lung cancer in women. , 2009, Carcinogenesis.

[37]  J. Seagrave,et al.  Effects of 10 cigarette smoke condensates on primary human airway epithelial cells by comparative gene and cytokine expression studies. , 2010, Toxicological sciences : an official journal of the Society of Toxicology.

[38]  S. Davis,et al.  Epigenomic alterations and gene expression profiles in respiratory epithelia exposed to cigarette smoke condensate , 2010, Oncogene.

[39]  A. Hochberg,et al.  Development of targeted therapy for a broad spectrum of cancers (pancreatic cancer, ovarian cancer, glioblastoma and HCC) mediated by a double promoter plasmid expressing diphtheria toxin under the control of H19 and IGF2-P4 regulatory sequences. , 2012, International journal of clinical and experimental medicine.

[40]  H. Seitz,et al.  Alcohol Metabolism and Cancer Risk , 2007, Alcohol research & health : the journal of the National Institute on Alcohol Abuse and Alcoholism.

[41]  Y. Miller,et al.  Widely dispersed p53 mutation in respiratory epithelium. A novel mechanism for field carcinogenesis. , 1997, The Journal of clinical investigation.

[42]  Bala Srinivasan,et al.  Dynamic self-organizing maps with controlled growth for knowledge discovery , 2000, IEEE Trans. Neural Networks Learn. Syst..

[43]  Kuang-Tai Kuo,et al.  Reversal of inflammation‐associated dihydrodiol dehydrogenases (AKR1C1 and AKR1C2) overexpression and drug resistance in nonsmall cell lung cancer cells by wogonin and chrysin , 2007, International journal of cancer.

[44]  S. Goodman,et al.  PGP9.5 as a candidate tumor marker for non-small-cell lung cancer. , 1999, The American journal of pathology.

[45]  A. Bosio,et al.  Gene expression profiling in respiratory tissues from rats exposed to mainstream cigarette smoke. , 2003, Carcinogenesis.

[46]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[47]  L. Tanoue Airway epithelial gene expression in the diagnostic evaluation of smokers with suspect lung cancer , 2009 .

[48]  G. Getz,et al.  Coupled two-way clustering analysis of gene microarray data. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[49]  N. Pedersen,et al.  Interaction between smoking and genetic factors in the development of chronic bronchitis. , 2008, American journal of respiratory and critical care medicine.

[50]  Jenny Chang-Claude,et al.  Genetic polymorphisms of MPO, GSTT1, GSTM1, GSTP1, EPHX1 and NQO1 as risk factors of early‐onset lung cancer , 2010, International journal of cancer.

[51]  Hui-Ping Zhao,et al.  A single nucleotide polymorphism in the alcohol dehydrogenase 7 gene (alanine to glycine substitution at amino acid 92) is associated with the risk of squamous cell carcinoma of the head and neck , 2010, Cancer.

[52]  Nadarajah Vigneswaran,et al.  Cigarette smoke condensate induces cytochromes P450 and aldo-keto reductases in oral cancer cells. , 2006, Toxicology letters.

[53]  R. Brigelius-Flohé,et al.  The Yin and Yang of Nrf2-Regulated Selenoproteins in Carcinogenesis , 2012, International journal of cell biology.

[54]  D. Nebert Analysis and update of the human aldehyde dehydrogenase ( ALDH )g ene , 2005 .

[55]  R. Yantiss,et al.  Effects of Cigarette Smoke on the Human Oral Mucosal Transcriptome , 2010, Cancer Prevention Research.

[56]  K. Yamamura,et al.  Elevated mature macrophage expression of human ABHD2 gene in vulnerable plaque. , 2008, Biochemical and biophysical research communications.

[57]  W. M. Brown,et al.  Potential prognostic marker ubiquitin carboxyl-terminal hydrolase-L1 does not predict patient survival in non-small cell lung carcinoma , 2011, Journal of experimental & clinical cancer research : CR.

[58]  松岡 由香 Telomerase expression in noncancerous bronchial epithelia is a possible marker of early development of lung cancer , 2007 .

[59]  M. Spitz,et al.  An association between a NQO1 genetic polymorphism and risk of lung cancer. , 2005, Mutation research.