Identification and functional assessment of novel gene sets towards better understanding of dysplasia associated oral carcinogenesis

Abstract Oral epithelial dysplasia (OED) often precedes oral cancer. Understanding the underlying complex biological aspects of dysplasia associated oral carcinogenesis using important gene sets is thus important. Computation assisted gene set identification through different feature ranking and visualization techniques was therefore attempted in this study. Result suggested that, weighted support vector machine (SVM) could be useful for feature ranking and SVM for attribute selection. Alteration in keratinization, cell–cell communication and peptidase activity was the major affected phenomena, while extracellular matrix dynamics was also found to be hampered. During best gene subset identification, set of six genes could classify normal (NOM) and oral squamous cell carcinoma (OSCC) conditions and two sets comprising four genes in each could classify NOM and dysplastic (DYS) conditions with 100% sensitivity and specificity. A gene set, comprising 32 genes showed best efficacy of 94.12% sensitivity, 99.40% specificity and 98.89% accuracy during classification of DYS and OSCC.

[1]  L. Liotta,et al.  Metastatic potential correlates with enzymatic degradation of basement membrane collagen , 1980, Nature.

[2]  Rosane Minghim,et al.  InteractiVenn: a web-based tool for the analysis of sets through Venn diagrams , 2015, BMC Bioinformatics.

[3]  R. Sanz-Pamplona,et al.  A 5-gene classifier from the carcinoma-associated fibroblast transcriptomic profile and clinical outcome in colorectal cancer , 2014, Oncotarget.

[4]  I. Cha,et al.  Combined genomic expressions as a diagnostic factor for oral squamous cell carcinoma. , 2014, Genomics.

[5]  J. Chatterjee,et al.  Fourier-transform-infrared-spectroscopy based spectral-biomarker selection towards optimum diagnostic differentiation of oral leukoplakia and cancer , 2015, Analytical and Bioanalytical Chemistry.

[6]  A. Ray,et al.  Computer‐aided molecular pathology interpretation in exploring prospective markers for oral submucous fibrosis progression , 2016, Head & neck.

[7]  P Manimaran,et al.  The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach. , 2014, Molecular bioSystems.

[8]  Avi Ma'ayan,et al.  Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool , 2013, BMC Bioinformatics.

[9]  J. Gutkind,et al.  Dysregulated molecular networks in head and neck carcinogenesis. , 2009, Oral oncology.

[10]  Stephen T. C. Wong,et al.  A gene signature based method for identifying subtypes and subtype-specific drivers in cancer with an application to medulloblastoma , 2012, 2012 IEEE 2nd International Conference on Computational Advances in Bio and medical Sciences (ICCABS).

[11]  P. V. van Diest,et al.  Comparative molecular and histological grading of epithelial dysplasia of the oral cavity and the oropharynx , 2003, The Journal of pathology.

[12]  Z. Werb,et al.  Extracellular matrix degradation and remodeling in development and disease. , 2011, Cold Spring Harbor perspectives in biology.

[13]  Lue Ping Zhao,et al.  Gene Expression Profiling Identifies Genes Predictive of Oral Squamous Cell Carcinoma , 2008, Cancer Epidemiology Biomarkers & Prevention.

[14]  K. Lê Cao,et al.  Malignant transformation of oral epithelial dysplasia: a real-world evaluation of histopathologic grading. , 2014, Oral surgery, oral medicine, oral pathology and oral radiology.

[15]  Youping Deng,et al.  Gene selection and classification for cancer microarray data based on machine learning and similarity measures , 2011, BMC Genomics.

[16]  Kuang-Chi Lai,et al.  Early induction of cytokines/cytokine receptors and Cox2, and activation of NF-κB in 4-nitroquinoline 1-oxide-induced murine oral cancer model. , 2012, Toxicology and applied pharmacology.

[17]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[18]  G. Pitiyage,et al.  Molecular markers in oral epithelial dysplasia: review. , 2009, Journal of oral pathology & medicine : official publication of the International Association of Oral Pathologists and the American Academy of Oral Pathology.

[19]  Hong-Wen Deng,et al.  Gene selection for classification of microarray data based on the Bayes error , 2007, BMC Bioinformatics.

[20]  Olga Stepánková,et al.  RadViz and Identification of Clusters in Multidimensional Data , 2009, 2009 13th International Conference Information Visualisation.

[21]  Blaz Zupan,et al.  Orange: From Experimental Machine Learning to Interactive Data Mining , 2004, PKDD.

[22]  Tsviya Olender,et al.  GeneCards Version 3: the human gene integrator , 2010, Database J. Biol. Databases Curation.

[23]  Fillia Makedon,et al.  Application of Relief-F feature filtering algorithm to selecting informative genes for cancer classification using microarray data , 2004 .

[24]  C. Tanyel,et al.  The role of components of the extracellular matrix and inflammation on oral squamous cell carcinoma metastasis. , 2014, Archives of oral biology.

[25]  W. Loewenstein,et al.  Intercellular Communication and the Control of Tissue Growth: Lack of Communication between Cancer Cells , 1966, Nature.

[26]  Shengwen Liu,et al.  Gene expression profiling of craniofacial fibrous dysplasia reveals ADAMTS2 overexpression as a potential marker. , 2014, International journal of clinical and experimental pathology.

[27]  J. Chatterjee,et al.  Molecular Pathology Signatures in Predicting Malignant Potentiality of Dysplastic Oral Pre-cancers , 2015, Springer Science Reviews.

[28]  Mustapha Kandouz,et al.  Intercellular Communication in Cancer , 2015, Springer Netherlands.

[29]  Li Mao,et al.  Transcriptomic dissection of tongue squamous cell carcinoma , 2008, BMC Genomics.

[30]  K. Saito,et al.  Network-based analysis of calcium-binding protein genes identifies Grp94 as a target in human oral carcinogenesis , 2007, British Journal of Cancer.

[31]  K. Powell,et al.  Regulation of procollagen amino-propeptide processing during mouse embryogenesis by specialization of homologous ADAMTS proteases: insights on collagen biosynthesis and dermatosparaxis , 2006, Development.

[32]  B. Nielsen,et al.  Intracellular collagen degradation mediated by uPARAP/Endo180 is a major pathway of extracellular matrix turnover during malignancy , 2005, The Journal of cell biology.

[33]  K. Müller,et al.  Extracellular matrix in preneoplastic lesions and early cancer of the lung. , 1990, Pathology, research and practice.

[34]  Hans A. Kestler,et al.  Generalized Venn diagrams: a new method of visualizing complex genetic set relations , 2005, Bioinform..

[35]  Blaz Zupan,et al.  Conquering the Curse of Dimensionality in Gene Expression Cancer Diagnosis: Tough Problem, Simple Models , 2005, AIME.

[36]  Hong Yan,et al.  Biomarker Identification and Cancer Classification Based on Microarray Data Using Laplace Naive Bayes Model with Mean Shrinkage , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[37]  H. Järveläinen,et al.  Extracellular matrix macromolecules: potential tools and targets in cancer gene therapy , 2014, Molecular and Cellular Therapies.