Identification of differentially expressed genes through a meta-analysis approach for oral cancer classification

We propose a meta-analysis scheme for identifying differentially expressed genes in Oral Squamous Cell Carcinoma (OSCC) from different microarray studies. We detect a subset of relevant features and further classify samples under two experimental conditions (i.e healthy and cancer samples) for better patient stratification. A well-established meta-analysis method is adopted and gene expression data sets are derived from a public functional genomics data repository. Our primary aim is the accurate identification of up- and down-regulated genes in order to extract valuable biological information concerning the changes in expression between healthy and cancer samples. According to our results and the extracted informative gene list, a high classification accuracy of healthy and OSCC tumors is achieved with as few genes as possible. Furthermore, the proposed scheme implies that the combination of datasets from different origins may reduce the estimated percentage of false predictions, while the power of gene identification and disease classification is increased.

[1]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[2]  M. Nicolau,et al.  Head and neck cancer subtypes with biological and clinical relevance: Meta-analysis of gene-expression data , 2015, Oncotarget.

[3]  Safar Farajnia,et al.  Recent advances in head and neck squamous cell carcinoma--a review. , 2014, Clinical biochemistry.

[4]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[5]  M. Roizen,et al.  Hallmarks of Cancer: The Next Generation , 2012 .

[6]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[7]  Joanna Polanska,et al.  Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition , 2013, BMC Bioinformatics.

[8]  Qing Zeng-Treitler,et al.  Predicting sample size required for classification performance , 2012, BMC Medical Informatics and Decision Making.

[9]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[10]  Alioune Ngom,et al.  Identifying Informative Genes for Prediction of Breast Cancer Subtypes , 2013, PRIB.

[11]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[12]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[13]  Kuldip K. Paliwal,et al.  Cancer classification by gradient LDA technique using microarray gene expression data , 2008, Data Knowl. Eng..

[14]  Rainer Breitling,et al.  Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments , 2004, FEBS letters.

[15]  Hadis Kord,et al.  Importance of Microarray Technology and its Applications , 2016 .

[16]  Kuldip K. Paliwal,et al.  A Gene Selection Algorithm using Bayesian Classification Approach , 2012 .

[17]  Mohamed F. Ghalwash,et al.  Structured feature selection using coordinate descent optimization , 2016, BMC Bioinformatics.

[18]  P. Lin,et al.  Epigenetic regulation of the X‐linked tumour suppressors BEX1 and LDOC1 in oral squamous cell carcinoma , 2013, The Journal of pathology.

[19]  Tatsuhiko Tsunoda,et al.  Gene masking - a technique to improve accuracy for cancer classification with high dimensionality in microarray data , 2016, BMC Medical Genomics.

[20]  Hiroshi Mamitsuka,et al.  Selecting features in microarray classification using ROC curves , 2006, Pattern Recognit..

[21]  Kuldip K. Paliwal,et al.  A feature selection method using improved regularized linear discriminant analysis , 2014, Machine Vision and Applications.

[22]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Lue Ping Zhao,et al.  Gene Expression Profiling Identifies Genes Predictive of Oral Squamous Cell Carcinoma , 2008, Cancer Epidemiology Biomarkers & Prevention.

[24]  M. Heller DNA microarray technology: devices, systems, and applications. , 2002, Annual review of biomedical engineering.

[25]  Sushmita Mitra,et al.  Feature Selection and Clustering of Gene Expression Profiles Using Biological Knowledge , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).