Concurrent analysis of copy number variation and gene expression: Application in paired non-smoking female lung cancer patients

This study developed a method to identify disease-correlated pathways by integrating copy numbers (CN) and gene expression (GE). To evaluate the correlation between CN and GE, a suitable window size was assessed by simulation. Gene Set Enrichment Analysis (GSEA) was utilized to identify the possible pathways by CN, GE, and their correlations, respectively. Each of those enriched pathways was further assigned a score to incorporate the information from CN, GE, and their correlations. A dataset of 44 female nonsmoking lung cancer patients with both normal and tumor tissues was used to evaluate the performance of this method. To further appraise the predicting abilities of those pathways, patients were classified by support vector machines using the pathways identified by only copy number, only gene expression and incorporating CN, GE, and their correlations. The results showed that the proposed method earned higher accuracy, sensitivity and specificity than traditional methods.

[1]  Pei Wang,et al.  Integrative analysis of DNA copy number and gene expression in metastatic oral squamous cell carcinoma identifies genes associated with poor survival , 2010, Molecular Cancer.

[2]  A. Reymond,et al.  Copy number variants, diseases and gene expression. , 2009, Human molecular genetics.

[3]  M. A. Wiel,et al.  Nonparametric testing for DNA copy number induced differential mRNA gene expression. , 2009 .

[4]  M. Bredel,et al.  Integrated high-resolution genome-wide analysis of gene dosage and gene expression in human brain tumors. , 2007, Methods in molecular biology.

[5]  Roberta Spinelli,et al.  A computational procedure to identify significant overlap of differentially expressed and genomic imbalanced regions in cancer datasets† , 2009, Nucleic acids research.

[6]  Ding‐Shinn Chen,et al.  Genetic characterization of fas-associated phosphatase-1 as a putative tumor suppressor gene on chromosome 4q21.3 in hepatocellular carcinoma. , 2006, Clinical cancer research : an official journal of the American Association for Cancer Research.

[7]  Renée X. de Menezes,et al.  Integrated analysis of DNA copy number and gene expression microarray data using gene sets , 2009, BMC Bioinformatics.

[8]  Yuri Kotliarov,et al.  Correlation analysis between single-nucleotide polymorphism and expression arrays in gliomas identifies potentially relevant target genes. , 2009, Cancer research.

[9]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Wen-Hui Su,et al.  OncoDB.HCC: an integrated oncogenomic database of hepatocellular carcinoma revealed aberrant cancer target genes and loci , 2006, Nucleic Acids Res..

[11]  M. Pellegrini,et al.  Copy number variation influences gene expression and metabolic traits in mice. , 2009, Human molecular genetics.

[12]  Tom Freeman,et al.  Changes in cervical keratinocyte gene expression associated with integration of human papillomavirus 16. , 2002, Cancer research.

[13]  R. Redon,et al.  Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes , 2007, Science.

[14]  Ann F. Chambers,et al.  Multi-Platform Whole-Genome Microarray Analyses Refine the Epigenetic Signature of Breast Cancer Metastasis with Gene Expression and Copy Number , 2010, PloS one.

[15]  E. Petretto,et al.  Gene copy number variation and common human disease , 2010, Clinical genetics.

[16]  M. Schmidt,et al.  The impact of genomic alterations on the transcriptome: a prostate cancer cell line case study , 2006, Chromosome Research.

[17]  Louise V Wain,et al.  Genomic copy number variation, human health, and disease , 2009, The Lancet.

[18]  Dejan Juric,et al.  Functional network analysis reveals extended gliomagenesis pathway maps and three novel MYC-interacting genes in human gliomas. , 2005, Cancer research.

[19]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[20]  Martin Schäfer,et al.  Integrated analysis of copy number alterations and gene expression: a bivariate assessment of equally directed abnormalities , 2009, Bioinform..

[21]  Peter J. Park,et al.  Integrative analysis reveals the direct and indirect interactions between DNA copy number aberrations and gene expression changes , 2008, Bioinform..

[22]  David B Allison,et al.  Expectations, validity, and reality in gene expression profiling. , 2010, Journal of clinical epidemiology.