A modular approach for integrative analysis of large-scale gene-expression and drug-response data

High-throughput technologies are now used to generate more than one type of data from the same biological samples. To integrate such data properly, we propose using co-modules, which describe coherent patterns across paired data sets, and develop several modular methods for identifying them. We first test these methods on in silico data, demonstrating that the integrative scheme of our Ping-Pong Algorithm uncovers drug-gene associations more accurately when the data are noisy or complex. Second, we provide an extensive comparative study using gene-expression and drug-response data from the NCI-60 cell lines. Using information from the DrugBank and Connectivity Map databases, we show that the Ping-Pong Algorithm predicts drug-gene associations significantly better than other methods. Co-modules provide insights into possible mechanisms of action for a wide range of drugs and suggest new targets for therapy.
