iFad: an integrative factor analysis model for drug-pathway association inference

MOTIVATION Pathway-based drug discovery considers the therapeutic effects of compounds in the global physiological environment. This approach has been gaining popularity in recent years because the target pathways and mechanism of action for many compounds are still unknown, and there are also some unexpected off-target effects. Therefore, the inference of drug-pathway associations is a crucial step to fully realize the potential of system-based pharmacological research. Transcriptome data offer valuable information on drug-pathway targets because the pathway activities may be reflected through gene expression levels. Hence, it is of great interest to jointly analyze the drug sensitivity and gene expression data from the same set of samples to investigate the gene-pathway-drug-pathway associations. RESULTS We have developed iFad, a Bayesian sparse factor analysis model to jointly analyze the paired gene expression and drug sensitivity datasets measured across the same panel of samples. The model enables direct incorporation of prior knowledge regarding gene-pathway and/or drug-pathway associations to aid the discovery of new association relationships. We use a collapsed Gibbs sampling algorithm for inference. Satisfactory performance of the proposed model was found for both simulated datasets and real data collected on the NCI-60 cell lines. Our results suggest that iFad is a promising approach for the identification of drug targets. This model also provides a general statistical framework for pathway-based integrative analysis of other types of -omics data. AVAILABILITY The R package 'iFad' and real NCI-60 dataset used are available at http://bioinformatics.med.yale.edu/group.

[1]  J. Mesirov,et al.  Chemosensitivity prediction by transcriptional profiling , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Michael M. Mysinger,et al.  Automated Docking Screens: A Feasibility Study , 2009, Journal of medicinal chemistry.

[3]  Feng Liu,et al.  The pharmacogenetics and pharmacogenomics knowledge base: accentuating the knowledge , 2007, Nucleic Acids Res..

[4]  Yu-Chun Lin,et al.  ' s response to reviews Title : Identifying Significant Genetic Regulatory Networks in the Prostate Cancer from Microarray Data Based on Transcription Factor Analysis and Conditional Independency , 2009 .

[5]  William C Reinhold,et al.  CellMiner: a relational database and query tool for the NCI-60 cancer cell lines , 2009, BMC Genomics.

[6]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[7]  R. Shoemaker The NCI60 human tumour cell line anticancer drug screen , 2006, Nature Reviews Cancer.

[8]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[9]  Tianwei Yu,et al.  Inference of transcriptional regulatory network by two-stage constrained space factor analysis , 2005, Bioinform..

[10]  Gerhard F. Ecker,et al.  Computational models for prediction of interactions with ABC-transporters. , 2008, Drug discovery today.

[11]  Marina Bibikova,et al.  Genomic profiling of 766 cancer‐related genes in archived esophageal normal and carcinoma tissues , 2008, International journal of cancer.

[12]  D. Haber,et al.  Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents , 2010, Nature Reviews Cancer.

[13]  Sven Bergmann,et al.  A modular approach for integrative analysis of large-scale gene-expression and drug-response data , 2008, Nature Biotechnology.

[14]  Lorenz Wernisch,et al.  Factor analysis for gene regulatory networks and transcription factor activity profiles , 2007, BMC Bioinformatics.

[15]  Gary D. Bader,et al.  Pathguide: a Pathway Resource List , 2005, Nucleic Acids Res..

[16]  Leslie A Kuhn,et al.  Side‐chain flexibility in protein–ligand binding: The minimal rotation hypothesis , 2005, Protein science : a publication of the Protein Society.

[17]  P. Aloy,et al.  Unveiling the role of network and systems biology in drug discovery. , 2010, Trends in pharmacological sciences.

[18]  Dana Pe'er,et al.  Harnessing gene expression to identify the genetic basis of drug resistance , 2009, Molecular systems biology.

[19]  R. Kishony,et al.  Functional classification of drugs by properties of their pairwise interactions , 2006, Nature Genetics.

[20]  S. Friend,et al.  A network view of disease and compound screening , 2009, Nature Reviews Drug Discovery.

[21]  Jae K. Lee,et al.  Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray study , 2007, Molecular Cancer Therapeutics.

[22]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[23]  Byoung-Tak Zhang,et al.  Bayesian Network Learning with Feature Abstraction for Gene-drug Dependency Analysis , 2005, J. Bioinform. Comput. Biol..

[24]  William C Reinhold,et al.  Integrating data on DNA copy number with gene expression levels and drug sensitivities in the NCI-60 cell line panel , 2006, Molecular Cancer Therapeutics.

[25]  J. Irwin,et al.  Docking and chemoinformatic screens for new ligands and targets. , 2009, Current opinion in biotechnology.

[26]  Florian Nigsch,et al.  Computational toxicology: an overview of the sources of data and of modelling methods , 2009, Expert opinion on drug metabolism & toxicology.

[27]  Christian von Mering,et al.  STITCH: interaction networks of chemicals and proteins , 2007, Nucleic Acids Res..

[28]  S. Lampel,et al.  The druggable genome: an update. , 2005, Drug discovery today.

[29]  Thomas C. Wiegers,et al.  Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical–gene–disease networks , 2008, Nucleic Acids Res..

[30]  Jan M. Kriegl,et al.  Computational approaches to predict drug metabolism , 2009, Expert opinion on drug metabolism & toxicology.

[31]  Richard Lugg,et al.  Mutation analysis of 24 known cancer genes in the NCI-60 cell line set , 2006, Molecular Cancer Therapeutics.

[32]  Yang Song,et al.  Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery , 2011, Nucleic Acids Res..

[33]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[34]  Corinna Blasse,et al.  CancerResource: a comprehensive database of cancer-relevant proteins and compound interactions supported by experimental knowledge , 2010, Nucleic Acids Res..

[35]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[36]  R. Glenny,et al.  Computational identification of key biological modules and transcription factors in acute lung injury. , 2006, American journal of respiratory and critical care medicine.

[37]  Yufei Huang,et al.  Bayesian non-negative factor analysis for reconstructing transcription factor mediated regulatory networks , 2011, Proteome Science.

[38]  A. Hopkins,et al.  The druggable genome , 2002, Nature Reviews Drug Discovery.

[39]  David L Mobley,et al.  Predicting ligand binding affinity with alchemical free energy methods in a polar model binding site. , 2009, Journal of molecular biology.