FacPad: Bayesian sparse factor modeling for the inference of pathways responsive to drug treatment

MOTIVATION It is well recognized that the effects of drugs are far beyond targeting individual proteins, but rather influencing the complex interactions among many relevant biological pathways. Genome-wide expression profiling before and after drug treatment has become a powerful approach for capturing a global snapshot of cellular response to drugs, as well as to understand drugs' mechanism of action. Therefore, it is of great interest to analyze this type of transcriptomic profiling data for the identification of pathways responsive to different drugs. However, few computational tools exist for this task. RESULTS We have developed FacPad, a Bayesian sparse factor model, for the inference of pathways responsive to drug treatments. This model represents biological pathways as latent factors and aims to describe the variation among drug-induced gene expression alternations in terms of a much smaller number of latent factors. We applied this model to the Connectivity Map data set (build 02) and demonstrated that FacPad is able to identify many drug-pathway associations, some of which have been validated in the literature. Although this method was originally designed for the analysis of drug-induced transcriptional alternation data, it can be naturally applied to many other settings beyond polypharmacology. AVAILABILITY AND IMPLEMENTATION The R package 'FacPad' is publically available at: http://cran.open-source-solution.org/web/packages/FacPad/.

[1]  Justin Lamb,et al.  The Connectivity Map: a new tool for biomedical research , 2007, Nature Reviews Cancer.

[2]  Sarah L. Kinnings,et al.  Novel computational approaches to polypharmacology as a means to define responses to individual drugs. , 2012, Annual review of pharmacology and toxicology.

[3]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[4]  R. Shoemaker The NCI60 human tumour cell line anticancer drug screen , 2006, Nature Reviews Cancer.

[5]  R. Iyengar,et al.  Systems pharmacology: network analysis to identify multiscale mechanisms of drug action. , 2012, Annual review of pharmacology and toxicology.

[6]  Matthew West,et al.  Bayesian factor regression models in the''large p , 2003 .

[7]  Gary D. Bader,et al.  Pathguide: a Pathway Resource List , 2005, Nucleic Acids Res..

[8]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[9]  E. Berti,et al.  Mometasone furoate decreases adhesion molecule expression in psoriasis. , 1998, European journal of dermatology : EJD.

[10]  Y. Fujii,et al.  Effects of mometasone furoate on a rat allergic rhinitis model. , 2005, European journal of pharmacology.

[11]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Chiara Sabatti,et al.  Bayesian sparse hidden components analysis for transcription regulation networks , 2005, Bioinform..

[13]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[14]  John A. Tallarico,et al.  Integrating high-content screening and ligand-target prediction to identify mechanism of action. , 2008, Nature chemical biology.

[15]  Hongyu Zhao,et al.  iFad: an integrative factor analysis model for drug-pathway association inference , 2012, Bioinform..

[16]  Terence P. Speed,et al.  Quality Assessment for Short Oligonucleotide Microarray Data , 2007, Technometrics.

[17]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[18]  Lorenz Wernisch,et al.  Factor analysis for gene regulatory networks and transcription factor activity profiles , 2007, BMC Bioinformatics.

[19]  Yi-Cheng Zhang,et al.  Solving the apparent diversity-accuracy dilemma of recommender systems , 2008, Proceedings of the National Academy of Sciences.

[20]  M. Lane,et al.  Role of retinoid signalling in the adult brain , 2005, Progress in Neurobiology.

[21]  K. Chou,et al.  Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features , 2010, PloS one.

[22]  Peer Bork,et al.  Drug-Induced Regulation of Target Expression , 2010, PLoS Comput. Biol..

[23]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database: update 2011 , 2010, Nucleic Acids Res..

[24]  William Stafford Noble,et al.  Large-scale prediction of protein-protein interactions from structures , 2010, BMC Bioinformatics.

[25]  Hiroki Kobayashi,et al.  Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening , 2009, PLoS Comput. Biol..

[26]  Shu-Dong Zhang,et al.  Application of connectivity mapping in predictive toxicology based on gene-expression similarity. , 2010, Toxicology.

[27]  Kally C. O'Reilly,et al.  Chronic Administration of 13-Cis-Retinoic Acid Increases Depression-Related Behavior in Mice , 2006, Neuropsychopharmacology.

[28]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[29]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[30]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[31]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[32]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[33]  C A Ross,et al.  Decreased expression of striatal signaling genes in a mouse model of Huntington's disease. , 2000, Human molecular genetics.

[34]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[35]  Diego di Bernardo,et al.  Identifying Network of Drug Mode of Action by Gene Expression Profiling , 2009, J. Comput. Biol..

[36]  Eugenio Uriarte,et al.  Alignment-free prediction of a drug-target complex network based on parameters of drug connectivity and protein sequence of receptors. , 2009, Molecular pharmaceutics.

[37]  R. Tagliaferri,et al.  Discovery of drug mode of action and drug repositioning from transcriptional responses , 2010, Proceedings of the National Academy of Sciences.

[38]  M. Kanehisa,et al.  The KEGG databases and tools facilitating omics analysis: latest developments involving human diseases and pharmaceuticals. , 2012, Methods in molecular biology.

[39]  Magnus Rattray,et al.  Inference algorithms and learning theory for Bayesian sparse factor analysis , 2009 .