Assessing quality and completeness of human transcriptional regulatory pathways on a genome-wide scale

BackgroundPathway databases are becoming increasingly important and almost omnipresent in most types of biological and translational research. However, little is known about the quality and completeness of pathways stored in these databases. The present study conducts a comprehensive assessment of transcriptional regulatory pathways in humans for seven well-studied transcription factors: MYC, NOTCH1, BCL6, TP53, AR, STAT1, and RELA. The employed benchmarking methodology first involves integrating genome-wide binding with functional gene expression data to derive direct targets of transcription factors. Then the lists of experimentally obtained direct targets are compared with relevant lists of transcriptional targets from 10 commonly used pathway databases.ResultsThe results of this study show that for the majority of pathway databases, the overlap between experimentally obtained target genes and targets reported in transcriptional regulatory pathway databases is surprisingly small and often is not statistically significant. The only exception is MetaCore pathway database which yields statistically significant intersection with experimental results in 84% cases. Additionally, we suggest that the lists of experimentally derived direct targets obtained in this study can be used to reveal new biological insight in transcriptional regulation and suggest novel putative therapeutic targets in cancer.ConclusionsOur study opens a debate on validity of using many popular pathway databases to obtain transcriptional regulatory targets. We conclude that the choice of pathway databases should be informed by solid scientific evidence and rigorous empirical evaluation.ReviewersThis article was reviewed by Prof. Wing Hung Wong, Dr. Thiago Motta Venancio (nominated by Dr. L Aravind), and Prof. Geoff J McLachlan.

[1]  A. Statnikov,et al.  The Notch/Hes1 pathway sustains NF-κB activation through CYLD repression in T cell leukemia. , 2010, Cancer cell.

[2]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[3]  Sean Ekins,et al.  Pathway mapping tools for analysis of high content data. , 2007, Methods in molecular biology.

[4]  Allen D. Delaney,et al.  Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing , 2007, Nature Methods.

[5]  M. Khoury,et al.  Most Published Research Findings Are False—But a Little Replication Goes a Long Way , 2007, PLoS medicine.

[6]  T. Werner Bioinformatics applications for pathway analysis of microarray data. , 2008, Current opinion in biotechnology.

[7]  Andrea Califano,et al.  Integrated biochemical and computational approach identifies BCL6 direct target genes controlling multiple pathways in normal germinal center B cells. , 2008, Blood.

[8]  K. Pienta,et al.  A hierarchical network of transcription factors governs androgen receptor-dependent prostate cancer growth. , 2007, Molecular cell.

[9]  T. Schlange,et al.  Novel c‐MYC target genes mediate differential effects on cell proliferation and migration , 2007, EMBO reports.

[10]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[11]  Adam A. Margolin,et al.  NOTCH1 directly regulates c-MYC and activates a feed-forward-loop transcriptional network promoting leukemic cell growth , 2006, Proceedings of the National Academy of Sciences.

[12]  Andrea Califano,et al.  ChIP-on-chip significance analysis reveals large-scale binding and regulation by human transcription factor oncogenes , 2007, Proceedings of the National Academy of Sciences.

[13]  Malay Mandal,et al.  Targeting the NF-κB signaling pathway in Notch1-induced T-cell leukemia , 2007, Nature Medicine.

[14]  Z. Weng,et al.  A Global Map of p53 Transcription-Factor Binding Sites in the Human Genome , 2006, Cell.

[15]  G. Inghirami,et al.  Down-regulation of LFA-1 adhesion receptors by C-myc oncogene in human B lymphoblastoid cells. , 1990, Science.

[16]  Steven J. M. Jones,et al.  Genome-wide identification of DNA-protein interactions using chromatin immunoprecipitation coupled with flow cell sequencing. , 2009, The Journal of endocrinology.

[17]  Malay Mandal,et al.  Targeting the NF-kappaB signaling pathway in Notch1-induced T-cell leukemia. , 2007, Nature medicine.

[18]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[19]  E. Raetz,et al.  Molecular pathogenesis of T-cell leukaemia and lymphoma , 2008, Nature Reviews Immunology.

[20]  R. Weichselbaum,et al.  STAT1-dependent expression of energy metabolic pathways links tumour growth and radioresistance to the Warburg effect , 2009, BMC medicine.

[21]  J. Gearhart,et al.  Pluripotency redux--advances in stem-cell research. , 2007, The New England journal of medicine.

[22]  Alexander R. Pico,et al.  The public road to high-quality curated biological pathways. , 2008, Drug discovery today.

[23]  R. Eisenman,et al.  Myc target genes. , 1997, Trends in biochemical sciences.

[24]  Kenny Q. Ye,et al.  The BCL6 transcriptional program features repression of multiple oncogenes in primary B cells and is deregulated in DLBCL. , 2009, Blood.

[25]  Z. Szallasi,et al.  Reliability and reproducibility issues in DNA microarray measurements. , 2006, Trends in genetics : TIG.

[26]  Sue Povey,et al.  The HGNC Database in 2008: a resource for the human genome , 2007, Nucleic Acids Res..

[27]  D. W. Knowles,et al.  Transcription Factors Bind Thousands of Active and Inactive Regions in the Drosophila Blastoderm , 2008, PLoS biology.

[28]  Aaron N. Chang,et al.  Identification of SULF2 as a novel transcriptional target of p53 by use of integrated genomic analyses. , 2009, Cancer research.

[29]  A. Ferrando,et al.  Requirement for cyclin D3 in lymphocyte development and T cell leukemias. , 2003, Cancer cell.

[30]  Clifford A. Meyer,et al.  Androgen Receptor Regulates a Distinct Transcription Program in Androgen-Independent Prostate Cancer , 2009, Cell.

[31]  M. Gerstein,et al.  Variation in Transcription Factor Binding Among Humans , 2010, Science.

[32]  P. Farnham,et al.  Genomic Approaches That Aid in the Identification of Transcription Factor Target Genes , 2004, Experimental biology and medicine.

[33]  Richard S Larson,et al.  Interconnecting molecular pathways in the pathogenesis and drug sensitivity of T-cell acute lymphoblastic leukemia. , 2010, Blood.

[34]  R. Versteeg,et al.  c‐myc down‐regulates class I HLA expression in human melanomas. , 1988, The EMBO journal.

[35]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[36]  Michael Y. Galperin,et al.  The 2010 Nucleic Acids Research Database Issue and online Database Collection: a community of data resources , 2009, Nucleic Acids Res..