Transcriptional landscape of cell lines and their tissues of origin

Cell lines are an indispensable tool in biomedical research and often used as surrogates for tissues. An important question is how well a cell line’s transcriptional and regulatory processes reflect those of its tissue of origin. We analyzed RNA-Seq data from GTEx for 127 paired Epstein-Barr virus transformed lymphoblastoid cell lines and whole blood samples; and 244 paired fibroblast cell lines and skin biopsies. A combination of gene expression and network analyses shows that while cell lines carry the expression signatures of their primary tissues, albeit at reduced levels, they also exhibit changes in their patterns of transcription factor regulation. Cell cycle genes are over-expressed in cell lines compared to primary tissue, and they have a reduction of repressive transcription factor targeting. Our results provide insight into the expression and regulatory alterations observed in cell lines and suggest that these changes should be considered when using cell lines as models. Highlights Cell lines differ from their source tissues in gene expression and regulation Distinct cell lines share altered patterns of cell cycle regulation Cell cycle genes are less strongly targeted by repressive TFs in cell lines Cell lines share expression with their source tissue, but at reduced levels

[1]  John Quackenbush,et al.  Tissue-aware RNA-Seq processing and normalization for heterogeneous and sparse data , 2016, BMC Bioinformatics.

[2]  Shimon Sakaguchi,et al.  Immuno-Navigator, a batch-corrected coexpression database, reveals cell type-specific gene networks in the immune system , 2016, Proceedings of the National Academy of Sciences.

[3]  Minoru Kanehisa,et al.  KEGG as a reference resource for gene and protein annotation , 2015, Nucleic Acids Res..

[4]  Benjamin Haibe-Kains,et al.  A network model for angiogenesis in ovarian cancer , 2015, BMC Bioinformatics.

[5]  J. Azizkhan-Clifford,et al.  Sp1 and the ‘hallmarks of cancer’ , 2015, The FEBS journal.

[6]  Hailong Zhu,et al.  Predicting protein functions using incomplete hierarchical labels , 2015, BMC Bioinformatics.

[7]  Kate B. Cook,et al.  Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity , 2014, Cell.

[8]  G. Melino,et al.  How the TP53 Family Proteins TP63 and TP73 Contribute to Tumorigenesis: Regulators and Effectors , 2014, Human mutation.

[9]  M. Aladjem,et al.  Inactivation of Rb and E2f8 Synergizes To Trigger Stressed DNA Replication during Erythroid Terminal Differentiation , 2014, Molecular and Cellular Biology.

[10]  Javier Rodríguez-Ubreva,et al.  Epstein–Barr virus-mediated transformation of B cells induces global chromatin changes independent to the acquisition of proliferation , 2013, Nucleic acids research.

[11]  Charity W. Law,et al.  voom: precision weights unlock linear model analysis tools for RNA-seq read counts , 2014, Genome Biology.

[12]  C. Huttenhower,et al.  Passing Messages between Biological Networks to Refine Predicted Interactions , 2013, PloS one.

[13]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[14]  Fatima Al-Shahrour,et al.  The B cell transcription program mediates hypomethylation and overexpression of key genes in Epstein-Barr virus-associated proliferative conversion , 2013, Genome Biology.

[15]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[16]  Shane J. Neph,et al.  Circuitry and Dynamics of Human Transcription Factor Regulatory Networks , 2012, Cell.

[17]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[18]  P. Tessari,et al.  Skin fibroblasts as a tool for identifying the risk of nephropathy in the type 1 diabetic population , 2012, Diabetes/metabolism research and reviews.

[19]  Zhifu Sun,et al.  Genetic Variation Predicting Cisplatin Cytotoxicity Associated with Overall Survival in Lung Cancer Patients Receiving Platinum-Based Chemotherapy , 2011, Clinical Cancer Research.

[20]  Yoav Gilad,et al.  The effects of EBV transformation on gene expression levels and methylation profiles. , 2011, Human molecular genetics.

[21]  William Stafford Noble,et al.  FIMO: scanning for occurrences of a given motif , 2011, Bioinform..

[22]  E. Smeland,et al.  TGF-β-induced growth inhibition in B-cell lymphoma correlates with Smad1/5 signalling and constitutively active p38 MAPK , 2010, BMC Immunology.

[23]  P. Higgins,et al.  Upstream stimulatory factor‐2 mediates quercetin‐induced suppression of PAI‐1 gene expression in human endothelial cells , 2010, Journal of cellular biochemistry.

[24]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[25]  Saptarsi M. Haldar,et al.  Klf15 Deficiency Is a Molecular Link Between Heart Failure and Aortic Aneurysm Formation , 2010, Science Translational Medicine.

[26]  H. Parkinson,et al.  A global map of human gene expression , 2010, Nature Biotechnology.

[27]  Mark I McCarthy,et al.  Variability of gene expression profiles in human blood and lymphoblastoid cell lines , 2010, BMC Genomics.

[28]  C. Goding,et al.  Target Gene Specificity of USF-1 Is Directed via p38-mediated Phosphorylation-dependent Acetylation* , 2009, The Journal of Biological Chemistry.

[29]  M. Mann,et al.  Comparative Proteomic Phenotyping of Cell Lines and Primary Cells to Assess Preservation of Cell Type-specific Functions , 2009, Molecular & Cellular Proteomics.

[30]  R. Shamir,et al.  Regulatory networks define phenotypic classes of human stem cell lines , 2008, Nature.

[31]  S. Orkin,et al.  An Extended Transcriptional Network for Pluripotency of Embryonic Stem Cells , 2008, Cell.

[32]  S. Chien,et al.  Tumor cell cycle arrest induced by shear stress: Roles of integrins and Smad , 2008, Proceedings of the National Academy of Sciences.

[33]  S. Duan,et al.  Susceptibility loci involved in cisplatin-induced cytotoxicity and apoptosis , 2008, Pharmacogenetics and genomics.

[34]  A. de Bruin,et al.  Synergistic function of E2F7 and E2F8 is essential for cell survival and embryonic development. , 2008, Developmental cell.

[35]  Bryan Frank,et al.  Gene expression profiling of lymphoblastoid cell lines from monozygotic twins discordant in severity of autism reveals differential regulation of neurologically relevant genes , 2006, BMC Genomics.

[36]  G. Stacey Primary Cell Cultures and Immortal Cell Lines , 2006 .

[37]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  K. Helin,et al.  Characterization of E2F8, a novel E2F-like cell-cycle regulated repressor of E2F-activated transcription , 2005, Nucleic acids research.

[39]  R. Sandberg,et al.  The molecular portrait of in vitro growth by meta-analysis of gene-expression profiles , 2005, Genome Biology.

[40]  W. Simonds,et al.  topors, a p53 and topoisomerase I-binding RING finger protein, is a coactivator of p53 in growth suppression induced by DNA damage , 2005, Oncogene.

[41]  L. Young,et al.  Epstein–Barr virus: 40 years on , 2004, Nature Reviews Cancer.

[42]  H. Hermeking,et al.  Evidence for a cancer-specific switch at the CDK4 promoter with loss of control by both USF and c-Myc , 2004, Oncogene.

[43]  Aldi Kraja,et al.  Genome-wide discovery of loci influencing chemotherapy cytotoxicity. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  S. Cawley,et al.  Unbiased Mapping of Transcription Factor Binding Sites along Human Chromosomes 21 and 22 Points to Widespread Regulation of Noncoding RNAs , 2004, Cell.

[45]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[46]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[47]  Ying E. Zhang,et al.  Smad-dependent and Smad-independent pathways in TGF-β family signalling , 2003, Nature.

[48]  V. Cheung,et al.  Transcriptional response of lymphoblastoid cells to ionizing radiation. , 2003, Genome research.

[49]  X. Yang,et al.  Disruption of Smad5 gene leads to enhanced proliferation of high-proliferative potential precursors during embryonic hematopoiesis. , 2003, Blood.

[50]  L. Hood,et al.  A Genomic Regulatory Network for Development , 2002, Science.

[51]  R. DePinho,et al.  Cellular Senescence Minireview Mitotic Clock or Culture Shock? , 2000, Cell.

[52]  J. Shay,et al.  Telomere dynamics in cancer progression and prevention: fundamental differences in human and mouse telomere biology , 2000, Nature Medicine.

[53]  G Melino,et al.  The p53/p63/p73 family of transcription factors: overlapping and distinct functions. , 2000, Journal of cell science.

[54]  T. Lu,et al.  Loss of USF transcriptional activity in breast cancer cell lines , 1999, Oncogene.

[55]  C. Vinson,et al.  Cell-Type-Dependent Activity of the Ubiquitous Transcription Factor USF in Cellular Proliferation and Transcriptional Activation , 1999, Molecular and Cellular Biology.

[56]  B. Wold,et al.  Skeletal muscle determination and differentiation: story of a core regulatory network and its context. , 1996, Current opinion in cell biology.

[57]  Xu-Gang Luo,et al.  Antiproliferative properties of the USF family of helix-loop-helix transcription factors. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[58]  K. Kamiya,et al.  Transcriptional repressor ZF5 identifies a new conserved domain in zinc finger proteins. , 1993, Nucleic acids research.