Global gene expression response of a population exposed to benzene: A pilot study exploring the use of RNA‐sequencing technology

The mechanism of toxicity of the leukemogen benzene is not entirely known. This pilot study used RNA‐sequencing (RNA‐seq) technology to examine the effect of benzene exposure on gene expression in peripheral blood mononuclear cells obtained from 10 workers occupationally exposed to high levels of benzene (≥5 ppm) in air and 10 matched unexposed control workers, from a large study (n = 125) in which gene expression was previously measured by microarray. RNA‐seq is more sensitive and has a wider dynamic range for the quantification of gene expression. Further, it has the ability to detect novel transcripts and alternative splice variants. The main conclusions from our analysis of the 20 workers by RNA‐seq are as follows: The Pearson correlation between the two technical replicates for the RNA‐seq experiments was 0.98 and the correlation between RNA‐seq and microarray signals for the 20 subjects was around 0.6. 60% of the transcripts with detected reads from the RNA‐seq experiments did not have corresponding probes on the microarrays. Fifty‐three percent of the transcripts detected by RNA‐seq and 99% of those with probes on the microarray were protein‐coding. There was a significant overlap (P < 0.05) in transcripts declared differentially expressed due to benzene exposure using the two technologies. About 20% of the transcripts declared differentially expressed using the RNA‐seq data were non‐coding transcripts. Six transcripts were determined (false‐discovery rate < 0.05) to be alternatively spliced as a result of benzene exposure. Overall, this pilot study shows that RNA‐seq can complement the information obtained by microarray in the analysis of changes in transcript expression from chemical exposures. Environ. Mol. Mutagen. 54:566‐573, 2013. © 2013 Wiley Periodicals, Inc.

[1]  S. Lamm,et al.  Benzene Exposure and Hematotoxicity , 2006, Science.

[2]  D. Levy,et al.  A systematic comparison and evaluation of high density exon arrays and RNA-seq technology used to unravel the peripheral blood transcriptome of sickle cell disease , 2012, BMC Medical Genomics.

[3]  Luoping Zhang,et al.  Benzene, the exposome and future investigations of leukemia etiology. , 2011, Chemico-biological interactions.

[4]  Cole Trapnell,et al.  Improving RNA-Seq expression estimates by correcting for fragment bias , 2011, Genome Biology.

[5]  Hans Kromhout,et al.  Flexible Meta-Regression to Assess the Shape of the Benzene–Leukemia Exposure–Response Curve , 2009, Environmental health perspectives.

[6]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[7]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[8]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[9]  Crispin J. Miller,et al.  A comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling , 2010, BMC Genomics.

[10]  R. Hayes,et al.  Detailed exposure assessment for a molecular epidemiology study of benzene in two shoe factories in China. , 2004, The Annals of occupational hygiene.

[11]  Dan Wang,et al.  A comparison of RNA-Seq and high-density exon array for detecting differential gene expression between closely related species , 2010, Nucleic Acids Res..

[12]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[13]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[14]  Damian Smedley,et al.  BioMart Central Portal: an open database network for the biological community , 2011, Database J. Biol. Databases Curation.

[15]  Thomas R Gingeras,et al.  Origin of phenotypes: genes and transcripts. , 2007, Genome research.

[16]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[17]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[18]  Kristin E. Porter,et al.  Global Gene Expression Profiling of a Population Exposed to a Range of Benzene Levels , 2010, Environmental health perspectives.

[19]  Leming Shi,et al.  Comparing next-generation sequencing and microarray technologies in a toxicological study of the effects of aristolochic acid on rat kidneys. , 2011, Chemical research in toxicology.

[20]  Avrum Spira,et al.  Characterizing the Impact of Smoking and Lung Cancer on the Airway Transcriptome Using RNA-Seq , 2011, Cancer Prevention Research.

[21]  N. Rothman,et al.  Systems biology of human benzene exposure. , 2010, Chemico-biological interactions.

[22]  Luoping Zhang,et al.  Current understanding of the mechanism of benzene-induced leukemia in humans: implications for risk assessment. , 2012, Carcinogenesis.

[23]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[24]  Mingyao Li,et al.  RNA-sequence analysis of human B-cells. , 2011, Genome research.

[25]  Damian Smedley,et al.  BioMart – biological queries made easy , 2009, BMC Genomics.

[26]  Nathaniel Rothman,et al.  Evidence That Humans Metabolize Benzene via Two Pathways , 2009, Environmental health perspectives.

[27]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[28]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[29]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[30]  Junjun Zhang,et al.  BioMart Central Portal—unified access to biological data , 2009, Nucleic Acids Res..

[31]  A. Smith,et al.  Meta-analysis of benzene exposure and non-Hodgkin lymphoma: biases could mask an important association , 2008, Occupational and Environmental Medicine.

[32]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[33]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[34]  M. Gerstein,et al.  What is a gene, post-ENCODE? History and updated definition. , 2007, Genome research.

[35]  Geffrey F. Stopper,et al.  Choosing the right path: enhancement of biologically relevant sets of genes or proteins using pathway structure , 2009, Genome Biology.

[36]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..