Whole-genome sequencing and gene network modules predict gemcitabine/carboplatin-induced myelosuppression in non-small cell lung cancer patients

Gemcitabine/carboplatin chemotherapy commonly induces myelosuppression, including neutropenia, leukopenia, and thrombocytopenia. Predicting patients at risk of these adverse drug reactions (ADRs) and adjusting treatments accordingly is a long-term goal of personalized medicine. This study used whole-genome sequencing (WGS) of blood samples from 96 gemcitabine/carboplatin-treated non-small cell lung cancer (NSCLC) patients and gene network modules for predicting myelosuppression. Association of genetic variants in PLINK found 4594, 5019, and 5066 autosomal SNVs/INDELs with p  ≤ 1 × 10 −3 for neutropenia, leukopenia, and thrombocytopenia, respectively. Based on the SNVs/INDELs we identified the toxicity module, consisting of 215 unique overlapping genes inferred from MCODE-generated gene network modules of 350, 345, and 313 genes, respectively. These module genes showed enrichment for differentially expressed genes in rat bone marrow, human bone marrow, and human cell lines exposed to carboplatin and gemcitabine ( p  < 0.05). Then using 80% of the patients as training data, random LASSO reduced the number of SNVs/INDELs in the toxicity module into a feasible prediction model consisting of 62 SNVs/INDELs that accurately predict both the training and the test (remaining 20%) data with high (CTCAE 3–4) and low (CTCAE 0–1) maximal myelosuppressive toxicity completely, with the receiver-operating characteristic (ROC) area under the curve (AUC) of 100%. The present study shows how WGS, gene network modules, and random LASSO can be used to develop a feasible and tested model for predicting myelosuppressive toxicity. Although the proposed model predicts myelosuppression in this study, further evaluation in other studies is required to determine its reproducibility, usability, and clinical effect.

[1]  C. Lozzio,et al.  Human chronic myelogenous leukemia cell-line with positive Philadelphia chromosome. , 1975, Blood.

[2]  C. Lozzio,et al.  Properties and usefulness of the original K-562 human myelogenous leukemia cell line. , 1979, Leukemia research.

[3]  C. Lozzio,et al.  A Multipotential Leukemia Cell Line (K-562) of Human Origin 1 , 1981, Proceedings of the Society for Experimental Biology and Medicine. Society for Experimental Biology and Medicine.

[4]  Y. Hayashi,et al.  Establishment of a human leukaemic cell line (CMK) with megakaryocytic characteristics from a Down's syndrome patient with acute megakaryoblastic leukaemia , 1989, British journal of haematology.

[5]  T. Suda,et al.  Growth and differentiation of a human megakaryoblastic cell line, CMK. , 1989, Blood.

[6]  Y. Matsuo,et al.  Establishment and characterization of a novel megakaryoblastic cell line, MOLM-1, from a patient with chronic myelogenous leukemia. , 1991, Human cell.

[7]  K. Tanaka,et al.  Structurally altered Evi-1 protein generated in the 3q21q26 syndrome. , 1996, Oncogene.

[8]  H. Drexler,et al.  Leukemia cell lines: in vitro models for the study of Philadelphia chromosome-positive leukemia. , 1999, Leukemia research.

[9]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[10]  I. Smith,et al.  Early clinical studies with cis-diammine-1,1-cyclobutane dicarboxylate platinum II , 2004, Cancer Chemotherapy and Pharmacology.

[11]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[12]  J. Robert,et al.  Predicting drug response and toxicity based on gene polymorphisms. , 2005, Critical reviews in oncology/hematology.

[13]  Michel Eichelbaum,et al.  Pharmacogenomics and individualized drug therapy. , 2006, Annual review of medicine.

[14]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[15]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[16]  S. Kaasa,et al.  Phase III study by the Norwegian lung cancer study group: pemetrexed plus carboplatin compared with gemcitabine plus carboplatin as first-line chemotherapy in advanced non-small-cell lung cancer. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[17]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[18]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[19]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[20]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[21]  T. Ugarova,et al.  Plasminogen on the surfaces of fibrin clots prevents adhesion of leukocytes and platelets , 2010, Journal of thrombosis and haemostasis : JTH.

[22]  P. Robinson,et al.  Whole-exome sequencing for finding de novo mutations in sporadic mental retardation , 2010, Genome Biology.

[23]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[24]  Charles Auffray,et al.  An integrative systems biology approach to understanding pulmonary diseases. , 2010, Chest.

[25]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[26]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[27]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[28]  Andrew P Morris,et al.  Basic statistical analysis in genetic case-control studies , 2011, Nature Protocols.

[29]  Q. Wei,et al.  Association of ABCC2 polymorphisms with platinum-based chemotherapy response and severe toxicity in non-small cell lung cancer patients. , 2011, Lung cancer.

[30]  Sijian Wang,et al.  RANDOM LASSO. , 2011, The annals of applied statistics.

[31]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[32]  M. Tsuboi,et al.  Randomized Phase II Study of Two Schedules of Carboplatin and Gemcitabine for Stage IIIB and IV Advanced Non-Small Cell Lung Cancer (JACCRO LC-01 Study) , 2011, Chemotherapy.

[33]  David Levine,et al.  A high-performance computing toolset for relatedness and principal component analysis of SNP data , 2012, Bioinform..

[34]  Joaquín Dopazo,et al.  Qualimap: evaluating next-generation sequencing alignment data , 2012, Bioinform..

[35]  Davis J. McCarthy,et al.  Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation , 2012, Nucleic acids research.

[36]  M. Jarvelin,et al.  Highly interconnected genes in disease-specific networks are enriched for disease-associated polymorphisms , 2012, Genome Biology.

[37]  Q. Wei,et al.  Association between CASP8 and CASP10 polymorphisms and toxicity outcomes with platinum-based chemotherapy in Chinese patients with non-small cell lung cancer. , 2012, The oncologist.

[38]  J. Lundeberg,et al.  Identification of candidate SNPs for drug induced toxicity from differentially expressed genes in associated tissues. , 2012, Gene.

[39]  Yusuke Nakamura,et al.  A genome-wide association study identifies four genetic markers for hematological toxicities in cancer patients receiving gemcitabine therapy , 2012, Pharmacogenetics and genomics.

[40]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[41]  J. Quigley,et al.  Cell Surface Remodeling by Plasmin: A New Function for an Old Enzyme , 2012, Journal of biomedicine & biotechnology.

[42]  M. Eileen Dolan,et al.  Cancer pharmacogenomics: strategies and challenges , 2012, Nature Reviews Genetics.

[43]  Jonathan E. Clark,et al.  Co-expression network analysis identifies Spleen Tyrosine Kinase (SYK) as a candidate oncogenic driver in a subset of small-cell lung cancer , 2013, BMC Systems Biology.

[44]  Yusuke Nakamura,et al.  Genome‐wide association study of chemotherapeutic agent‐induced severe neutropenia/leucopenia for patients in Biobank Japan , 2013, Cancer science.

[45]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[46]  Pablo Villoslada,et al.  Modules, networks and systems medicine for understanding disease and aiding diagnosis , 2014, Genome Medicine.

[47]  R. Rosenfeld Patients , 2012, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[48]  Chun-Ho Shih,et al.  A critical role for the regulation of Syk from agglutination to aggregation in human platelets. , 2014, Biochemical and biophysical research communications.

[49]  N. Wu,et al.  Low Expression of the E3 Ubiquitin Ligase CBL Confers Chemoresistance in Human Pancreatic Cancer and Is Targeted by Epidermal Growth Factor Receptor Inhibition , 2014, Clinical Cancer Research.

[50]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[51]  A. Barabasi,et al.  Uncovering disease-disease relationships through the incomplete interactome , 2015, Science.

[52]  Joakim Lundeberg,et al.  Using Whole-Exome Sequencing to Identify Genetic Markers for Carboplatin and Gemcitabine-Induced Toxicities , 2015, Clinical Cancer Research.

[53]  L. Medeiros,et al.  Hematolymphoid neoplasms associated with rearrangements of PDGFRA, PDGFRB, and FGFR1. , 2015, American journal of clinical pathology.

[54]  Rémy Bruggmann,et al.  Clinical sequencing: is WGS the better WES? , 2016, Human Genetics.

[55]  J. Heemskerk,et al.  Plasminogen associates with phosphatidylserine-exposing platelets and contributes to thrombus lysis under flow. , 2015, Blood.

[56]  Lei Shang,et al.  Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants , 2014, Proceedings of the National Academy of Sciences.

[57]  Qing-Yu He,et al.  ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization , 2015, Bioinform..

[58]  Albert-László Barabási,et al.  A DIseAse MOdule Detection (DIAMOnD) Algorithm Derived from a Systematic Analysis of Connectivity Patterns of Disease Proteins in the Human Interactome , 2015, PLoS Comput. Biol..

[59]  L. Donehower,et al.  WIP1 Phosphatase as a Potential Therapeutic Target in Neuroblastoma , 2015, PloS one.

[60]  K. Stegmaier,et al.  Increased SYK activity is associated with unfavorable outcome among patients with acute myeloid leukemia , 2015, Oncotarget.

[61]  Li'na Hu,et al.  Tyrosine kinase LYN is an oncotarget in human cervical cancer: A quantitative proteomic based study , 2016, Oncotarget.

[62]  Antonia K. Roseweir,et al.  Nuclear expression of Lyn, a Src family kinase member, is associated with poor prognosis in renal cancer patients , 2016, BMC Cancer.

[63]  Y. Shin,et al.  LYN expression predicts the response to dasatinib in a subpopulation of lung adenocarcinoma patients , 2016, OncoTarget.

[64]  D. Postma,et al.  Paving the way of systems biology and precision medicine in allergic diseases: the MeDALL success story , 2016, Allergy.

[65]  Y. Bang,et al.  Src as a Therapeutic Target in Biliary Tract Cancer , 2016, Molecular Cancer Therapeutics.

[66]  Wen-Lun Wang,et al.  Low disabled-2 expression promotes tumor progression and determines poor survival and high recurrence of esophageal squamous cell carcinoma , 2016, Oncotarget.

[67]  G. Jin,et al.  Genome-wide association study of myelosuppression in non-small-cell lung cancer patients with platinum-based chemotherapy , 2015, The Pharmacogenomics Journal.

[68]  Mika Gustafsson,et al.  Dynamic Response Genes in CD4+ T Cells Reveal a Network of Interactive Proteins that Classifies Disease Activity in Multiple Sclerosis. , 2016, Cell reports.

[69]  Måns Magnusson,et al.  MultiQC: summarize analysis results for multiple tools and samples in a single report , 2016, Bioinform..

[70]  A. Reiter,et al.  Myeloid neoplasms with eosinophilia. , 2017, Blood.

[71]  Damian Szklarczyk,et al.  The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible , 2016, Nucleic Acids Res..

[72]  Reinhard Guthke,et al.  ModuleDiscoverer: Identification of regulatory modules in protein-protein interaction networks , 2017, Scientific Reports.

[73]  A. Zick,et al.  Treatment inferred from mutations identified using massive parallel sequencing leads to clinical benefit in some heavily pretreated cancer patients , 2016, Medicine.

[74]  Hui-Ju Tsai,et al.  Functional links between Disabled‐2 Ser723 phosphorylation and thrombin signaling in human platelets , 2017, Journal of thrombosis and haemostasis : JTH.

[75]  P. Sahlén,et al.  Comparison of Variant Calls from Whole Genome and Whole Exome Sequencing Data Using Matched Samples , 2018 .

[76]  K. Kiwerska,et al.  Frequent methylation of DAB2, a Wnt pathway antagonist, in oral and oropharyngeal squamous cell carcinomas. , 2017, Pathology, research and practice.

[77]  A. Jemal,et al.  Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries , 2018, CA: a cancer journal for clinicians.

[78]  J. Lundeberg,et al.  Genes and variants in hematopoiesis-related pathways are associated with gemcitabine/carboplatin-induced thrombocytopenia , 2019, The Pharmacogenomics Journal.

[79]  Mika Gustafsson,et al.  MODifieR: an Ensemble R Package for Inference of Disease Modules from Transcriptomics Networks , 2020, Bioinform..