Machine learning techniques combined with dose profiles indicate radiation response biomarkers

Abstract The focus of this research is to combine statistical and machine learning tools in application to a high-throughput biological data set on ionizing radiation response. The analyzed data consist of two gene expression sets obtained in studies of radiosensitive and radioresistant breast cancer patients undergoing radiotherapy. The data sets were similar in principle; however, the treatment dose differed. It is shown that introducing mathematical adjustments in data preprocessing, differentiation and trend testing, and classification, coupled with current biological knowledge, allows efficient data analysis and obtaining accurate results. The tools used to customize the analysis workflow were batch effect filtration with empirical Bayes models, identifying gene trends through the Jonckheere–Terpstra test and linear interpolation adjustment according to specific gene profiles for multiple random validation. The application of non-standard techniques enabled successful sample classification at the rate of 93.5% and the identification of potential biomarkers of radiation response in breast cancer, which were confirmed with an independent Monte Carlo feature selection approach and by literature references. This study shows that using customized analysis workflows is a necessary step towards novel discoveries in complex fields such as personalized individual therapy.

[1]  Simon Bouffler,et al.  Assessing cancer risks of low-dose radiation , 2009, Nature Reviews Cancer.

[2]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[3]  Bonnie Berger,et al.  Making sense out of massive data by going beyond differential expression , 2012, Proceedings of the National Academy of Sciences.

[4]  S. Kabacik,et al.  Time, Dose and Ataxia Telangiectasia Mutated (ATM) Status Dependency of Coding and Noncoding RNA Expression after Ionizing Radiation Exposure , 2015, Radiation research.

[5]  Daohong Zhou,et al.  Hematopoietic stem cell injury induced by ionizing radiation. , 2014, Antioxidants & redox signaling.

[6]  T. J. Terpstra,et al.  The asymptotic normality and consistency of kendall's test against trend, when ties are present in one ranking , 1952 .

[7]  Joanna Polanska,et al.  Integrating Expression Data from Different Microarray Platforms in Search of Biomarkers of Radiosensitivity , 2014, IWBBIO.

[8]  R. Myers,et al.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data , 2005, Nucleic acids research.

[9]  Roger Owen,et al.  Fractionation sensitivity and dose response of late adverse effects in the breast after radiotherapy for early breast cancer: long-term results of a randomised trial. , 2005, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[10]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[11]  R. Brodsky,et al.  Resistance to apoptosis caused by PIG-A gene mutations in paroxysmal nocturnal hemoglobinuria. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[13]  D. Hallahan,et al.  Radiation induction of immediate early genes: effectors of the radiation-stress response. , 1994, International journal of radiation oncology, biology, physics.

[14]  Lukasz Krol Distributed Monte Carlo Feature Selection: Extracting Informative Features Out of Multidimensional Problems with Linear Speedup , 2016, BDAS.

[15]  A. R. Jonckheere,et al.  A DISTRIBUTION-FREE k-SAMPLE TEST AGAINST ORDERED ALTERNATIVES , 1954 .

[16]  C. Yee,et al.  The Effect of Radiation on the Immune Response to Cancers , 2014, International journal of molecular sciences.

[17]  Q. Zhan Gadd45a, a p53- and BRCA1-regulated stress protein, in cellular response to DNA damage. , 2005, Mutation research.

[18]  Frank Lohr,et al.  Expert system classifier for adaptive radiation therapy in prostate cancer , 2017, Australasian physical & engineering sciences in medicine.

[19]  J. Berger,et al.  The Intrinsic Bayes Factor for Model Selection and Prediction , 1996 .

[20]  Christophe Badie,et al.  High and low dose responses of transcriptional biomarkers in ex vivo X-irradiated human blood , 2013, International journal of radiation biology.

[21]  Ola Nilsson,et al.  NAMPT Inhibitor GMX1778 Enhances the Efficacy of 177Lu-DOTATATE Treatment of Neuroendocrine Tumors , 2016, The Journal of Nuclear Medicine.

[22]  Christophe Badie,et al.  Correlation of in vitro lymphocyte radiosensitivity and gene expression with late normal tissue reactions following curative radiotherapy for breast cancer. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[23]  Michael C. Joiner,et al.  A simple α/β-independent method to derive fully isoeffective schedules following changes in dose per fraction , 2004 .

[24]  Dinesh Gupta,et al.  Machine learning for biomarker identification in cancer research - developments toward its clinical application. , 2015, Personalized medicine.

[25]  Alexander D. Diehl,et al.  Ontology based molecular signatures for immune cell types via gene expression analysis , 2013, BMC Bioinformatics.

[26]  R. Doll,et al.  Cancer risks attributable to low doses of ionizing radiation: Assessing what we really know , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[27]  David M. Rocke,et al.  Comparison of low and high dose ionising radiation using topological analysis of gene coexpression networks , 2012, BMC Genomics.

[28]  Laurent Albera,et al.  On feature extraction and classification in prostate cancer radiotherapy using tensor decompositions. , 2015, Medical engineering & physics.

[29]  K. Wiman,et al.  Wig-1 regulates cell cycle arrest and cell death through the p53 targets FAS and 14-3-3σ , 2014, Oncogene.

[30]  Christophe Badie,et al.  Influence of Confounding Factors on Radiation Dose Estimation Using In Vivo Validated Transcriptional Biomarkers , 2018, Health physics.

[31]  X. Kong,et al.  Bioinformatics analysis of biomarkers and transcriptional factor motifs in Down syndrome , 2014, Brazilian journal of medical and biological research = Revista brasileira de pesquisas medicas e biologicas.

[32]  Nations United sources and effects of ionizing radiation , 2000 .

[33]  Christophe Badie,et al.  Gene expression following ionising radiation: Identification of biomarkers for dose estimation and prediction of individual response , 2011, International journal of radiation biology.

[34]  Alison Abbott,et al.  Researchers pin down risks of low-dose radiation , 2015, Nature.

[35]  P. Lambin,et al.  Machine Learning methods for Quantitative Radiomic Biomarkers , 2015, Scientific Reports.

[36]  R Iwata,et al.  Assessment of cancer recurrence in residual tumors after fractionated radiotherapy: a comparison of fluorodeoxyglucose, L-methionine and thymidine. , 1997, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[37]  Cesare Furlanello,et al.  Multi-omics integration for neuroblastoma clinical endpoint prediction , 2018, Biology Direct.