Translating microarray data for diagnostic testing in childhood leukaemia

BackgroundRecent findings from microarray studies have raised the prospect of a standardized diagnostic gene expression platform to enhance accurate diagnosis and risk stratification in paediatric acute lymphoblastic leukaemia (ALL). However, the robustness as well as the format for such a diagnostic test remains to be determined. As a step towards clinical application of these findings, we have systematically analyzed a published ALL microarray data set using Robust Multi-array Analysis (RMA) and Random Forest (RF).MethodsWe examined published microarray data from 104 ALL patients specimens, that represent six different subgroups defined by cytogenetic features and immunophenotypes. Using the decision-tree based supervised learning algorithm Random Forest (RF), we determined a small set of genes for optimal subgroup distinction and subsequently validated their predictive power in an independent patient cohort.ResultsWe achieved very high overall ALL subgroup prediction accuracies of about 98%, and were able to verify the robustness of these genes in an independent panel of 68 specimens obtained from a different institution and processed in a different laboratory. Our study established that the selection of discriminating genes is strongly dependent on the analysis method. This may have profound implications for clinical use, particularly when the classifier is reduced to a small set of genes. We have demonstrated that as few as 26 genes yield accurate class prediction and importantly, almost 70% of these genes have not been previously identified as essential for class distinction of the six ALL subgroups.ConclusionOur finding supports the feasibility of qRT-PCR technology for standardized diagnostic testing in paediatric ALL and should, in conjunction with conventional cytogenetics lead to a more accurate classification of the disease. In addition, we have demonstrated that microarray findings from one study can be confirmed in an independent study, using an entirely independent patient cohort and with microarray experiments being performed by a different research team.

[1]  M. Ringnér,et al.  Gene expression profiles in a panel of childhood leukemia cell lines mirror critical features of the disease. , 2003, Molecular cancer therapeutics.

[2]  W. Bleyer,et al.  Children's Cancer Group trials in childhood acute lymphoblastic leukemia: 1983–1995 , 2000, Leukemia.

[3]  J. Downing,et al.  Gene Expression Profiling of Pediatric Acute Myelogenous Leukemia Materials and Methods , 2022 .

[4]  R. Gisler,et al.  Cloning and Characterization of a Promoter Flanking the Early B Cell Factor (EBF) Gene Indicates Roles for E-Proteins and Autoregulation in the Control of EBF Expression1 , 2002, The Journal of Immunology.

[5]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[6]  J. Martínez-Climent Molecular cytogenetics of childhood hematological malignancies , 1997, Leukemia.

[7]  A. Fleming,et al.  Childhood leukaemia , 1991, The Lancet.

[8]  Eliot Marshall,et al.  Getting the Noise Out of Gene Arrays , 2004, Science.

[9]  J. Downing,et al.  Oncogenic homeodomain transcription factor E2A-Pbx1 activates a novel WNT gene in pre-B acute lymphoblastoid leukemia. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[10]  J. Downing,et al.  Pediatric acute lymphoblastic leukemia. , 2003, Hematology. American Society of Hematology. Education Program.

[11]  Guidel Ines,et al.  Expression profiling — best practices for data generation and interpretation in clinical trials , 2004, Nature Reviews Genetics.

[12]  J. Downing,et al.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. , 2002, Cancer cell.

[13]  Martin J Firth,et al.  The gene expression signature of relapse in paediatric acute lymphoblastic leukaemia: implications for mechanisms of therapy failure , 2005, British journal of haematology.

[14]  E. Lander,et al.  Gene expression signatures define novel oncogenic pathways in T cell acute lymphoblastic leukemia. , 2002, Cancer cell.

[15]  C. Pui,et al.  Molecular diagnostics in the treatment of leukemia. , 1999, Current opinion in hematology.

[16]  S. Chellappan,et al.  Prohibitin Induces the Transcriptional Activity of p53 and Is Exported from the Nucleus upon Apoptotic Signaling* , 2003, Journal of Biological Chemistry.

[17]  M. Martineau,et al.  The Leukaemia Research Fund/United Kingdom Cancer Cytogenetics Group Karyotype Database in acute lymphoblastic leukaemia: a valuable resource for patient management , 2001, British journal of haematology.

[18]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[19]  C. Pui,et al.  Recent advances in the treatment and understanding of childhood acute lymphoblastic leukaemia. , 2003, Cancer treatment reviews.

[20]  Geoffrey J McLachlan,et al.  Selection bias in gene extraction on the basis of microarray gene-expression data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[21]  S. Knudsen,et al.  Prediction of immunophenotype, treatment response, and relapse in childhood acute lymphoblastic leukemia using DNA microarrays , 2004, Leukemia.

[22]  T. Golub,et al.  Genomic approaches to hematologic malignancies. , 2004, Blood.

[23]  M. Xiong,et al.  Recursive partitioning for tumor classification with gene expression microarray data , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[24]  C. Bloomfield,et al.  Cytogenetics in Acute Leukemia , 2022 .

[25]  Aniko Szabo,et al.  Identification of gene expression profiles that segregate patients with childhood leukemia. , 2002, Clinical cancer research : an official journal of the American Association for Cancer Research.

[26]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[27]  Heping Zhang,et al.  Cell and tumor classification using gene expression data: Construction of forests , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  J. Shuster,et al.  Long-term results of treatment studies for childhood acute lymphoblastic leukemia: Pediatric Oncology Group studies from 1986–1994 , 2000, Leukemia.

[29]  J. Downing,et al.  Treatment-specific changes in gene expression discriminate in vivo drug response in human leukemia cells , 2003, Nature Genetics.

[30]  J. Kersey Fifty years of studies of the biology and therapy of childhood leukemia. , 1997, Blood.

[31]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[32]  Katrin Hoffmann,et al.  Gene expression levels assessed by oligonucleotide microarray analysis and quantitative real-time RT-PCR – how well do they correlate? , 2005, BMC Genomics.

[33]  M. Radmacher,et al.  Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification. , 2003, Journal of the National Cancer Institute.

[34]  Andy Greenfield,et al.  Using DNA microarrays. , 2008, Methods in molecular biology.

[35]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[36]  The eMERGE Clinical Annotation Working Group,et al.  Expression profiling — best practices for data generation and interpretation in clinical trials , 2004 .

[37]  Li M Fu,et al.  Multi‐class cancer subtype classification based on gene expression signatures with reliability analysis , 2004, FEBS letters.

[38]  M. Greaves Science, medicine, and the future: Childhood leukaemia , 2002 .

[39]  C. Pui,et al.  Uniform approach to risk classification and treatment assignment for children with acute lymphoblastic leukemia. , 1996, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[40]  E. Lander,et al.  MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia , 2002, Nature Genetics.

[41]  Martin J Firth,et al.  Gene expression levels in small specimens from patients detected using oligonucleotide arrays , 2005, Molecular biotechnology.

[42]  Stefan Michiels,et al.  Prediction of cancer outcome with microarrays: a multiple random validation strategy , 2005, The Lancet.

[43]  D. Stone,et al.  Prediction of clinical drug efficacy by classification of drug-induced genomic expression profiles in vitro , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Misao Ohki,et al.  Identification of a gene expression signature associated with pediatric AML prognosis. , 2003, Blood.

[45]  J. Downing,et al.  Classification of pediatric acute lymphoblastic leukemia by gene expression profiling. , 2003, Blood.

[46]  Misao Ohki,et al.  Two distinct gene expression signatures in pediatric acute lymphoblastic leukemia with MLL rearrangements. , 2003, Cancer research.

[47]  C. Harrison,et al.  The detection and significance of chromosomal abnormalities in childhood acute lymphoblastic leukaemia. , 2001, Blood reviews.