A novel algorithm for simplification of complex gene classifiers in cancer.

The clinical application of complex molecular classifiers as diagnostic or prognostic tools has been limited by the time and cost needed to apply them to patients. Using an existing 50-gene expression signature known to separate two molecular subtypes of the pediatric cancer rhabdomyosarcoma, we show that an exhaustive iterative search algorithm can distill this complex classifier down to two or three features with equal discrimination. We validated the two-gene signatures using three separate and distinct datasets, including one that uses degraded RNA extracted from formalin-fixed, paraffin-embedded material. Finally, to show the generalizability of our algorithm, we applied it to a lung cancer dataset to find minimal gene signatures that can distinguish survival. Our approach can easily be generalized and coupled to existing technical platforms to facilitate the discovery of simplified signatures that are ready for routine clinical use.

[1]  T. Triche,et al.  Identification of a PAX-FKHR gene expression signature that defines molecular classes and determines the prognosis of alveolar rhabdomyosarcomas. , 2006, Cancer research.

[2]  Igor Jurisica,et al.  Gene expression–based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study , 2008, Nature Medicine.

[3]  S. Skapek,et al.  Childhood Rhabdomyosarcoma: New Insight on Biology and Treatment , 2010, Current oncology reports.

[4]  R. Arceci Fusion Gene–Negative Alveolar Rhabdomyosarcoma Is Clinically and Molecularly Indistinguishable From Embryonal Rhabdomyosarcoma , 2010 .

[5]  P. Cin,et al.  Recurrent t(2;2) and t(2;8) translocations in rhabdomyosarcoma without the canonical PAX‐FOXO1 fuse PAX3 to members of the nuclear receptor transcriptional coactivator family , 2009, Genes, chromosomes & cancer.

[6]  P. Sorensen,et al.  Molecular classification of rhabdomyosarcoma--genotypic and phenotypic determinants of diagnosis: a report from the Children's Oncology Group. , 2009, The American journal of pathology.

[7]  M. Ladanyi,et al.  Diagnosing alveolar rhabdomyosarcoma: morphology must be coupled with fusion confirmation. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[8]  Jannik N. Andersen,et al.  Cancer genomics: from discovery science to personalized medicine , 2011, Nature Medicine.

[9]  J. Verweij,et al.  Future treatment of soft tissue sarcomas will be driven by histological subtype and molecular aberrations. , 2010, European journal of cancer.

[10]  Daniel S. Katz,et al.  Swift: A language for distributed parallel scripting , 2011, Parallel Comput..

[11]  Jennifer L. Osborn,et al.  Direct multiplexed measurement of gene expression with color-coded probe pairs , 2008, Nature Biotechnology.

[12]  R. Arceci PAX3/FOXO1 Fusion Gene Status Is the Key Prognostic Molecular Marker in Rhabdomyosarcoma and Significantly Improves Current Risk Stratification , 2012 .

[13]  D. Hawkins,et al.  Rhabdomyosarcoma: Review of the Children's Oncology Group (COG) soft‐tissue Sarcoma committee experience and rationale for current COG studies , 2012, Pediatric blood & cancer.

[14]  S. Spunt,et al.  Soft tissue sarcomas of childhood. , 2004, Cancer treatment reviews.

[15]  J. Bridge,et al.  Molecular diagnostics of soft tissue tumors. , 2011, Archives of pathology & laboratory medicine.

[16]  Silvia Behnke,et al.  Subtype and prognostic classification of rhabdomyosarcoma by immunohistochemistry. , 2006, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[17]  James R. Anderson,et al.  Prognostic factors in metastatic rhabdomyosarcomas: results of a pooled analysis from United States and European cooperative groups. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[18]  R. Arceci,et al.  Gene Expression Signatures Identify Rhabdomyosarcoma Subtypes and Detect a Novel t(2;2)(q35;p23) Translocation Fusing PAX3 to NCOA1 , 2006 .

[19]  Lynette M. Smith,et al.  Examination of gene fusion status in archival samples of alveolar rhabdomyosarcoma entered on the Intergroup Rhabdomyosarcoma Study-III trial: a report from the Children's Oncology Group. , 2006, The Journal of molecular diagnostics : JMD.

[20]  Robert J. Marinelli,et al.  Diffuse Myogenin Expression by Immunohistochemistry is an Independent Marker of Poor Survival in Pediatric Rhabdomyosarcoma: A Tissue Microarray Study of 71 Primary Tumors Including Correlation With Molecular Phenotype , 2008, The American journal of surgical pathology.

[21]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[22]  James R. Anderson,et al.  Gene expression profiling for survival prediction in pediatric rhabdomyosarcomas: a report from the children's oncology group. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.