Improved prediction of treatment response using microarrays and existing biological knowledge.

A desired application for microarrays in the clinic is to predict treatment response from an often diverse patient population. We present a method for analyzing microarray data that is predicated on biological pathway and function knowledge as opposed to a purely data-driven initial analysis. From an analysis perspective, this methodology takes advantage of information that is available across genes on a single array, as well as differences in those patterns across measurements. By using biological knowledge in the initial analysis, the accuracy and robustness of microarray profile classification is enhanced, especially when low numbers of samples are available. For clinical studies, particularly Phase I or I/II studies, this technique is exceptionally advantageous.

[1]  P. Park,et al.  Discovering statistically significant pathways in expression profiling studies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[2]  J. Zhu,et al.  An integrative genomics approach to the reconstruction of gene networks in segregating populations , 2004, Cytogenetic and Genome Research.

[3]  D. Khatry,et al.  Expression profiling of blood samples from an SU5416 Phase III metastatic colorectal cancer clinical trial: a novel strategy for biomarker identification , 2003, BMC Cancer.

[4]  E. Wit Design and Analysis of DNA Microarray Investigations , 2004, Human Genomics.

[5]  Jian Huang,et al.  Regularized ROC method for disease classification and biomarker selection with microarray data , 2005, Bioinform..

[6]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[7]  Eytan Domany,et al.  Outcome signature genes in breast cancer: is there a unique set? , 2004, Breast Cancer Research.

[8]  Yudong D. He,et al.  Expression profiling predicts outcome in breast cancer , 2002, Breast Cancer Research.

[9]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[10]  Cheng Cheng,et al.  Gene-expression patterns in drug-resistant acute lymphoblastic leukemia cells and response to treatment. , 2004, The New England journal of medicine.

[11]  H. White,et al.  Data‐Snooping, Technical Trading Rule Performance, and the Bootstrap , 1999 .

[12]  James R. Downing,et al.  TGF-β Signaling, Tumor Suppression, and Acute Lymphoblastic Leukemia , 2004 .

[13]  J. Foekens,et al.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer , 2005, The Lancet.

[14]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[15]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.

[16]  J. Pronk,et al.  Reproducibility of Oligonucleotide Microarray Transcriptome Analyses , 2002, The Journal of Biological Chemistry.

[17]  R. Spang,et al.  Predicting the clinical status of human breast cancer by using gene expression profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[18]  J. Michael Cherry,et al.  Microarray data quality analysis: lessons from the AFGC project , 2004, Plant Molecular Biology.

[19]  H. Jürgens,et al.  Inhibition of hypercoagulation by antithrombin substitution in E. coli L‐asparaginase‐treated children , 2009, European journal of haematology.

[20]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[21]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[22]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[23]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[25]  N. Gerry,et al.  Reliability and reproducibility of gene expression measurements using amplified RNA from laser-microdissected primary breast tissue with oligonucleotide arrays. , 2005, The Journal of molecular diagnostics : JMD.

[26]  Simon Lin,et al.  Methods of microarray data analysis III , 2002 .