The development of composite circulating biomarker models for use in anticancer drug clinical development

The development of informative composite circulating biomarkers predicting cancer presence or therapy response is clinically attractive but optimal approaches to modeling are as yet unclear. This study investigated multidimensional relationships within an example panel of serum insulin‐like growth factor (IGF) peptides using logistic regression (LR), fractional polynomial (FP), regression, artificial neural networks (ANNs) and support vector machines (SVMs) to derive predictive models for colorectal cancer (CRC). Two phase 2 biomarker validation analyses were performed: controls were ambulant adults (n = 722); cases were: (i) CRC patients (n = 100) and (ii) patients with acromegaly (n = 52), the latter as “positive” discriminators. Serum IGF‐I, IGF‐II, IGF binding protein (IGFBP)‐2 and ‐3 were measured. Discriminatory characteristics were compared within and between models. For the LR, FP and ANN models, and to a lesser extent SVMs, the addition of covariates at several steps improved discrimination characteristics. The optimum biomarker combination discriminating CRC vs. controls was achieved using ANN models [sensitivity, 94%; specificity, 90%; accuracy, 0.975 (95% CIs: 0.948 1.000)]. ANN modeling significantly outperformed LR, FP and SVM in terms of discrimination (p < 0.0001) and calibration. The acromegaly analysis demonstrated expected high performance characteristics in the ANN model [accuracy, 0.993 (95% CIs: 0.977, 1.000)]. Curved decision surfaces generated from the ANNs revealed the potential clinical utility. This example demonstrated improved discriminatory characteristics within the composite biomarker ANN model and a final model that outperformed the three other models. This modeling approach forms the basis to evaluate composite biomarkers as pharmacological and predictive biomarkers in future clinical trials.

[1]  G. Jayson,et al.  'Fit-for-purpose' validation of SearchLight multiplex ELISAs of angiogenesis for clinical trial use. , 2009, Journal of immunological methods.

[2]  G. Jayson,et al.  New therapeutic agents in ovarian cancer , 2009, Current opinion in obstetrics & gynecology.

[3]  J. Meyerhardt,et al.  Plasma Insulin-like Growth Factors, Insulin-like Binding Protein-3, and Outcome in Metastatic Colorectal Cancer: Results from Intergroup Trial N9741 , 2008, Clinical Cancer Research.

[4]  Christophe Lemetre,et al.  An introduction to artificial neural networks in bioinformatics - application to complex microarray and mass spectrometry datasets in cancer studies , 2008, Briefings Bioinform..

[5]  T. Yeatman,et al.  On the eve of personalized medicine in oncology. , 2008, Cancer research.

[6]  Graham R. Ball,et al.  Identification of gene transcript signatures predictive for estrogen receptor and lymph node status using a stepwise forward selection artificial neural network modelling approach , 2008, Artif. Intell. Medicine.

[7]  C. Sawyers The cancer biomarker problem , 2008, Nature.

[8]  M. Pencina,et al.  Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond , 2008, Statistics in medicine.

[9]  A. Howell,et al.  The effects of sex steroid replacement therapy on an expanded panel of IGF-related peptides. , 2007, Growth hormone & IGF research : official journal of the Growth Hormone Research Society and the International IGF Research Society.

[10]  A. Cuschieri,et al.  Artificial Neural Network: Predicted vs. Observed Survival in Patients with Colonic Cancer , 2007, Diseases of the colon and rectum.

[11]  William Stafford Noble,et al.  Support vector machine , 2013 .

[12]  I. Gram,et al.  Body mass index, waist circumference and waist–hip ratio and serum levels of IGF-I and IGFBP-3 in European women , 2006, International Journal of Obesity.

[13]  A. Renehan,et al.  Obesity and cancer risk: the role of the insulin–IGF axis , 2006, Trends in Endocrinology & Metabolism.

[14]  Steve Halligan,et al.  Systematic reviews of diagnostic tests in cancer: review of methods and reporting , 2006, BMJ : British Medical Journal.

[15]  D. Ward,et al.  Identification of serum biomarkers for colon cancer by proteomic analysis , 2006, British Journal of Cancer.

[16]  Paulo J. G. Lisboa,et al.  The Use of Artificial Neural Networks in Decision Support in Cancer: a Systematic Review , 2005 .

[17]  Farid E Ahmed,et al.  Molecular Cancer BioMed Central Review , 2005 .

[18]  Graham R. Ball,et al.  Classification of bacterial species from proteomic data using combinatorial approaches incorporating artificial neural networks, cluster analysis and principal components analysis , 2005, Bioinform..

[19]  Ah-Hwee Tan,et al.  Predictive neural networks for gene expression data analysis , 2005, Neural Networks.

[20]  A. Renehan,et al.  The effect of cigarette smoking use and cessation on serum insulin-like growth factors , 2004, British Journal of Cancer.

[21]  Yi-Zeng Liang,et al.  Monte Carlo cross‐validation for selecting a model and estimating the prediction error in multivariate calibration , 2004 .

[22]  Michael W Kattan,et al.  Evaluating a New Marker’s Predictive Contribution , 2004, Clinical Cancer Research.

[23]  Andrew G Renehan,et al.  Determination of IGF-I, IGF-II, IGFBP-2, and IGFBP-3 levels in serum and plasma: comparisons using the Bland-Altman method. , 2003, Growth hormone & IGF research : official journal of the Growth Hormone Research Society and the International IGF Research Society.

[24]  David Kerr,et al.  Neural networks in the prediction of survival in patients with colorectal cancer. , 2003, Clinical colorectal cancer.

[25]  W. Ryder,et al.  Paradoxical elevations in serum IGF‐II and IGF binding protein‐2 in acromegaly: insights into the regulation of these peptides , 2001, Clinical endocrinology.

[26]  M S Pepe,et al.  Phases of biomarker development for early detection of cancer. , 2001, Journal of the National Cancer Institute.

[27]  D M Rodvold,et al.  Neural network and regression predictions of 5‐year survival after colon carcinoma treatment , 2001, Cancer.

[28]  P. Bruzzi,et al.  Diagnostic value of the acid-labile subunit in acromegaly: evaluation in comparison with insulin-like growth factor (IGF) I, and IGF-binding protein-1, -2, and -3. , 2001, The Journal of clinical endocrinology and metabolism.

[29]  C. Potten,et al.  Elevated serum insulin-like growth factor (IGF)-II and IGF binding protein-2 in patients with colorectal cancer , 2000, British Journal of Cancer.

[30]  W. Vach,et al.  On the misuses of artificial neural networks for prognostic and diagnostic classification in oncology. , 2000, Statistics in medicine.

[31]  P Royston,et al.  The use of fractional polynomials to model continuous risk variables in epidemiology. , 1999, International journal of epidemiology.

[32]  P. O'neill,et al.  The diagnosis of severe growth hormone deficiency in elderly patients with hypothalamic–pituitary disease , 1998, Clinical endocrinology.

[33]  L. Bottaci,et al.  Artificial neural networks applied to outcome prediction for colorectal cancer patients in separate institutions , 1997, The Lancet.

[34]  A. Karasik,et al.  Increased insulin-like growth factor binding protein-2 (IGFBP-2) gene expression and protein production lead to high IGFBP-2 content in malignant ovarian cyst fluid. , 1996, British Journal of Cancer.

[35]  D. Clemmons,et al.  Insulin-like growth factors and their binding proteins: biological actions. , 1995, Endocrine reviews.

[36]  N. Skakkebaek,et al.  The ratio between serum levels of insulin‐like growth factor (IGF)‐I and the IGF binding proteins (IGFBP‐1, 2 and 3) decreases with age in healthy adults and is increased in acromegalic patients , 1994, Clinical endocrinology.

[37]  T. Stamey,et al.  Elevated levels of insulin-like growth factor-binding protein-2 in the serum of prostate cancer patients. , 1993, The Journal of clinical endocrinology and metabolism.

[38]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[39]  N. Bleehen,et al.  Production of immunoreactive insulin-like growth factor-I (IGF-I) and IGF-I binding proteins by human lung tumours. , 1990, British Journal of Cancer.

[40]  M. Preece,et al.  A monoclonal antibody to human insulin-like growth factor-I: characterization, use in radioimmunoassay and effect on the biological activities of the growth factor. , 1989, Journal of molecular endocrinology.

[41]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[42]  B. Efron How Biased is the Apparent Error Rate of a Prediction Rule , 1986 .

[43]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[44]  OU Wei-hua,et al.  Linear Model Selection by Cross-validation , 2009 .

[45]  P. Royston,et al.  Building Multivariable Regression Models with Continuous Covariates in Clinical Epidemiology , 2005, Methods of Information in Medicine.

[46]  Ronald,et al.  Learning representations by backpropagating errors , 2004 .

[47]  Gisbert Schneider,et al.  Support vector machine applications in bioinformatics. , 2003, Applied bioinformatics.

[48]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .