Metabolomic-based biomarker discovery for non-invasive lung cancer screening: A case study.

BACKGROUND Lung cancer (LC) is one of the leading lethal cancers worldwide, with an estimated 18.4% of all cancer deaths being attributed to the disease. Despite developments in cancer diagnosis and treatment over the previous thirty years, LC has seen little to no improvement in the overall five year survival rate after initial diagnosis. METHODS In this paper, we extended a recent study which profiled the metabolites in sputum from patients with lung cancer and age-matched volunteers smoking controls using flow infusion electrospray ion mass spectrometry. We selected key metabolites for distinguishing between different classes of lung cancer, and employed artificial neural networks and leave-one-out cross-validation to evaluate the predictive power of the identified biomarkers. RESULTS The neural network model showed excellent performance in classification between lung cancer and control groups with the area under the receiver operating characteristic curve of 0.99. The sensitivity and specificity of for detecting cancer from controls were 96% and 94% respectively. Furthermore, we have identified six putative metabolites that were able to discriminate between sputum samples derived from patients suffering small cell lung cancer (SCLC) and non-small cell lung cancer. These metabolites achieved excellent cross validation performance with a sensitivity of 80% and specificity of 100% for predicting SCLC. CONCLUSIONS These results indicate that sputum metabolic profiling may have potential for screening of lung cancer and lung cancer recurrence, and may greatly improve effectiveness of clinical intervention. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang.

[1]  Tobias Faust Pathology And Genetics Of Tumours Of The Lung Pleura Thymus And Heart , 2016 .

[2]  F. Aguayo,et al.  Analysis of aberrant methylation on promoter sequences of tumor suppressor genes and total DNA in sputum samples: a promising tool for early detection of COPD and lung cancer in smokers , 2012, Diagnostic Pathology.

[3]  Amoroso,et al.  Nephrology Dialysis Transplantation , 1999 .

[4]  Brian L Hood,et al.  Lung cancer serum biomarker discovery using glycoprotein capture and liquid chromatography mass spectrometry. , 2010, Journal of proteome research.

[5]  A. Shelton,et al.  Changes in neuronal activation patterns in response to androgen deprivation therapy: a pilot study , 2010, BMC Cancer.

[6]  J. Jankowski,et al.  Increased plasma phenylacetic acid in patients with end-stage renal failure inhibits iNOS expression. , 2003, The Journal of clinical investigation.

[7]  J. Jankowski,et al.  The uraemic toxin phenylacetic acid impairs macrophage function. , 2008, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[8]  M. Beckmann,et al.  The metabolomic detection of lung cancer biomarkers in sputum. , 2016, Lung cancer.

[9]  Enyou Li,et al.  Exhaled volatile organic compounds as lung cancer biomarkers during one-lung ventilation , 2014, Scientific Reports.

[10]  D. Timmerman,et al.  Using artificial neural networks to predict malignancy of ovarian tumors , 2001, 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[11]  Kevin Gleeson,et al.  Detection of lung cancer with volatile markers in the breath. , 2003, Chest.

[12]  Rich Caruana,et al.  Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.

[13]  E. Thunnissen,et al.  Molecular sputum analysis for the diagnosis of lung cancer , 2013, British Journal of Cancer.

[14]  H. Matsumoto,et al.  Fucosylation Is a Promising Target for Cancer Diagnosis and Therapy , 2012, Biomolecules.

[15]  M. Fiegl,et al.  Noninvasive detection of lung cancer by analysis of exhaled breath , 2009, BMC Cancer.

[16]  W. Hörl,et al.  The uraemic toxin phenylacetic acid contributes to inflammation by priming polymorphonuclear leucocytes. , 2013, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[17]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[18]  C. Mathers,et al.  Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008 , 2010, International journal of cancer.

[19]  Y. Wu,et al.  Artificial neural networks in mammography: application to decision making in the diagnosis of breast cancer. , 1993, Radiology.

[20]  D. Kavanagh,et al.  Is overexpression of HER-2 a predictor of prognosis in colorectal cancer? , 2009, BMC Cancer.

[21]  Jstor Journal of the Royal Statistical Society. Series D, (The statistician) , 1993 .

[22]  Elisabeth Brambilla,et al.  Pathology and genetics of tumours of the lung , pleura, thymus and heart , 2004 .

[23]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[24]  D. Samid,et al.  Phase I study of phenylacetate administered twice daily to patients with cancer , 1995, Cancer.

[25]  Hongwei Li,et al.  Metabolomic profiling of human serum in lung cancer patients using liquid chromatography/hybrid quadrupole time-of-flight mass spectrometry and gas chromatography/mass spectrometry , 2015, Journal of Cancer Research and Clinical Oncology.

[26]  A. Andermann,et al.  Revisiting Wilson and Jungner in the genomic age: a review of screening criteria over the past 40 years. , 2008, Bulletin of the World Health Organization.

[27]  A. Jemal,et al.  Cancer Statistics, 2010 , 2010, CA: a cancer journal for clinicians.

[28]  A. Lloyd,et al.  Evaluation of FTIR Spectroscopy as a diagnostic tool for lung cancer using sputum , 2010, BMC Cancer.

[29]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[30]  B. Bergman,et al.  Reactivity of monoclonal antibodies with ganglioside antigens in human small cell lung cancer tissues. , 2000, Lung cancer.

[31]  J. Lowe,et al.  Fucose: biosynthesis and biological function in mammals. , 2003, Glycobiology.

[32]  N. Shah,et al.  Glycine Decarboxylase Activity Drives Non-Small Cell Lung Cancer Tumor-Initiating Cells and Tumorigenesis , 2012 .

[33]  Paulo J. G. Lisboa,et al.  The Use of Artificial Neural Networks in Decision Support in Cancer: a Systematic Review , 2005 .

[34]  Ø. Langsrud,et al.  50–50 multivariate analysis of variance for collinear responses , 2002 .

[35]  R. Cataneo,et al.  Volatile organic compounds in breath as markers of lung cancer: a cross-sectional study , 1999, The Lancet.

[36]  R. Dybowski,et al.  Artificial neural networks in pathology and medical laboratories , 1995, The Lancet.