Early disease correlated protein detection using early response index (ERI)

Finding disease correlated proteins with significant expression changes before the clinical onset stage of disease is of great interest to researchers for developing early disease diagnosis biomarkers. Since disease correlated proteins have relatively low level of abundance change at early stages, it is hard to find them using existing bioinformatic tools in high throughput data, which has limited dynamic range and significant noise. Most existing biomarker discovery algorithms can only detect proteins with high abundance changes, and early disease diagnostic markers are frequently missed. We proposed a new ranking score called early response index (ERI) for prioritizing disease correlated protein as potential early diagnostic markers. Rather than classification accuracy, ERI estimates the average classification accuracy improvement achievable by proteins when they are combined with other proteins as features of classifiers. ERI is more sensitive to abundance changes than other ranking statistics. In a validation study to detect proteins with sustained expression changes from the pre-clinical onset to the clinical onset stage of multiples sclerosis using a mouse model, the proposed algorithm outperforms other tested algorithms in both sensitivity and specificity.

[1]  Jianqiu Zhang,et al.  MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry. , 2015, Rapid communications in mass spectrometry : RCM.

[2]  Nikola Bogunovic,et al.  A review of feature selection methods with applications , 2015, 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[3]  Itay Raphael,et al.  Microwave & Magnetic (M2) Proteomics Reveals CNS-Specific Protein Expression Waves that Precede Clinical Symptoms of Experimental Autoimmune Encephalomyelitis , 2014, Scientific Reports.

[4]  Kenji Suzuki,et al.  Artificial Neural Networks - Methodological Advances and Biomedical Applications , 2011 .

[5]  P. Massion,et al.  The State of Molecular Biomarkers for the Early Detection of Lung Cancer , 2012, Cancer Prevention Research.

[6]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Aidong Zhang,et al.  Cluster analysis for gene expression data: a survey , 2004, IEEE Transactions on Knowledge and Data Engineering.

[8]  Itay Raphael,et al.  Immunoenrichment microwave and magnetic proteomics for quantifying CD47 in the experimental autoimmune encephalomyelitis model of multiple sclerosis , 2012, Electrophoresis.

[9]  K. Aihara,et al.  Early Diagnosis of Complex Diseases by Molecular Biomarkers, Network Biomarkers, and Dynamical Network Biomarkers , 2014, Medicinal research reviews.

[10]  Morten Østergaard,et al.  Proteomic Strategies to Reveal Tumor Heterogeneity among Urothelial Papillomas * , 2002, Molecular & Cellular Proteomics.

[11]  Mathukumalli Vidyasagar,et al.  Identifying predictive features in drug response using machine learning: opportunities and challenges. , 2015, Annual review of pharmacology and toxicology.

[12]  Xuepo Ma,et al.  PeakLink: a new peptide peak linking method in LC-MS/MS using wavelet and SVM , 2014, Bioinform..

[13]  Xuefeng Bruce Ling,et al.  Multiclass cancer classification and biomarker discovery using GA-based algorithms , 2005, Bioinform..

[14]  Francesco Chiappelli,et al.  Biomarkers for early detection of high risk cancers: From gliomas to nasopharyngeal carcinoma , 2009, Bioinformation.

[15]  G. Li,et al.  An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers , 2002, Bioinform..

[16]  Paul Kearney,et al.  A Blood-Based Proteomic Classifier for the Molecular Characterization of Pulmonary Nodules , 2013, Science Translational Medicine.

[17]  E. Petricoin,et al.  Early detection: Proteomic applications for the early detection of cancer , 2003, Nature Reviews Cancer.

[18]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[19]  Xuepo Ma,et al.  Suppression correction and characteristic study in liquid chromatography/Fourier transform mass spectrometry measurements. , 2011, Rapid communications in mass spectrometry : RCM.

[20]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[21]  Robert Howarth,et al.  Cardiac myosin-binding protein C: a potential early biomarker of myocardial injury , 2015, Basic Research in Cardiology.

[22]  Itay Raphael,et al.  Microwave and magnetic (M2) proteomics of the experimental autoimmune encephalomyelitis animal model of multiple sclerosis , 2012, Electrophoresis.

[23]  Claudio Ronco,et al.  Early biomarkers of renal injury. , 2010, Congestive heart failure.

[24]  W. Ding,et al.  Early markers of obesity-related renal injury in childhood , 2014, Pediatric Nephrology.

[25]  R. Coral-Vázquez,et al.  Reduction of no-reflow and reperfusion injury with the synthetic 17β-aminoestrogen compound Prolame is associated with PI3K/Akt/eNOS signaling cascade , 2015, Basic Research in Cardiology.

[26]  Maxey C M Chung,et al.  Hepatocellular carcinoma: From bedside to proteomics , 2001, Proteomics.