Biomarker Identification and Rule Extraction from Mass Spectral Serum Profiles

In this paper, we introduce a novel feature selection method that combines ant colony optimization (ACO) with support vector machine (SVM) to identify candidate biomarkers from mass spectral serum profiles. In addition, we present an innovative rule extraction algorithm that uses ACO to select accurate if-then rules for the classification of mass spectra. We applied the proposed feature selection and rule extraction methods to identify candidate biomarkers and extract if-then classification rules from MALDI-TOF spectra of enriched serum. The candidate biomarkers and the associated rules distinguished hepatocellular carcinoma patients from matched controls with high sensitivity and specificity

[1]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[2]  Habtom W. Ressom,et al.  Analysis of mass spectral serum profiles for biomarker selection , 2005, Bioinform..

[3]  Tamer Hifnawy,et al.  Associations of pesticides, HCV, HBV, and hepatocellular carcinoma in Egypt. , 2005, International journal of hygiene and environmental health.

[4]  Lothar Litz,et al.  Learning Premises of Fuzzy Rules for Knowledge Acquisition in Classification Problems , 2002, Knowledge and Information Systems.

[5]  Luca Maria Gambardella,et al.  Ant Algorithms for Discrete Optimization , 1999, Artificial Life.

[6]  Habtom W. Ressom,et al.  Analysis of MALDI-TOF Serum Profiles for Biomarker Selection and Sample Classification , 2005, 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[7]  T. Veenstra,et al.  Characterization of the Low Molecular Weight Human Serum Proteome*S , 2003, Molecular & Cellular Proteomics.

[8]  Jiebo Luo,et al.  Data Mining. Multimedia, Soft Computing, and Bioinformatics , 2005, IEEE Transactions on Neural Networks.

[9]  Richard Weber,et al.  Fuzzy-ID3: A class of methods for automatic knowledge acquisition , 1992 .

[10]  Huiqing Liu,et al.  Simple rules underlying gene expression profiles of more than six subtypes of acute lymphoblastic leukemia (ALL) patients , 2003, Bioinform..

[11]  E. Holland,et al.  Serum peptide profiling by magnetic particle-assisted, automated sample processing and MALDI-TOF mass spectrometry. , 2004, Analytical chemistry.

[12]  Elena Marchiori,et al.  Feature selection in proteomic pattern data with support vector machines , 2004, 2004 Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[13]  David Ward,et al.  Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data , 2003, Bioinform..

[14]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian cancer , 2002, The Lancet.

[15]  Hisao Ishibuchi,et al.  Adaptive fuzzy rule-based classification systems , 1996, IEEE Trans. Fuzzy Syst..

[16]  Yoichi Hayashi,et al.  Fuzzy neural expert system with automated extraction of fuzzy If-Then rules from a trained neural network , 1990, [1990] Proceedings. First International Symposium on Uncertainty Modeling and Analysis.

[17]  Chad Creighton,et al.  Mining gene expression databases for association rules , 2003, Bioinform..

[18]  Jan Komorowski,et al.  Learning Rule-based Models of Biological Process from Gene Expression Time Profiles Using Gene Ontology , 2003, Bioinform..

[19]  Nada Lavrac,et al.  Induction of comprehensible models for gene expression datasets by subgroup discovery methodology , 2004, J. Biomed. Informatics.

[20]  D. Chan,et al.  Proteomics and bioinformatics approaches for identification of serum biomarkers to detect breast cancer. , 2002, Clinical chemistry.

[21]  Chin-Teng Lin,et al.  Neural-Network-Based Fuzzy Logic Control and Decision System , 1991, IEEE Trans. Computers.

[22]  J. Glimm,et al.  Detection of cancer-specific markers amid massive mass spectral data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23]  H. Ishibuchi,et al.  Distributed representation of fuzzy rules and its application to pattern classification , 1992 .

[24]  M. Shigenaga,et al.  Evaluation of a novel, integrated approach using functionalized magnetic beads, bench-top MALDI-TOF-MS with prestructured sample supports, and pattern recognition software for profiling potential biomarkers in human plasma. , 2004, Journal of biomolecular techniques : JBT.

[25]  M. Shaw,et al.  Induction of fuzzy decision trees , 1995 .

[26]  Sushmita Mitra,et al.  Data Mining , 2003 .

[27]  Hisao Ishibuchi,et al.  Single-objective and two-objective genetic algorithms for selecting linguistic rules for pattern classification problems , 1997, Fuzzy Sets Syst..

[28]  Rency S Varghese,et al.  Enrichment of low molecular weight fraction of serum for MS analysis of peptides associated with hepatocellular carcinoma , 2006, Proteomics.

[29]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[30]  A. Olshen,et al.  Differential exoprotease activities confer tumor-specific serum peptidome patterns. , 2005, The Journal of clinical investigation.