Ovarian cancer detection by logical analysis of proteomic data

A new type of efficient and accurate proteomic ovarian cancer diagnosis systems is proposed. The system is developed using the combinatorics and optimization‐based methodology of logical analysis of data (LAD) to the Ovarian Dataset 8‐7‐02 (http://clinicalproteomics.steem.com), which updates the one used by Petricoin et al. in The Lancet 2002, 359, 572–577. This mass spectroscopy‐generated dataset contains expression profiles of 15 154 peptides defined by their mass/charge ratios (m/z) in serum of 162 ovarian cancer and 91 control cases. Several fully reproducible models using only 7–9 of the 15 154 peptides were constructed, and shown in multiple cross‐validation tests (k‐folding and leave‐one‐out) to provide sensitivities and specificities of up to 100%. A special diagnostic system for stage I ovarian cancer patients is shown to have similarly high accuracy. Other results: (i) expressions of peptides with relatively low m/z values in the dataset are shown to be better at distinguishing ovarian cancer cases from controls than those with higher m/z values; (ii) two large groups of patients with a high degree of similarities among their formal (mathematical) profiles are detected; (iii) several peptides with a blocking or promoting effect on ovarian cancer are identified.

[1]  Y. Crama,et al.  Cause-effect relationships and partially defined Boolean functions , 1988 .

[2]  C. Perez,et al.  Principles and Practice of Gynecologic Oncology , 1992 .

[3]  Gilbert Laporte,et al.  Annals of Operations Research , 1996 .

[4]  Toshihide Ibaraki,et al.  Logical analysis of numerical data , 1997, Math. Program..

[5]  Toshihide Ibaraki,et al.  An Implementation of Logical Analysis of Data , 2000, IEEE Trans. Knowl. Data Eng..

[6]  B. Kramer,et al.  Trends in biomarker research for cancer detection. , 2001, The Lancet. Oncology.

[7]  D. Fishman,et al.  Three-dimensional power Doppler ultrasound improves the diagnostic accuracy for ovarian cancer prediction. , 2001, Gynecologic oncology.

[8]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian cancer , 2002, The Lancet.

[9]  E. Petricoin,et al.  Proteomics of human breast ductal carcinoma in situ. , 2002, Cancer research.

[10]  E. Petricoin,et al.  Serum proteomic patterns for detection of prostate cancer. , 2002, Journal of the National Cancer Institute.

[11]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[12]  Robert E. Brown HER-2/neu-Positive breast carcinoma: molecular concomitants by proteomic analysis and their therapeutic implications. , 2002, Annals of clinical and laboratory science.

[13]  Ying Liu,et al.  The Maximum Box Problem and its Application to Data Analysis , 2002, Comput. Optim. Appl..

[14]  E. Sauter,et al.  Proteomic analysis of nipple aspirate fluid to detect biologic markers of breast cancer , 2002, British Journal of Cancer.

[15]  A contribution to breast cancer cell proteomics: Detection of new sequences , 2002, Proteomics.

[16]  P. Schellhammer,et al.  Boosted decision tree analysis of surface-enhanced laser desorption/ionization mass spectral serum profiles discriminates prostate cancer from noncancer patients. , 2002, Clinical chemistry.

[17]  K. Possinger,et al.  Nuclear matrix proteins as biomarkers for breast cancer , 2002, Expert review of molecular diagnostics.

[18]  E. Fung,et al.  Proteomic approaches to tumor marker discovery. , 2002, Archives of pathology & laboratory medicine.

[19]  George L. Wright,et al.  SELDI proteinchip MS: a platform for biomarker discovery and cancer diagnosis. , 2002, Expert review of molecular diagnostics.

[20]  E. Petricoin,et al.  Clinical potential of proteomics in the diagnosis of ovarian cancer , 2002, Expert review of molecular diagnostics.

[21]  B. Ye,et al.  New technologies for the identification of markers for early detection of ovarian cancer , 2003, Current opinion in obstetrics & gynecology.

[22]  Peter L. Hammer,et al.  Coronary Risk Prediction by Logical Analysis of Data , 2003, Ann. Oper. Res..

[23]  Peter L. Hammer,et al.  Accelerated algorithm for pattern detection in logical analysis of data , 2006, Discret. Appl. Math..

[24]  Peter L. Hammer,et al.  Spanned patterns for the logical analysis of data , 2006, Discret. Appl. Math..