Data Mining Approach to Evaluating the use of Skin Surface Electropotentials for Breast Cancer Detection

The Biofield Diagnostic System (BDS) uses a score formed with measured skin surface electropotentials and a prior Level Of Suspicion (LOS) value (predicted by the physician based on the patient's ultrasound or mammography results) to calculate a revised Post-BDS LOS to indicate the presence of breast cancer. The demographic details, BDS test results, and the recorded electropotential values form a potentially useful dataset, which can be further explored with data mining tools to extract important information that can be used to improve the current predictive accuracy of the device. According to the proposed data mining framework, the BDS dataset with 291 cases was first pre-processed to remove outliers and then used to select relevant and informative features for classifier development and finally to evaluate the capability of the built classifiers in detecting the presence of the disease. Two popular feature selection techniques, namely, the filter and wrapper methods, were used in parallel for feature selection. A few statistical inference based classifiers and neural networks were used for classification. The proposed technique significantly improved the BDS prediction accuracy. Also, the use of prior LOS and, hence, the Post-BDS LOS, associates a mild subjective interpretation to the current prediction methodology used by BDS. However, the feature subset selected in our analysis that gave the best accuracy did not use either of these features. This result indicates the possibility of using BDS as a better objective assessment tool for breast cancer detection.

[1]  Martina Dickhaut,et al.  The Value of BBE in the Assessment of Breast Lesions , 1996 .

[2]  Susumu Yamaguchi,et al.  Prospective Evaluation of Skin Surface Electropotentials in Japanese Patients with Suspicious Breast Lesions , 1996, Japanese journal of cancer research : Gann.

[3]  S Vinitha Sree,et al.  The Use of Tissue Electrical Characteristics for Breast Cancer Detection: A Perspective Review , 2008, Technology in cancer research & treatment.

[4]  Richard J. Davies,et al.  Underlying Mechanisms Involved in Surface Electrical Potential Measurements for the Diagnosis of Breast Cancer: An Electrophysiological Approach to Cancer Diagnosis , 1996 .

[5]  Jeremy Foster,et al.  Understanding and Using Advanced Statistics , 2005 .

[6]  R J Davies,et al.  Transmural electrical potential difference as an early marker in colon cancer. , 1986, Archives of surgery.

[7]  H WittenIan,et al.  The WEKA data mining software , 2009 .

[8]  Volker Barth,et al.  Electropotential measurements as a new diagnostic modality for breast cancer , 1998, The Lancet.

[9]  R Holland,et al.  Electropotential evaluation as a new technique for diagnosing breast lesions. , 1997, European journal of radiology.

[10]  Ron Kohavi,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998 .

[11]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[12]  Joseph P. Crowe,et al.  Use of Non-Directed (Screening) Arrays in the Evaluation of Symptomatic and Asymptomatic Breast Patients , 1996 .

[13]  H. Gallager,et al.  Early phases in the development of breast cancer , 1969, Cancer.

[14]  J. M. Dixon Electropotentials in the Clinical Assessment of Breast Neoplasia , 1996, ESO Monographs.

[15]  David W. Aha,et al.  Feature Weighting for Lazy Learning Algorithms , 1998 .

[16]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.