论文信息 - The Statistical Classification of Breast Cancer Data

The Statistical Classification of Breast Cancer Data

In this article, we study the statistical classification of breast cancer of two well-known large breast cancer databases. We use various classification rules, such as linear, quadratic, logistic, k nearest neighbor (k-NN), and k rank nearest neighbor (k-RNN) rules and compare the performances. We also conduct feature analysis for both data sets using logistic regression model.

[1] W. N. Street,et al. Computer-derived nuclear features distinguish malignant from benign breast cytology. , 1995, Human pathology.

[2] Richard A. Johnson,et al. Applied Multivariate Statistical Analysis , 1983 .

[3] O. Mangasarian,et al. Robust linear programming discrimination of two linearly inseparable sets , 1992 .

[4] Kuhu Pal,et al. Breast cancer detection using rank nearest neighbor classification rules , 2003, Pattern Recognit..

[5] O. Mangasarian,et al. Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[6] Jianping Zhang,et al. Selecting Typical Instances in Instance-Based Learning , 1992, ML.

[7] Olvi L. Mangasarian,et al. Nuclear feature extraction for breast tumor diagnosis , 1993, Electronic Imaging.

[8] Peter E. Hart,et al. Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[9] W. N. Street,et al. Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. , 1994, Cancer letters.

[10] William Nick Street,et al. Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[11] M. C. Jones,et al. E. Fix and J.L. Hodges (1951): An Important Contribution to Nonparametric Discriminant Analysis and Density Estimation: Commentary on Fix and Hodges (1951) , 1989 .

[12] B. Everitt,et al. Applied Multivariate Data Analysis. , 1993 .