Data Mining for Cancer Management in Egypt Case Study: Childhood Acute Lymphoblastic Leukemia

Data mining aims to discover knowledge in data and present it in a form that humans can easily comprehend. One useful application in Egypt is cancer management, especially the management of Acute Lymphoblastic Leukemia (ALL), the most common type of cancer in children. This paper discusses the process of designing a prototype that can help in the management of childhood ALL, which is of great significance to the health care field. The work also has a social impact by helping to decrease the rate of the disease among children in Egypt, and it provides valuable information about the distribution and segmentation of ALL in Egypt, which may be linked to possible risk factors. Undirected knowledge discovery is used because, in this research project, there is no target field: the data provided is mainly subjective. The undirected approach is applied in order to quantify the subjective variables. The computer is therefore asked to identify significant patterns in the provided medical data about ALL. This is achieved by collecting the data necessary for the system, determining the data mining technique to be used, and choosing the most suitable implementation tool for the domain. The research uses a data mining tool, Clementine, to apply the Decision Trees technique. The tool is fed with data extracted from real-life cases taken from specialized cancer institutes. Relevant details of the medical cases, such as patient medical history and diagnosis, are analyzed, classified, and clustered in order to improve the disease
