Implementation of various data processing and evaluation techniques on ICDDR, B surveillance data to generate optimal decision tree for patients classification

The surveillance system established in ICDDR,B collect information of diarrhoeal disease. Our research focuses on generating decision tree models to categorise diarrhoeal patients according to the severance of disease. From the decision tree generated based on earlier cases stored in the surveillance data, decision rules are generated. These rules are used to classify patients into three classes according to their criticality: high, mid, low. This would help the hospital authority to take prudent actions on critical patients. Different techniques are used to build an optimal decision tree by considering various set of data, and generated trees are compared with various performance metrics, e.g., accuracy, precision, recall, area under ROC curve, etc.

[1]  Jie Wang,et al.  Combination Data Mining Methods with New Medical Data to Predicting Outcome of Coronary Heart Disease , 2007, 2007 International Conference on Convergence Information Technology (ICCIT 2007).

[2]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[3]  Stephen E Brossette,et al.  Data mining and infection control. , 2008, Clinics in laboratory medicine.

[4]  W. T. Jones,et al.  Application of data mining to intensive care unit microbiologic data. , 1999, Emerging infectious diseases.

[5]  James L. Larson A decision tree for the early diagnosis of acute myocardial infarction in non-traumatic chest pain patients at hospital admission , 1996 .

[6]  Hsinchun Chen,et al.  Medical Data Mining on the Internet: Research on a Cancer Information System , 1999, Artificial Intelligence Review.

[7]  Max Bramer,et al.  Principles of Data Mining , 2013, Undergraduate Topics in Computer Science.

[8]  Lili Ma,et al.  A Framework for Infection Control Surveillance Using Association Rules , 2003, AMIA.

[9]  James P. Ignizio An introduction to expert systems : the development and implementation of rule-based expert systems , 1991 .

[10]  Max Bramer Principles of Data Mining , 2013, Undergraduate Topics in Computer Science.

[11]  Nicos Maglaveras,et al.  Mining Association Rules from Clinical Databases: An Intelligent Diagnostic Process in Healthcare , 2001, MedInfo.

[12]  Evelina Lamma,et al.  Artificial intelligence techniques for monitoring dangerous infections , 2006, IEEE Transactions on Information Technology in Biomedicine.

[13]  Jack J. Dongarra,et al.  MPI collective algorithm selection and quadtree encoding , 2007, Parallel Comput..

[14]  Roman Timofeev,et al.  Classification and Regression Trees(CART)Theory and Applications , 2004 .

[15]  J. Mair,et al.  A decision tree for the early diagnosis of acute myocardial infarction in nontraumatic chest pain patients at hospital admission. , 1995, Chest.

[16]  J. Hardin,et al.  Association rules and data mining in hospital infection control and public health surveillance. , 1998, Journal of the American Medical Informatics Association : JAMIA.

[17]  Vincent S. Tseng,et al.  An Integrated Data Mining System for Patient Monitoring with Applications on Asthma Care , 2008, 2008 21st IEEE International Symposium on Computer-Based Medical Systems.

[18]  Ignizio Introduction to expert systems , 1985 .

[19]  S. Jackson,et al.  Data‐mining of medication records to improve asthma management , 2008, The Medical journal of Australia.

[20]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[21]  Jack J. Dongarra,et al.  Decision Trees and MPI Collective Algorithm Selection Problem , 2007, Euro-Par.

[22]  Ivan Bratko,et al.  Machine Learning and Data Mining; Methods and Applications , 1998 .