Extending CRISP-DM to incorporate temporal data mining of multidimensional medical data streams: A neonatal intensive care unit case study

Using a Neonatal Intensive Care Unit (NICU) case study, this work investigates the current CRoss Industry Standard Process for Data Mining (CRISP-DM) approach for modeling Intelligent Data Analysis (IDA)-based systems that perform temporal data mining (TDM). The case study highlights the need for an extended CRISP-DM approach when modeling clinical systems applying Data Mining (DM) and Temporal Abstraction (TA). As the number of such integrated TA/DM systems continues to grow, this limitation becomes significant and motivated our proposal of an extended CRISP-DM methodology to support TDM, known as CRISP-TDM. This approach supports clinical investigations on multi-dimensional time series data. This research paper has three key objectives: 1) Present a summary of the extended CRISP-TDM methodology; 2) Demonstrate the applicability of the proposed model to the NICU data, focusing on the challenges associated with multi-dimensional time series data; and 3) Describe the proposed IDA architecture for applying integrated TDM.

[1]  Carolyn McGregor,et al.  The e-Baby data warehouse: a case study , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[2]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[3]  Tu Bao Ho,et al.  Combining temporal abstraction and data mining to study hepatitis data (特集 「医療及び化学情報マイニング」および一般) , 2004 .

[4]  Carolyn McGregor,et al.  Compression of XML physiological data streams to support neonatal intensive care unit Web services , 2005, 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service.

[5]  Raymond Lister,et al.  The e-Babies Project: Integrated Data Monitoring and Decision Making in Neo-Natal Intensive Care , 2000, ECIS.

[6]  Andrew R. Post,et al.  Temporal data mining. , 2008, Clinics in laboratory medicine.

[7]  M. Stacey,et al.  An architecture for multi-dimensional temporal abstraction and its application to support neonatal intensive care , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[8]  James H Harrison,et al.  Introduction to the mining of clinical data. , 2008, Clinics in laboratory medicine.

[9]  Catherine Garbay,et al.  Multi‐level temporal abstraction for medical scenario construction , 2005 .

[10]  Maguelonne Teisseire,et al.  Mining spatio-temporal data , 2006, Journal of Intelligent Information Systems.