JTSA: An open source framework for time series abstractions

BACKGROUND AND OBJECTIVE The evaluation of the clinical status of a patient is frequently based on the temporal evolution of some parameters, making the detection of temporal patterns a priority in data analysis. Temporal abstraction (TA) is a methodology widely used in medical reasoning for summarizing and abstracting longitudinal data. METHODS This paper describes JTSA (Java Time Series Abstractor), a framework including a library of algorithms for time series preprocessing and abstraction and an engine to execute a workflow for temporal data processing. The JTSA framework is grounded on a comprehensive ontology that models temporal data processing both from the data storage and the abstraction computation perspective. The JTSA framework is designed to allow users to build their own analysis workflows by combining different algorithms. Thanks to the modular structure of a workflow, simple to highly complex patterns can be detected. The JTSA framework has been developed in Java 1.7 and is distributed under GPL as a jar file. RESULTS JTSA provides: a collection of algorithms to perform temporal abstraction and preprocessing of time series, a framework for defining and executing data analysis workflows based on these algorithms, and a GUI for workflow prototyping and testing. The whole JTSA project relies on a formal model of the data types and of the algorithms included in the library. This model is the basis for the design and implementation of the software application. Taking into account this formalized structure, the user can easily extend the JTSA framework by adding new algorithms. Results are shown in the context of the EU project MOSAIC to extract relevant patterns from data coming related to the long term monitoring of diabetic patients. CONCLUSIONS The proof that JTSA is a versatile tool to be adapted to different needs is given by its possible uses, both as a standalone tool for data summarization and as a module to be embedded into other architectures to select specific phenotypes based on TAs in a large dataset.

[1]  David Levine,et al.  The Analytic Information Warehouse (AIW): A platform for analytics using electronic health record data , 2013, J. Biomed. Informatics.

[2]  Evert de Jonge,et al.  Temporal abstraction for feature extraction: A comparative case study in prediction from intensive care monitoring data , 2007, Artif. Intell. Medicine.

[3]  Joel H. Saltz,et al.  Temporal Abstraction-based Clinical Phenotyping with Eureka! , 2013, AMIA.

[4]  Carlo Combi,et al.  Visually defining and querying consistent multi-granular clinical temporal abstractions , 2012, Artif. Intell. Medicine.

[5]  Riccardo Bellazzi,et al.  Intelligent analysis of clinical time series: an application in the diabetes mellitus domain , 2000, Artif. Intell. Medicine.

[6]  Andrew R. Post,et al.  Model Formulation: PROTEMPA: A Method for Specifying and Identifying Temporal Sequences in Retrospective Data for Patient Selection , 2007, J. Am. Medical Informatics Assoc..

[7]  Cristiana Larizza,et al.  M-HTP: A system for monitoring heart transplant patients , 1992, Artif. Intell. Medicine.

[8]  Riccardo Bellazzi,et al.  Precedence Temporal Networks to represent temporal relationships in gene expression data , 2007, J. Biomed. Informatics.

[9]  Yuval Shahar,et al.  Distributed, intelligent, interactive visualization and exploration of time-oriented clinical data and their abstractions , 2006, Artif. Intell. Medicine.

[10]  Alberto Riva,et al.  Temporal Abstractions for Interpreting Diabetic Patients Monitoring Data , 1998, Intell. Data Anal..

[11]  M A Musen,et al.  RESUME: a temporal-abstraction system for patient monitoring. , 1993, Computers and biomedical research, an international journal.

[12]  Yuval Shahar,et al.  A Framework for Knowledge-Based Temporal Abstraction , 1997, Artif. Intell..

[13]  Massimo Franceschet,et al.  Representing and Reasoning about Temporal Granularities , 2004, J. Log. Comput..

[14]  Jim Hunter,et al.  TSNet - A Distributed Architecture for Time Series Analysis , 2008, Computer-based Medical Guidelines and Protocols.

[15]  Riccardo Bellazzi,et al.  TimeClust: a clustering tool for gene expression time series , 2008, Bioinform..

[16]  Yuval Shahar,et al.  Evaluation of KNAVE-II: a Tool for Intelligent Query and Exploration of Patient Data , 2004, MedInfo.

[17]  Yuval Shahar,et al.  Intelligent selection and retrieval of multiple time-oriented records , 2010, Journal of Intelligent Information Systems.

[18]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[19]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[20]  Michael Stonebraker,et al.  The Morgan Kaufmann Series in Data Management Systems , 1999 .

[21]  Blaz Zupan,et al.  TA-clustering: Cluster analysis of gene expression profiles through Temporal Abstractions , 2005, Int. J. Medical Informatics.

[22]  Samson W. Tu,et al.  The Chronus II temporal database mediator , 2002, AMIA.

[23]  Carolyn McGregor,et al.  Temporal abstraction in intelligent clinical data analysis: A survey , 2007, Artif. Intell. Medicine.

[24]  Ben Shneiderman,et al.  A Visual Interface for Multivariate Temporal Data: Finding Patterns of Events across Multiple Histories , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[25]  Luca Chittaro,et al.  Visualizing queries on databases of temporal histories: new metaphors and their evaluation , 2003, Data Knowl. Eng..

[26]  Samson W. Tu,et al.  RASTA: A Distributed Temporal Abstraction System to Facilitate Knowledge-Driven Monitoring of Clinical Databases , 2001, MedInfo.

[27]  Yuval Shahar,et al.  Intelligent visualization and exploration of time-oriented data of multiple patients , 2010, Artif. Intell. Medicine.

[28]  Ralph B D'Agostino,et al.  Fasting and postchallenge glycemia and cardiovascular disease risk: the Framingham Offspring Study. , 2002, Diabetes care.

[29]  Riccardo Bellazzi,et al.  Temporal data mining for the quality assessment of hemodialysis services , 2005, Artif. Intell. Medicine.

[30]  James F. Allen Towards a General Theory of Action and Time , 1984, Artif. Intell..

[31]  C. Larizza,et al.  An HTTP based server for Temporal Abstractions , 1999 .

[32]  Elpida T. Keravnou,et al.  Temporal abstraction and temporal Bayesian networks in clinical domains: A survey , 2014, Artif. Intell. Medicine.

[33]  Sushil Jajodia,et al.  A general framework for time granularity and its application to temporal reasoning , 1998, Annals of Mathematics and Artificial Intelligence.

[34]  Carlo Combi,et al.  Data mining with Temporal Abstractions: learning rules from time series , 2007, Data Mining and Knowledge Discovery.