Mining approximate temporal functional dependencies with pure temporal grouping in clinical databases

Functional dependencies (FDs) typically represent associations over facts stored by a database, such as "patients with the same symptom get the same therapy." In more recent years, some extensions have been introduced to represent both temporal constraints (temporal functional dependencies - TFDs), as "for any given month, patients with the same symptom must have the same therapy, but their therapy may change from one month to the next one," and approximate properties (approximate functional dependencies - AFDs), as "patients with the same symptomgenerallyhave the same therapy." An AFD holds most of the facts stored by the database, enabling some data to deviate from the defined property: the percentage of data which violate the given property is user-defined. According to this scenario, in this paper we introduce approximate temporal functional dependencies (ATFDs) and use them to mine clinical data. Specifically, we considered the need for deriving new knowledge from psychiatric and pharmacovigilance data. ATFDs may be defined and measured either on temporal granules (e.g.grouping data by day, week, month, year) or on sliding windows (e.g.a fixed-length time interval which moves over the time axis): in this regard, we propose and discuss some specific and efficient data mining techniques for ATFDs. We also developed two running prototypes and showed the feasibility of our proposal by mining two real-world clinical data sets. The clinical interest of the dependencies derived considering the psychiatry and pharmacovigilance domains confirms the soundness and the usefulness of the proposed techniques.

[1]  E. F. Codd,et al.  Normalized data base structure: a brief tutorial , 1971, SIGFIDET '71.

[2]  Gabriela Ochoa,et al.  A PSO/ACO approach to knowledge discovery in a pharmacovigilance context , 2009, GECCO '09.

[3]  Jean-Marc Petit,et al.  Functional and approximate dependency mining: database and FCA points of view , 2002, J. Exp. Theor. Artif. Intell..

[4]  Carlo Combi,et al.  Querying temporal clinical databases on granular trends , 2012, J. Biomed. Informatics.

[5]  M. Lindquist,et al.  Signal Selection and Follow-Up in Pharmacovigilance , 2002, Drug safety.

[6]  Heikki Mannila,et al.  On the Complexity of Inferring Functional Dependencies , 1992, Discret. Appl. Math..

[7]  Victor Vianu Dynamic functional dependencies and database aging , 1987, JACM.

[8]  Pietro Sala,et al.  Mining Approximate Temporal Functional Dependencies Based on Pure Temporal Grouping , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[9]  LarizzaCristiana,et al.  Data mining with Temporal Abstractions , 2007 .

[10]  Massimo Franceschet,et al.  Representing and Reasoning about Temporal Granularities , 2004, J. Log. Comput..

[11]  Angelo Montanari,et al.  The t4sql temporal query language , 2007, CIKM '07.

[12]  Carlo Combi,et al.  Data mining with Temporal Abstractions: learning rules from time series , 2007, Data Mining and Knowledge Discovery.

[13]  E. F. Codd,et al.  Normalized Data Structure: A Brief Tutorial. , 1971 .

[14]  Hannu Toivonen,et al.  Efficient discovery of functional and approximate dependencies using partitions , 1998, Proceedings 14th International Conference on Data Engineering.

[15]  Christian S. Jensen,et al.  Extending Existing Dependency Theory to Temporal Databases , 1996, IEEE Trans. Knowl. Data Eng..

[16]  Carlo Combi,et al.  Modeling and Querying Temporal Semistructured Data , 2009, New Trends in Data Warehousing and Data Analysis.

[17]  Hannu Toivonen,et al.  TANE: An Efficient Algorithm for Discovering Functional and Approximate Dependencies , 1999, Comput. J..

[18]  Heikki Mannila,et al.  Approximate Inference of Functional Dependencies from Relations , 1995, Theor. Comput. Sci..

[19]  Jef Wijsen,et al.  Temporal FDs on complex objects , 1999, TODS.

[20]  Sushil Jajodia,et al.  Logical design for temporal databases with multiple granularities , 1997, TODS.

[21]  Angelo Montanari,et al.  A Uniform Framework for Temporal Functional Dependencies with Multiple Granularities , 2011, SSTD.

[22]  Riccardo Bellazzi,et al.  Temporal data mining for the quality assessment of hemodialysis services , 2005, Artif. Intell. Medicine.