Pivoting approaches for bulk extraction of Entity-Attribute-Value data

Entity-Attribute-Value (EAV) data, as found in repositories of clinical patient data, must be transformed (pivoted) into one-column-per-parameter format before it can be used by a variety of analytical programs. Pivoting approaches have not been described in depth in the literature, and the descriptions that exist are dated. We describe and benchmark three alternative algorithms for pivoting clinical data in the context of a clinical study data management system. We conclude that when the number of attributes to be returned is not too large, it is feasible to use static SQL as the basis for views on the data. An alternative but more complex approach, which uses hash tables and relies on abundant random-access memory, can achieve better performance by reducing the load on the database server.
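To make the two ideas above concrete, the following is a minimal sketch, not the algorithms benchmarked in the paper. It assumes a hypothetical three-column table `eav(entity, attribute, value)` and invented attribute names (`systolic_bp`, `heart_rate`, `weight_kg`). The first function pivots with a fixed (static) SQL statement, here written with conditional aggregation as one possible formulation; the second fetches the raw triples and pivots them in memory with a hash table (a Python dictionary), which moves the work off the database server.

```python
import sqlite3
from collections import defaultdict

# Hypothetical EAV triples: (entity, attribute, value).
EAV_ROWS = [
    (1, "systolic_bp", 120),
    (1, "heart_rate", 72),
    (2, "systolic_bp", 135),
    (2, "weight_kg", 80.5),
]


def pivot_with_static_sql(rows):
    """Pivot inside the database using a fixed SQL statement.

    The attribute list is hard-coded in the query, which is what makes
    it 'static' and therefore usable as the body of a view.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE eav (entity INTEGER, attribute TEXT, value REAL)")
    conn.executemany("INSERT INTO eav VALUES (?, ?, ?)", rows)
    cursor = conn.execute(
        """
        SELECT entity,
               MAX(CASE WHEN attribute = 'systolic_bp' THEN value END) AS systolic_bp,
               MAX(CASE WHEN attribute = 'heart_rate' THEN value END) AS heart_rate
        FROM eav
        GROUP BY entity
        ORDER BY entity
        """
    )
    result = cursor.fetchall()
    conn.close()
    return result


def pivot_with_hash_table(rows, attributes):
    """Pivot in application memory using a dictionary keyed by entity.

    Attributes not in `attributes` are skipped; a missing value becomes None.
    If an entity has several values for one attribute, the last one wins.
    """
    wanted = set(attributes)
    by_entity = defaultdict(dict)
    for entity, attribute, value in rows:
        if attribute in wanted:
            by_entity[entity][attribute] = value
    return [
        (entity, *[values.get(a) for a in attributes])
        for entity, values in sorted(by_entity.items())
    ]


sql_result = pivot_with_static_sql(EAV_ROWS)
hash_result = pivot_with_hash_table(EAV_ROWS, ["systolic_bp", "heart_rate"])
```

In this sketch both functions return one tuple per entity with one position per requested attribute. The static query grows by one expression per attribute, which is consistent with the observation that the SQL approach is practical only while the attribute count stays moderate.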
