Development of a large urban longitudinal HIV clinical cohort using a web-based platform to merge electronically and manually abstracted data from disparate medical record systems: technical challenges and innovative solutions

OBJECTIVE Electronic medical records (EMRs) are being increasingly utilized to conduct clinical and epidemiologic research in numerous fields. To monitor and improve care of HIV-infected patients in Washington, DC, one of the most severely affected urban areas in the United States, we developed a city-wide database across 13 clinical sites using electronic data abstraction and manual data entry from EMRs. MATERIALS AND METHODS To develop this unique longitudinal cohort, a web-based electronic data capture system (Discovere®) was used. An Agile software development methodology was implemented across multiple EMR platforms. Clinical informatics staff worked with information technology specialists from each site to abstract data electronically from each respective site's EMR through an extract, transform, and load process. RESULTS Since enrollment began in 2011, more than 7000 patients have been enrolled, with longitudinal clinical data available on all patients. Data sets are produced for scientific analyses on a quarterly basis, and benchmarking reports are generated semi-annually enabling each site to compare their participants' clinical status, treatments, and outcomes to the aggregated summaries from all other sites. DISCUSSION Numerous technical challenges were identified and innovative solutions developed to ensure the successful implementation of the DC Cohort. Central to the success of this project was the broad collaboration established between government, academia, clinics, community, information technology staff, and the patients themselves. CONCLUSIONS Our experiences may have practical implications for researchers who seek to merge data from diverse clinical databases, and are applicable to the study of health-related issues beyond HIV.

[1]  Paul J. Ranauro,et al.  Electronic medical record: research tool for pancreatic cancer? , 2014, The Journal of surgical research.

[2]  Heather N. Watson,et al.  Use of electronic medical records (EMR) for oncology outcomes research: assessing the comparability of EMR information to patient registry and health claims data , 2011, Clinical epidemiology.

[3]  Manisha Desai,et al.  Breast cancer treatment across health care systems: Linking electronic medical records and state registry data to enable outcomes research , 2014, Cancer.

[4]  G. Hartvigsen,et al.  Secondary Use of EHR: Data Quality Issues and Informatics Opportunities , 2010, Summit on translational bioinformatics.

[5]  J. Phair,et al.  The Multicenter AIDS Cohort Study: rationale, organization, and selected characteristics of the participants. , 1987, American journal of epidemiology.

[6]  Dipak Kalra,et al.  Data Resource Profile: Cardiovascular disease research using linked bespoke studies and electronic health records (CALIBER) , 2012, International journal of epidemiology.

[7]  S. Sidney,et al.  Do Protease Inhibitors Increase the Risk for Coronary Heart Disease in Patients With HIV‐1 Infection? , 2002, Journal of acquired immune deficiency syndromes.

[8]  Yuedong Wang,et al.  The MONitoring Dialysis Outcomes (MONDO) Initiative , 2013, Blood Purification.

[9]  Sheldon Brown,et al.  Veterans Aging Cohort Study (VACS): Overview and Description , 2006, Medical care.

[10]  Joseph Feldman,et al.  The Women's Interagency HIV Study , 1998 .

[11]  D. Ward,et al.  Changing conditions and treatments in a dynamic cohort of ambulatory HIV patients: the HIV outpatient study (HOPS). , 1999, Annals of epidemiology.

[12]  Amardeep Thind,et al.  Using your electronic medical record for research: a primer for avoiding pitfalls. , 2010, Family practice.

[13]  Richard C Wasserman,et al.  Electronic medical records (EMRs), epidemiology, and epistemology: reflections on EMRs and future pediatric clinical research. , 2011, Academic pediatrics.

[14]  R. Bosch,et al.  AIDS Clinical Trials Group Longitudinal Linked Randomized Trials (ALLRT): Rationale, Design, and Baseline Characteristics , 2008, HIV clinical trials.

[15]  Wendy A. Wolf,et al.  The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies , 2011, BMC Medical Genomics.

[16]  Ke Xiao,et al.  Implementation of an innovative, integrated electronic medical record (EMR) and public health information exchange for HIV/AIDS , 2011, J. Am. Medical Informatics Assoc..

[17]  Amardeep Thind,et al.  Implementing and maintaining a researchable database from electronic medical records: a perspective from an academic family medicine department. , 2009, Healthcare policy = Politiques de sante.

[18]  Choung-Soo Kim,et al.  Development of prostate cancer research database with the clinical data warehouse technology for direct linkage with electronic medical record system , 2013, Prostate international.

[19]  Richard D Moore,et al.  Cohort profile: the Centers for AIDS Research Network of Integrated Clinical Systems. , 2008, International journal of epidemiology.

[20]  M. Fava,et al.  QT interval and antidepressant use: a cross sectional study of electronic health records , 2013, BMJ.

[21]  B. Dean,et al.  Review: Use of Electronic Medical Records for Health Outcomes Research , 2009, Medical care research and review : MCRR.

[22]  Mohamud Daya,et al.  Electronic versus manual data processing: evaluating the use of electronic health records in out-of-hospital clinical research. , 2012, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[23]  Richard D Moore,et al.  Cohort profile: the North American AIDS Cohort Collaboration on Research and Design (NA-ACCORD). , 2007, International journal of epidemiology.

[24]  R. Wasserman,et al.  Comparative effectiveness research using the electronic medical record: an emerging area of investigation in pediatric primary care. , 2012, The Journal of pediatrics.