Database research in transfusion medicine: The power of large numbers

T he article by Edgren and colleagues in this issue of TRANSFUSION presents information about the expansion of a previously constructed linked blood donor–transfusion recipient database, the SCANdinavian Donations And Transfusions database (SCANDAT). The “original” SCANDAT consisted of computerized records of blood donation and transfusion activities dating back to the mid-1960s in Sweden and the early 1980s in Denmark with long-term follow-up data from national health registries extending through 2002. This database included data on more than one million blood donors linked to more than 1.3 million transfused recipients. Health outcomes were retrieved by record linkage to Swedish and Danish nationwide data health registries on cancer occurrence, hospital care, and mortality. This linkage was made possible by the use of a national registration number for all health-related activities, thereby providing thorough follow-up of donors and recipients for multiple (long-term) health outcomes. Previous SCANDAT accomplishments include determination of cancer risks in volunteer blood donors compared with general population data, evaluation of the potential transmissibility of cancer to transfusion recipients from donors who subsequently develop cancer, and evaluation of recipient posttransfusion survival stratified by a number of variables including red blood cell (RBC) storage age. The second version of SCANDAT has added another 8 to 10 years of data, thereby extending the original observations through 2010 to 2012. SCANDAT2 contains 25.5 million donation records, 21.3 million transfusion records, 3.7 million unique persons, and 40 million person-years of follow-up. Data quality is high as there is a more than 97% concordance with official annual statistics on blood donations and transfusions, and 96% of all transfusions in Sweden (94% in Denmark) are linkable to their respective donation(s). This robust linkage will enable investigators to study disease concordance between donors and recipients and to make inferences about disease transmissibility. This capability is enhanced by SCANDAT2’s impressive number of transfused components and unprecedented longterm follow-up morbidity and mortality data. As SCANDAT2 illustrates, the routine collection of massive amounts of clinical data in large multiinstitution databases has become increasingly common across multiple medical and surgical specialties. These databases come in two main flavors: 1) administrative databases whose primary purpose is for reimbursement and insurance purposes and 2) clinical registries that focus on collecting data on patients with a given disease or undergoing a specific procedure. As in other medical specialties, the use of complex databases that extract data from various sources are becoming more common in transfusion medicine (TM), where such databases can be used for hemovigilance and monitoring purposes and blood management evaluations or to address key research questions. SCANDAT and SCANDAT2 are excellent examples of how TM research questions can be approached using a large multicenter database. The remainder of this editorial will further review how TM research can benefit from such large database queries and will address some important considerations in this type of research. We have organized our discussion in order of the complexity of the efforts needed to compile the databases, although none of these efforts should be characterized as uncomplicated or straightforward!

[1]  G. Bonsel,et al.  Survival after transfusion in the Netherlands , 2011, Vox sanguinis.

[2]  M. Melbye,et al.  Survival after blood transfusion , 2008, Transfusion.

[3]  Sunil V. Rao,et al.  The implications of blood transfusions for patients with non-ST-segment elevation acute coronary syndromes: results from the CRUSADE National Quality Improvement Initiative. , 2005, Journal of the American College of Cardiology.

[4]  S. Glynn,et al.  The National Heart, Lung, and Blood Institute Recipient Epidemiology and Donor Evaluation Study (REDS‐III): a research program striving to improve blood donor and transfusion recipient outcomes , 2014, Transfusion.

[5]  S. Stramer,et al.  Current prevalence and incidence of infectious disease markers and estimated window‐period risk in the American Red Cross blood donor population , 2002, Transfusion.

[6]  L. Wideroff,et al.  Improving health profile of blood donors as a consequence of transfusion safety efforts , 2007, Transfusion.

[7]  P. Mintz,et al.  Transfusion‐related acute lung injury and potential risk factors among the inpatient US elderly as recorded in Medicare claims data, during 2007 through 2011 , 2014, Transfusion.

[8]  S. Glynn,et al.  The National Heart, Lung, and Blood Institute retrovirus epidemiology donor studies (Retrovirus Epidemiology Donor Study and Retrovirus Epidemiology Donor Study-II): twenty years of research to advance blood product safety and availability. , 2012, Transfusion medicine reviews.

[9]  R. Cartier,et al.  Evaluation of care and surveillance of cardiovascular disease: can we trust medico-administrative hospital data? , 2012, The Canadian journal of cardiology.

[10]  P. Friedmann,et al.  Preoperative hematocrit levels and postoperative outcomes in older patients undergoing noncardiac surgery. , 2007, JAMA.

[11]  L. Resar,et al.  A novel method of data analysis for utilization of red blood cell transfusion , 2013, Transfusion.

[12]  Kjell Titlestad,et al.  The new Scandinavian Donations and Transfusions database (SCANDAT2): a blood safety resource with added versatility , 2015, Transfusion.

[13]  M. Melbye,et al.  Donation frequency, iron loss, and risk of cancer among blood donors. , 2008, Journal of the National Cancer Institute.

[14]  C. Blais,et al.  Iron and cardiac ischemia: a natural, quasi‐random experiment comparing eligible with disqualified blood donors (CME) , 2013, Transfusion.

[15]  N. Roubinian,et al.  A multicenter study of plasma use in the United States , 2015, Transfusion.

[16]  Gabriel J Escobar,et al.  Predicting red blood cell transfusion in hospitalized patients: role of hemoglobin level, comorbidities, and illness severity , 2014, BMC Health Services Research.

[17]  H. Kamel,et al.  Predictors of hemoglobin recovery or deferral in blood donors with an initial successful donation , 2014, Transfusion.

[18]  G. Escobar,et al.  Trends in red blood cell transfusion and 30‐day mortality among hospitalized patients , 2014, Transfusion.

[19]  H. Ullum,et al.  Duration of red blood cell storage and survival of transfused patients (CME) , 2010, Transfusion.

[20]  Theodore J. Iwashyna,et al.  Using Existing Data to Address Important Clinical Questions in Critical Care , 2013, Critical care medicine.

[21]  Carl van Walraven,et al.  Administrative database research infrequently used validated diagnostic or procedural codes. , 2011, Journal of clinical epidemiology.

[22]  D. Triulzi,et al.  Things aren't always as they seem: what the randomized trials of red blood cell transfusion tell us about adverse outcomes (CME) , 2014, Transfusion.

[23]  J A Cook,et al.  The rise of big clinical databases , 2015, The British journal of surgery.

[24]  L. Wideroff,et al.  Risk of cancer after blood transfusion from donors with subclinical cancer: a retrospective cohort study , 2007, The Lancet.

[25]  G. Bonsel,et al.  The PROTON study: profiles of blood product transfusion recipients in the Netherlands , 2010, Vox sanguinis.

[26]  B. Custer,et al.  Cancer Incidence and Mortality in a Cohort of US Blood Donors: A 20-Year Study , 2013, Journal of cancer epidemiology.

[27]  P. Austin,et al.  Surgical outcomes and transfusion of minimal amounts of blood in the operating room. , 2012, Archives of surgery.