Development and validation of method for defining conditions using Chinese electronic medical record

BackgroundThe adoption of the electronic medical record (EMR) is rapidly growing in China. Constantly evolving, Chinese EMRs contain vast amounts of clinical and financial data, providing tremendous potential for research and policy use; however, they are only partially standardized and contain free text or unstructured data. To utilize the information contained in Chinese EMRs, the development of data extraction methodology is urgently needed. The purpose of this study is to develop and validate methods to extract clinical information from the Chinese EMR for research use.MethodsUsing 2010 to 2014 EMR data from YouAn Hospital, a large teaching hospital affiliated with Capital Medical University in Beijing, China, we developed extraction methods including 40 EMR definitions for defining 6 liver disease, 5 disease severity conditions, and 29 comorbidities and treatments. We conducted a chart review of 450 randomly selected EMRs. Using physician chart review results as a reference, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated to validate each EMR definition.ResultsThe sensitivity of the 6 EMR definitions for liver diseases ranged from 78.9 to 100.0 %, and PPV ranged from 82.1 to 100.0 %. The sensitivity of the 5 definitions on disease severity conditions ranged from 91.0 to 100.0 %, and PPV ranged from 79.2 to 100.0 %. Among the 29 EMR definitions for comorbidities and treatments, 23 had sensitivity over 90.0 % and 25 had PPV over 80.0 %. The specificity and NPV for all 40 EMR definitions were over 90.0 %.ConclusionThe extraction method developed is a valid way of extracting information on liver diseases, comorbidities and related treatments from YouAn hospital EMRs. Our method should be modified for application to other Chinese EMR systems, following our framework for extracting conditions.

[1]  W. Kim,et al.  The model for end‐stage liver disease (MELD) , 2007, Hepatology.

[2]  C. Steiner,et al.  Comorbidity measures for use with administrative data. , 1998, Medical care.

[3]  Tony Antoniou,et al.  Validation of Case-Finding Algorithms Derived from Administrative Data for Identifying Adults Living with Human Immunodeficiency Virus Infection , 2011, PloS one.

[4]  Siwei Zhang,et al.  Liver cancer epidemic in China: past, present and future. , 2011, Seminars in cancer biology.

[5]  David S Goldberg,et al.  Coding algorithms for identifying patients with cirrhosis and hepatitis B or C virus using administrative data , 2015, Pharmacoepidemiology and drug safety.

[6]  Ashraf A. Omar,et al.  Liver fibrosis: consensus recommendations of the Asian Pacific Association for the Study of the Liver (APASL) , 2009, Hepatology international.

[7]  Yi Qian,et al.  Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries. , 2014, Journal of the American Medical Informatics Association : JAMIA.

[8]  Lei Liu,et al.  Extracting important information from Chinese Operation Notes with natural language processing methods , 2014, J. Biomed. Informatics.

[9]  Jia-Horng Kao,et al.  Asian-Pacific consensus statement on the management of chronic hepatitis B: a 2012 update , 2012, Hepatology International.

[10]  Hude Quan,et al.  Predicting in‐hospital mortality in patients with cirrhosis: Results differ across risk adjustment methods , 2009, Hepatology.

[11]  Hua Xu,et al.  A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries , 2011, J. Am. Medical Informatics Assoc..

[12]  S. Sarin,et al.  Asian Pacific Association for the Study of the Liver consensus statements on the diagnosis, management and treatment of hepatitis C virus infection , 2007, Journal of gastroenterology and hepatology.

[13]  You-Lin Qiao,et al.  Estimation of Cancer Burden Attributable to Infection in Asia , 2015, Journal of epidemiology.

[14]  Peter J. Richardson,et al.  Validation of Case Finding Algorithms for Hepatocellular Cancer From Administrative Data and Electronic Health Records Using Natural Language Processing , 2016, Medical care.

[15]  Yu Wang,et al.  Epidemiological serosurvey of hepatitis B in China--declining HBV prevalence due to hepatitis B vaccination. , 2009, Vaccine.

[16]  Melissa A. Basford,et al.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[17]  中华医学会肝病学分会,et al.  The guideline of prevention and treatment for chronic hepatitis B(2010 version) , 2006 .

[18]  Jia Ji-dong,et al.  [The guideline of prevention and treatment for chronic hepatitis B (2010 version)]. , 2011, Zhonghua gan zang bing za zhi = Zhonghua ganzangbing zazhi = Chinese journal of hepatology.

[19]  R. Pugh,et al.  Transection of the oesophagus for bleeding oesophageal varices , 1973, The British journal of surgery.

[20]  Neil B. Pride,et al.  Validation of general practitioner-diagnosed COPD in the UK General Practice Research Database , 2004, European Journal of Epidemiology.

[21]  R. Quinn,et al.  Gender, renal function, and outcomes on the liver transplant waiting list: assessment of revised MELD including estimated glomerular filtration rate. , 2011, Journal of hepatology.

[22]  C. Mackenzie,et al.  A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. , 1987, Journal of chronic diseases.

[23]  George Hripcsak,et al.  Development and validation of an electronic phenotyping algorithm for chronic kidney disease , 2014, AMIA.

[24]  Melissa A. Basford,et al.  The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future , 2013, Genetics in Medicine.

[25]  Tewodros Eguale,et al.  Automated Extraction of VTE Events From Narrative Radiology Reports in Electronic Health Records , 2015, Medical care.

[26]  Jeong Min Lee,et al.  Asian Pacific Association for the Study of the Liver consensus recommendations on hepatocellular carcinoma , 2010, Hepatology international.

[27]  Jianping Hu,et al.  Harmonization of health data at national level: A pilot study in China , 2010, Int. J. Medical Informatics.

[28]  Jiajie Zhang,et al.  A comparison of electronic health records at two major Peking University Hospitals in China to United States meaningful use objectives , 2013, BMC Medical Informatics and Decision Making.

[29]  Wendy A. Wolf,et al.  The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies , 2011, BMC Medical Genomics.

[30]  Tyler Williamson,et al.  Validating the 8 CPCSSN Case Definitions for Chronic Disease Surveillance in a Primary Care Database of Electronic Health Records , 2014, The Annals of Family Medicine.

[31]  David W. Bates,et al.  EHR adoption across China's tertiary hospitals: A cross-sectional observational study , 2014, Int. J. Medical Informatics.

[32]  Bing Li,et al.  Updating and validating the Charlson comorbidity index and score for risk adjustment in hospital discharge abstracts using data from 6 countries. , 2011, American journal of epidemiology.

[33]  Baoyan Liu,et al.  Data processing and analysis in real‐world traditional Chinese medicine clinical data: challenges and approaches , 2012, Statistics in medicine.