In-pandemic development of an application ontology for COVID-19 surveillance in a primary care sentinel network.

BACKGROUND Creating an ontology for coronavirus disease 2019 (COVID-19) surveillance should help ensure transparency and consistency. Ontologies formalise conceptualisations at either domain or application level. Application ontologies cross domains and are specified through testable use cases. Our use case was extension of the role of the Oxford Royal College of General Practitioners (RCGP) Research and Surveillance Centre (RSC) to monitor the current pandemic and become an in-pandemic research platform. OBJECTIVE To develop an application ontology for COVID-19 which can be deployed across the various use case domains of the Oxford- RCGP RSC research and surveillance activities. METHODS We described our domain-specific use case. The actor was the RCGP RSC sentinel network; the system the course of the COVID-19 pandemic; the outcomes the spread and effect of mitigation measures. We used our established three-step method to develop the ontology, separating ontological concept development from code mapping and data extract validation. We developed a coding system-independent COVID-19 case identification algorithm. As there were no gold standard pandemic surveillance ontologies, we conducted a rapid Delphi consensus exercise through the International Medical Informatics Association (IMIA) Primary Health Care Informatics working group and extended networks. RESULTS Our use case domains included primary care, public health, virology, clinical research and clinical informatics. Our ontology supported: (1) Case identification, microbiological sampling and health outcomes at both an individual practice and national level; (2) Feedback through a dashboard; (3) A national observatory, (4) Regular updates for Public Health England, and (5) Transformation of the sentinel network to be a trial platform. We have identified a total of 19,115 people with a definite COVID-19 status, 5,226 with probable, and 74,293 people with possible COVID-19, within the RCGP RSC network (N=5,370,225). CONCLUSIONS The underpinning structure of our ontological approach has coped with multiple clinical coding challenges. At a time when there is uncertainty about international comparisons, clarity about the basis on which case definitions and outcomes are made from routine data is essential. CLINICALTRIAL

[1]  M A Musen,et al.  Domain Ontologies in Software Engineering: Use of Protégé with the EON Architecture , 1998, Methods of Information in Medicine.

[2]  Mordechai Ben-Menachem,et al.  Writing effective use cases , 2001, SOEN.

[3]  L. Madoff ProMED-mail: an early warning system for emerging diseases. , 2004, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[4]  David L. Buckeridge,et al.  Ontology-centered syndromic surveillance for bioterrorism , 2005, IEEE Intelligent Systems.

[5]  P Kumarapeli,et al.  Using Unified Modelling Language (UML) as a process-modelling technique for clinical-research process improvement , 2007, Medical informatics and the Internet in medicine.

[6]  Son Doan,et al.  BioCaster: detecting public health rumors with a Web-based text mining system , 2008, Bioinform..

[7]  Kenneth D. Mandl,et al.  HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports , 2008, Journal of the American Medical Informatics Association.

[8]  S. de Lusignan,et al.  Variation in the recording of diabetes diagnostic data in primary care computer systems: implications for the quality of care. , 2009, Informatics in primary care.

[9]  Helen Parkinson,et al.  Reference and Application Ontologies , 2010 .

[10]  S. de Lusignan,et al.  A method of identifying and correcting miscoding, misclassification and misdiagnosis in diabetes: a pilot and validation study of routinely collected data , 2010, Diabetic medicine : a journal of the British Diabetic Association.

[11]  Anna Zhukova,et al.  Modeling sample variables with an Experimental Factor Ontology , 2010, Bioinform..

[12]  L. Lipsitz,et al.  Understanding health care as a complex system: the foundation for unintended consequences. , 2012, JAMA.

[13]  Joachim Sturmberg,et al.  Understanding health care delivery as a complex system: achieving best possible health outcomes for individuals and communities by focusing on interdependencies. , 2014, Journal of evaluation in clinical practice.

[14]  K. Denecke,et al.  Social Media and Internet-Based Data in Global Systems for Public Health Surveillance: A Systematic Review , 2014, The Milbank quarterly.

[15]  S de Lusignan,et al.  Big Data Usage Patterns in the Health Care Domain: A Use Case Driven Approach Applied to the Assessment of Vaccination Benefits and Risks , 2014, Yearbook of Medical Informatics.

[16]  S. de Lusignan In this issue: Ontologies a key concept in informatics and key for open definition of cases, exposures and outcome measures , 2015, BMJ Health & Care Informatics.

[17]  Mark A. Musen,et al.  The protégé project: a look back and a look forward , 2015, SIGAI.

[18]  A Mawudeku,et al.  Big Data and the Global Public Health Intelligence Network (GPHIN). , 2015, Canada communicable disease report = Releve des maladies transmissibles au Canada.

[19]  Mark J. Taylor,et al.  LEGAL BASES FOR DISCLOSING CONFIDENTIAL PATIENT INFORMATION FOR PUBLIC HEALTH: DISTINGUISHING BETWEEN HEALTH PROTECTION AND HEALTH IMPROVEMENT , 2015, Medical law review.

[20]  Joanna Ellis,et al.  Effectiveness of seasonal influenza vaccine for adults and children in preventing laboratory-confirmed influenza in primary care in the United Kingdom: 2015/16 end-of-season results , 2016, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[21]  Simon de Lusignan,et al.  Royal College of General Practitioners Research and Surveillance Centre (RCGP RSC) sentinel network: a cohort profile , 2016, BMJ Open.

[22]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[23]  Filipa Ferreira,et al.  RCGP Research and Surveillance Centre: 50 years' surveillance of influenza, infections, and respiratory conditions. , 2017, The British journal of general practice : the journal of the Royal College of General Practitioners.

[24]  Simon de Lusignan,et al.  Automated Differentiation of Incident and Prevalent Cases in Primary Care Computerised Medical Records (CMR) , 2018, MIE.

[25]  Harshana Liyanage,et al.  An ontological approach to identifying cases of chronic kidney disease from routine primary care data: a cross-sectional study , 2018, BMC Nephrology.

[26]  Simon de Lusignan,et al.  Common Data Models (CDMs) to Enhance International Big Data Analytics: A Diabetes Use Case to Compare Three CDMs , 2018, EFMI-STC.

[27]  Charlene R. Weir,et al.  Development of a cardiac-centered frailty ontology , 2019, Journal of Biomedical Semantics.

[28]  Harshana Liyanage,et al.  Artificial Intelligence in Primary Health Care: Perceptions, Issues, and Challenges , 2019, Yearbook of Medical Informatics.

[29]  Harshana Liyanage,et al.  Ontology to identify pregnant women in electronic health records: primary care sentinel network database study , 2019, BMJ Health & Care Informatics.

[30]  P. Horby,et al.  A novel coronavirus outbreak of global health concern , 2020, The Lancet.

[31]  Xiaolong Qi,et al.  Real estimates of mortality following COVID-19 infection , 2020, The Lancet Infectious Diseases.

[32]  Y. Hu,et al.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China , 2020, The Lancet.

[33]  Christian Drosten,et al.  Severe acute respiratory syndrome-related coronavirus: The species and its viruses – a statement of the Coronavirus Study Group , 2020, bioRxiv.

[34]  C. Whittaker,et al.  Estimates of the severity of coronavirus disease 2019: a model-based analysis , 2020, The Lancet Infectious Diseases.

[35]  Joanna Ellis,et al.  Risk factors for SARS-CoV-2 among patients in the Oxford Royal College of General Practitioners Research and Surveillance Centre primary care network: a cross-sectional study , 2020, The Lancet Infectious Diseases.

[36]  Alex J. Elliot,et al.  The Oxford Royal College of General Practitioners Clinical Informatics Digital Hub: Protocol to Develop Extended COVID-19 Surveillance and Trial Platforms (Preprint) , 2020 .

[37]  Simon de Lusignan,et al.  Investigating regional variation of respiratory infections in a general practice syndromic surveillance system. , 2020, Journal of public health.

[38]  Simon de Lusignan,et al.  To monitor the COVID-19 pandemic we need better quality primary care data , 2020, BJGP open.

[39]  A. Giacomelli,et al.  Self-reported olfactory and taste disorders in SARS-CoV-2 patients: a cross-sectional study. , 2020, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[40]  Harshana Liyanage,et al.  Ethical Use of Electronic Health Record Data and Artificial Intelligence: Recommendations of the Primary Care Informatics Working Group of the International Medical Informatics Association , 2020, Yearbook of Medical Informatics.

[41]  Joanna Ellis,et al.  Emergence of a Novel Coronavirus (COVID-19): Protocol for Extending Surveillance Used by the Royal College of General Practitioners Research and Surveillance Centre and Public Health England , 2020, JMIR public health and surveillance.

[42]  C. Buckee,et al.  Improving epidemic surveillance and response: big data is dead, long live big data , 2020, The Lancet Digital Health.

[43]  J. Pell,et al.  COVID-19 Surveillance in a Primary Care Sentinel Network: In-Pandemic Development of an Application Ontology , 2020, JMIR public health and surveillance.