Ontology for Data Quality and Chronic Disease Management: A Literature Review

Improved Data Quality (DQ) can improve the quality of decisions and lead to better policy in health organizations. Ontologies can support automated tools to assess DQ. This chapter examines ontologybased approaches to conceptualization and specification of DQ based on “fitness for purpose” within the health context. English language studies that addressed DQ, fitness for purpose, ontology-based approaches, and implementations were included. The authors screened 315 papers; excluded 36 duplicates, 182 on abstract review, and 46 on full-text review; leaving 52 papers. These were appraised with a realist “context-mechanism-impacts/outcomes” template. The authors found a lack of consensus frameworks or definitions for DQ and comprehensive ontological approaches to DQ or fitness for purpose. The majority of papers described the processes of the development of DQ tools. Some assessed the impact of implementing ontology-based specifications for DQ. There were few evaluative studies of the performance of DQ assessment tools developed; none compared ontological with non-ontological approaches. Alireza Rahimi University of New South Wales, Australia & Isfahan University of Medical Sciences, Iran Siaw-Teng Liaw University of New South Wales, Australia Pradeep Kumar Ray University of New South Wales, Australia Jane Taggart University of New South Wales, Australia Hairong Yu University of New South Wales, Australia

[1]  Peter Piela,et al.  Work language analysis and the naming problem , 1993, CACM.

[2]  Monica Cahill McJunkin Precision and recall in title keyword searches , 1995 .

[3]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[4]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[5]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[6]  A. Gillies,et al.  Assessing and improving the quality of information for health evaluation and promotion. , 2000, Methods of information in medicine.

[7]  Anita Burgun-Parenthoine,et al.  Building an Ontology for End-stage Diseases and Organ Failure in the Context of Transplantation and Dialysis , 2001, MedInfo.

[8]  Diane M. Strong,et al.  Information quality benchmarks: product and service performance , 2002, CACM.

[9]  Nicolette de Keizer,et al.  Model Formulation: Defining and Improving Data Quality in Medical Registries: A Literature Review, Case Study, and Generic Framework , 2002, J. Am. Medical Informatics Assoc..

[10]  N. D. de Keizer,et al.  Quality of data collected for severity of illness scores in the Dutch National Intensive Care Evaluation (NICE) registry , 2002, Intensive Care Medicine.

[11]  T. Peters,et al.  The quality of record keeping in primary care: a comparison of computerised, paper and hybrid systems. , 2003, The British journal of general practice : the journal of the Royal College of General Practitioners.

[12]  Graeme G. Shanks,et al.  Improving the quality of data models: empirical validation of a quality management framework , 2003, Inf. Syst..

[13]  H. Sofia Pinto,et al.  Ontologies: How can They be Built? , 2004, Knowledge and Information Systems.

[14]  Nicolette F de Keizer,et al.  Training in data definitions improves quality of intensive care data , 2003, Critical care.

[15]  Anita Burgun-Parenthoine,et al.  Developing the ontological foundations of a terminological system for end-stage diseases, organ failure, dialysis and transplantation , 2003, Int. J. Medical Informatics.

[16]  Bertram Ludäscher,et al.  Towards a formalization of disease-specific ontologies for neuroinformatics , 2003, Neural Networks.

[17]  Yorick Wilks,et al.  Data Driven Ontology Evaluation , 2004, LREC.

[18]  M L Moro,et al.  Can hospital discharge diagnoses be used for surveillance of surgical-site infections? , 2004, The Journal of hospital infection.

[19]  Elizabeth Chang,et al.  Ontology-based Multi-agent Systems Support Human Disease Study and Control , 2005, SOAS.

[20]  R. W. Schuring,et al.  E-Health Systems Diffusion and Use: The Innovation, the User and the Use It Model , 2005 .

[21]  Sophia Ananiadou,et al.  A Flexible Measure of Contextual Similarity for Biomedical Terms , 2004, Pacific Symposium on Biocomputing.

[22]  Letha H. Etzkorn,et al.  Cohesion Metrics for Ontology Design and Application , 2005 .

[23]  M. Simonet,et al.  Building an ontology of cardio-vascular diseases for concept-based information retrieval , 2005, Computers in Cardiology, 2005.

[24]  Margarita Álvarez,et al.  Information systems: new ontology-based scenarios , 2006 .

[25]  Aldo Gangemi,et al.  Modelling Ontology Evaluation and Validation , 2006, ESWC.

[26]  Miguel García-Remesal,et al.  ONTOFUSION: Ontology-based integration of genomic and clinical databases , 2006, Comput. Biol. Medicine.

[27]  Graciela Elisa Barchini,et al.  SISTEMAS DE INFORMACIÓN: NUEVOS ESCENARIOS BASADOS EN ONTOLOGÍAS INFORMATION SYSTEMS: NEW ONTOLOGY-BASED SCENARIOS , 2006 .

[28]  Huan-Chung Li,et al.  Automated Food Ontology Construction Mechanism for Diabetes Diet Care , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[29]  Letha H. Etzkorn,et al.  Indicating ontology data quality, stability, and completeness throughout ontology evolution , 2007, J. Softw. Maintenance Res. Pract..

[30]  R. Stockdale,et al.  Data Quality Information and Decision Making: A Healthcare Case Study , 2007 .

[31]  Chang-Shing Lee,et al.  Ontology-based Fuzzy Inference Agent for Diabetes Classification , 2007, NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society.

[32]  Graeme Miller,et al.  The quality of data on general practice - a discussion of BEACH reliability and validity. , 2007, Australian family physician.

[33]  Jérôme Euzenat,et al.  Semantic Precision and Recall for Ontology Alignment Evaluation , 2007, IJCAI.

[34]  Massimo Esposito,et al.  An Ontological and Non-monotonic Rule-Based Approach to Label Medical Images , 2007, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System.

[35]  Letha H. Etzkorn,et al.  Indicating ontology data quality, stability, and completeness throughout ontology evolution: Research Articles , 2007 .

[36]  Robert Jeansoulin,et al.  Towards spatial data quality information analysis tools for experts assessing the fitness for use of spatial data , 2007, Int. J. Geogr. Inf. Sci..

[37]  Stefan Wermter,et al.  Auto-Extraction, Representation and Integration of a Diabetes Ontology Using Bayesian Networks , 2007, Twentieth IEEE International Symposium on Computer-Based Medical Systems (CBMS'07).

[38]  H. Dreher,et al.  Ontology based data warehouse modeling and managing ecology of human body for disease and drug prescription management , 2008, 2008 2nd IEEE International Conference on Digital Ecosystems and Technologies.

[39]  Jonathan M Morris,et al.  How accurate is the reporting of obstetric haemorrhage in hospital discharge data? A validation study , 2008, The Australian & New Zealand journal of obstetrics & gynaecology.

[40]  Massimo Esposito Congenital Heart Disease: An Ontology-Based Approach for the Examination of the Cardiovascular System , 2008, KES.

[41]  Ddembe Williams,et al.  A flexible approach for user evaluation of biomedical ontologies , 2008 .

[42]  J S Mitchell,et al.  Emergency department information system diagnosis: how accurate is it? , 2008, Emergency Medicine Journal.

[43]  Paolo Missier,et al.  Performance prediction for a code with data-dependent runtimes , 2008 .

[44]  Nikos Fakotakis,et al.  A hierarchical, ontology-driven Bayesian concept for ubiquitous medical environments- A case study for pulmonary diseases , 2008, 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[45]  J Stausberg,et al.  [Frequency of hospital-acquired pneumonia--comparison between electronic and paper-based patient records]. , 2008, Pneumologie.

[46]  Rafael Valencia-García,et al.  A knowledge acquisition methodology to ontology construction for information retrieval from medical documents , 2008, Expert Syst. J. Knowl. Eng..

[47]  H. Quan,et al.  Assessing validity of ICD-9-CM and ICD-10 administrative data in recording clinical conditions in a unique dually coded database. , 2008, Health services research.

[48]  T. Forrester,et al.  A comparative study of the quality and availability of health information used to facilitate cost burden analysis of diabetes and hypertension in the Caribbean. , 2008, The West Indian medical journal.

[49]  Giovanni Acampora,et al.  Ontology-based intelligent fuzzy agent for diabetes application , 2009, 2009 IEEE Symposium on Intelligent Agents.

[50]  David L. Blazes,et al.  Impact of two interventions on timeliness and data quality of an electronic disease surveillance system in a resource limited setting (Peru): a prospective evaluation , 2009, BMC Medical Informatics Decis. Mak..

[51]  John Herbert,et al.  Towards Improved Information Quality: The Integration of Body Area Network Data within Electronic Health Records , 2009, ICOST.

[52]  Besiki Stvilia,et al.  Disorderly reasoning in information design , 2009 .

[53]  Hua Min,et al.  Integration of prostate cancer clinical data using an ontology , 2009, J. Biomed. Informatics.

[54]  Gao Xiang,et al.  Anchoring the Consistency Dimension of Data Quality Using Ontology in Data Integration , 2009, 2009 Sixth Web Information Systems and Applications Conference.

[55]  Nikola K. Kasabov,et al.  Ontology Based Personalized Modeling for Chronic Disease Risk Analysis: An Integrated Approach , 2008, Aust. J. Intell. Inf. Process. Syst..

[56]  Antonio J. Jara,et al.  An ontology and rule based intelligent information system to detect and predict myocardial diseases , 2009, 2009 9th International Conference on Information Technology and Applications in Biomedicine.

[57]  Thusitha De Silva Mabotuwana,et al.  An ontology-based approach to enhance querying capabilities of general practice medicine for better management of hypertension , 2009, Artif. Intell. Medicine.

[58]  Samson W. Tu,et al.  Ontology driven data integration for autism research , 2009, 2009 22nd IEEE International Symposium on Computer-Based Medical Systems.

[59]  Stefan Brüggemann,et al.  Using Ontologies Providing Domain Knowledge for Data Quality Management , 2009, Networked Knowledge - Networked Media - Integrating Knowledge Management.

[60]  Frederick Chen Modeling the effect of information quality on risk behavior change and the transmission of infectious diseases. , 2009, Mathematical biosciences.

[61]  José Neves,et al.  Decision Making Based on Quality-of-Information a Clinical Guideline for Chronic Obstructive Pulmonary Disease Scenario , 2010, DCAI.

[62]  Samina Raza Abidi Ontology-Based Knowledge Modeling to Provide Decision Support for Comorbid Diseases , 2010, KR4HC.

[63]  S. de Lusignan,et al.  A method of identifying and correcting miscoding, misclassification and misdiagnosis in diabetes: a pilot and validation study of routinely collected data , 2010, Diabetic medicine : a journal of the British Diabetic Association.

[64]  Christel Daniel-Le Bozec,et al.  The Information Quality Triangle: A methodology to assess Clinical Information quality , 2010, MedInfo.

[65]  Thepchai Supnithi,et al.  Design and Implementation of an Ontology-Based Clinical Reminder System to Support Chronic Disease Healthcare , 2011, IEICE Trans. Inf. Syst..

[66]  I. Ivánová,et al.  Searching for spatial data resources by fitness for use , 2011 .

[67]  Jane Taggart,et al.  Data quality and fitness for purpose of routinely collected data--a general practice case study from an electronic practice-based research network (ePBRN). , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[68]  Barbara Castelnuovo,et al.  Quality of data collection in a large HIV observational clinic database in sub-Saharan Africa: implications for clinical research and audit of care , 2011, Journal of the International AIDS Society.

[69]  Pantelis Topalis,et al.  A set of ontologies to drive tools for the control of vector-borne diseases , 2009, J. Biomed. Informatics.

[70]  Suzette J. Bielinski,et al.  Mining the Human Phenome using Semantic Web Technologies: A Case Study for Type 2 Diabetes , 2012, AMIA.

[71]  Deborah H. Batson,et al.  Data model considerations for clinical effectiveness researchers. , 2012, Medical care.

[72]  Francesco Rubino,et al.  GIDL: a rule based expert system for GenBank Intelligent Data Loading into the Molecular Biodiversity database , 2012, BMC Bioinformatics.

[73]  Christopher G. Chute,et al.  Using Semantic Web Technologies for Cohort Identification from Electronic Health Records for Clinical Research , 2012, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[74]  Jane Taggart,et al.  Corrigendum to "Towards an ontology for data quality in integrated chronic disease management: A realist review of the literature" [Int. J. Med. Inform. 82 (2013) 10-24] , 2013, Int. J. Medical Informatics.

[75]  Pradeep Kumar Ray,et al.  Towards an ontology for data quality in integrated chronic disease management: A realist review of the literature , 2013, Int. J. Medical Informatics.

[76]  WisemanPatricia,et al.  Using Theory to Drive Influenza Related Text Messaging Interventions , 2015 .

[77]  Patricia Wiseman,et al.  Using Theory to Drive Influenza Related Text Messaging Interventions: A Pilot Study to Evaluate the Development of the Theory Based Influenza Related Text Messages Content for Clarity, Internal Consistency, and Content Validity , 2015, Int. J. E Health Medical Commun..