A multilingual ontology for infectious disease surveillance: rationale, design and challenges

A lack of surveillance system infrastructure in the Asia-Pacific region is seen as hindering the global control of rapidly spreading infectious diseases such as the recent avian H5N1 epidemic. As part of improving surveillance in the region, the BioCaster project aims to develop a text mining system that automatically monitors Internet news and other online sources in several regional languages. At the heart of the system is an application ontology that serves two purposes: it enables advanced searches on the mined facts, and it allows the system to make intelligent inferences when assessing the priority of events. However, it became clear early in the project that existing classification schemes did not have the language coverage or semantic specificity we needed. In this article, we present an overview of our needs and explore in detail the rationale and methods for developing a new conceptual structure and multilingual terminological resource that focusses on priority pathogens and the diseases they cause. The ontology is freely available as an online database and as a downloadable OWL file.
