Conceptualisation and Annotation of Drug Nonadherence Information for Knowledge Extraction from Patient-Generated Texts

Approaches to knowledge extraction (KE) in the health domain often start by annotating text to indicate the knowledge to be extracted, and then use the annotated text to train systems to perform the KE. This may work for annotat- ing named entities or other contiguous noun phrases (drugs, some drug effects), but be- comes increasingly difficult when items tend to be expressed across multiple, possibly non- contiguous, syntactic constituents (e.g. most descriptions of drug effects in user-generated text). Other issues include that it is not al- ways clear how annotations map to actionable insights, or how they scale up to, or can form part of, more complex KE tasks. This paper reports our efforts in developing an approach to extracting knowledge about drug nonadher- ence from health forums which led us to con- clude that development cannot proceed in sep- arate steps but that all aspects—from concep- tualisation to annotation scheme development, annotation, KE system training and knowl- edge graph instantiation—are interdependent and need to be co-developed. Our aim in this paper is two-fold: we describe a generally ap- plicable framework for developing a KE ap- proach, and present a specific KE approach, developed with the framework, for the task of gathering information about antidepressant drug nonadherence. We report the conceptual- isation, the annotation scheme, the annotated corpus, and an analysis of annotated texts.

[1]  Nigel Collier,et al.  OMG U got flu? Analysis of shared health messages for bio-surveillance , 2011, Semantic Mining in Biomedicine.

[2]  Jian Yang,et al.  Towards Internet-Age Pharmacovigilance: Extracting Adverse Drug Reactions from User Posts in Health-Related Social Networks , 2010, BioNLP@ACL.

[3]  Sarvnaz Karimi,et al.  Cadec: A corpus of adverse drug event annotations , 2015, J. Biomed. Informatics.

[4]  Mark Dredze,et al.  Entity Linking: Finding Extracted Entities in a Knowledge Base , 2013, Multi-source, Multilingual Information Extraction and Summarization.

[5]  Arkaitz Zubiaga,et al.  Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads , 2015, PloS one.

[6]  Mike Conway,et al.  Towards Developing an Annotation Scheme for Depressive Disorder Symptoms: A Preliminary Study using Twitter Data , 2015, CLPsych@HLT-NAACL.

[7]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[8]  Sarvnaz Karimi,et al.  Concept Identification and Normalisation for Adverse Drug Event Discovery in Medical Forums , 2016, BMDID@ISWC.

[9]  Ari Z. Klein,et al.  Dealing with Medication Non-Adherence Expressions in Twitter , 2018, EMNLP 2018.

[10]  A. Egberts,et al.  Initiation of antidepressant therapy: do patients follow the GP's prescription? , 2009, The British journal of general practice : the journal of the Royal College of General Practitioners.

[11]  Danushka Bollegala,et al.  Causality Patterns for Detecting Adverse Drug Reactions From Social Media: Text Mining Approach , 2018, JMIR public health and surveillance.

[12]  Kevin Donnelly,et al.  SNOMED-CT: The advanced terminology and coding system for eHealth. , 2006, Studies in health technology and informatics.

[13]  Peer Bork,et al.  The SIDER database of drugs and side effects , 2015, Nucleic Acids Res..

[14]  Kei-Hoi Cheung,et al.  BioPAX – A community standard for pathway data sharing , 2010, Nature Biotechnology.

[15]  Abeed Sarker,et al.  Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features , 2015, J. Am. Medical Informatics Assoc..

[16]  Cynthia Van Hee,et al.  Guidelines for the fine-grained analysis of cyberbullying, version 1.0 , 2015 .

[17]  Amit P. Sheth,et al.  PREDOSE: A semantic web platform for drug abuse epidemiology using social media , 2013, J. Biomed. Informatics.

[18]  Use of social media in pharmacovigilance , 2019, Reactions Weekly.

[19]  Véronique Hoste,et al.  Recognising suicidal messages in Dutch social media , 2014, LREC.

[20]  S. De Geest,et al.  Adherence to Long-Term Therapies: Evidence for Action , 2003, European journal of cardiovascular nursing : journal of the Working Group on Cardiovascular Nursing of the European Society of Cardiology.

[21]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[22]  Jun Zhao,et al.  Collective entity linking in web text: a graph-based method , 2011, SIGIR.

[23]  P. Burkhart,et al.  Adherence to long-term therapies: evidence for action. , 2003, Journal of nursing scholarship : an official publication of Sigma Theta Tau International Honor Society of Nursing.

[24]  Joel Nothman,et al.  Evaluating Entity Linking with Wikipedia , 2013, Artif. Intell..

[25]  S. van Dulmen,et al.  Patient adherence to medical treatment: a review of reviews , 2007, BMC Health Services Research.

[26]  S. Golder,et al.  Methods to Compare Adverse Events in Twitter to FAERS, Drug Information Databases, and Systematic Reviews: Proof of Concept with Adalimumab , 2018, Drug Safety.

[27]  Ophir Frieder,et al.  Extracting Adverse Drug Reactions from Social Media , 2015, AAAI.

[28]  Michael J. Paul,et al.  Exploring Timelines of Confirmed Suicide Incidents Through Social Media , 2017, IEEE International Conference on Healthcare Informatics.

[29]  L. van Dijk,et al.  Definitions, variants, and causes of nonadherence with medication: a challenge for tailored interventions , 2013, Patient preference and adherence.