Using Domain Knowledge about Medications to Correct Recognition Errors in Medical Report Creation

We present an approach to analysing automatic speech recognition (ASR) hypotheses for dictated medical reports based on background knowledge. Our application area is prescriptions of medications, which are a frequent source of misrecognitions: In a sample report corpus, we found that about 40% of the active substances or trade names and dosages were recognized incorrectly. In about 25% of these errors, the correct string of words was contained in the word graph. We have built a knowledge base of medications based on information contained in the Unified Medical Language System (UMLS), consisting of trade names, active substances, strengths and dosages. From this, we generate a variety of linguistic realizations for prescriptions. Whenever an inconsistency in a prescription is encountered on the best path of the word graph, the system searches for alternative paths which contain valid linguistic realizations of prescriptions consistent with the knowledge base. If such a path exists, a new concept edge with a better score is added to the word graph, resulting in a higher plausibility for this reading. The concept edge can be used for rescoring the word graph to obtain a new best path. A preliminary evaluation led to encouraging results: in nearly half of the cases where the word graph contained the correct variant, the correction was successful.

[1]  Stephanie Seneff,et al.  Dialogue Management in the Mercury Flight Reservation System , 2000 .

[2]  Kimberly Voll,et al.  A methodology of error detection: improving speech recognition in radiology , 2006 .

[3]  Morten Hertzum,et al.  Acceptance of speech recognition by physicians: A survey of expectations, experiences, and social influence , 2009, Int. J. Hum. Comput. Stud..

[4]  H. Ney,et al.  Multi-Level Error Handling for Tree Based Dialogue Course Management , 2003 .

[5]  Iryna Gurevych,et al.  Ontology-based Contextual Coherence Scoring , 2003, SIGDIAL Workshop.

[6]  Alex Acero,et al.  Use and Acquisition of Semantic Language Model , 2004, HLT-NAACL.

[7]  Wolfgang Minker,et al.  Using Language Modelling to Integrate Speech Recognition with a Flat Semantic Analysis , 2005, SIGDIAL Workshop.

[8]  Hermann Ney,et al.  Word graphs: an efficient interface between continuous-speech recognition and language understanding , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  George Hripcsak,et al.  Extracting Structured Medication Event Information from Discharge Summaries , 2008, AMIA.

[10]  Iryna Gurevych,et al.  Using Knowledge-Based Scores for Identifying Best Speech Recognition Hypothesis , 2003 .

[11]  D. Lindberg,et al.  Unified Medical Language System , 2020, Definitions.

[12]  Michael Picheny,et al.  Word level confidence measurement using semantic features , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[13]  Rong Zhang,et al.  Word level confidence annotation using combinations of features , 2001, INTERSPEECH.

[14]  Min Li,et al.  A Cascade Approach to Extracting Medication Events , 2009, ALTA.

[15]  Wei Ma,et al.  RxNorm: prescription for electronic drug information exchange , 2005, IT Professional.