论文信息 - Improving language models for radiology speech recognition

Improving language models for radiology speech recognition

Speech recognition systems have become increasingly popular as a means to produce radiology reports, for reasons both of efficiency and of cost. However, the suboptimal recognition accuracy of these systems can affect the productivity of the radiologists creating the text reports. We analyzed a database of over two million de-identified radiology reports to determine the strongest determinants of word frequency. Our results showed that body site and imaging modality had a similar influence on the frequency of words and of three-word phrases as did the identity of the speaker. These findings suggest that the accuracy of speech recognition systems could be significantly enhanced by further tailoring their language models to body site and imaging modality, which are readily available at the time of report creation.

Curtis P. Langlotz | John M. Paulett

[1] Paul Rayson,et al. Extending the Cochran rule for the comparison of word frequencies between corpora , 2004 .

[2] Vesa Siivola,et al. Growing an n-gram language model , 2005, INTERSPEECH.

[3] Alberto F. Goldszal,et al. Development and Validation of Queries Using Structured Query Language (SQL) to Determine the Utilization of Comparison Imaging in Radiology Reports Stored on PACS , 2005, Journal of Digital Imaging.

[4] David Liu,et al. Six Characteristics of Effective Structured Reporting and the Inevitable Integration with Speech Recognition , 2005, Journal of Digital Imaging.

[5] Adam Kilgarriff,et al. Which words are particularly characteristic of a text? a survey of statistical approaches , 1996 .

[6] Maamoun M Al-Aynati,et al. Comparison of voice-automated transcription and human transcription in generating pathology reports. , 2003, Archives of pathology & laboratory medicine.

[7] Steve Langer,et al. Radiology speech recognition: workflow, integration, and productivity issues. , 2002, Current problems in diagnostic radiology.