A Cross Language Document Retrieval System Based on Semantic Annotation
暂无分享,去创建一个
The paper describes a cross-lingual document retrieval system in the medical domain that employs a controlled vocabulary (UMLS1) in constructing an XML-based intermediary representation into which queries as well as documents are mapped. The system assists in the retrieval of English and German medical scientific abstracts relevant to a German query document (electronic patient record). The modularity of the system allows for deployment in other domains, given appropriate linguistic and semantic resources.
[1] Wojciech Skut,et al. A Maximum-Entropy Partial Parser for Unrestricted Text , 1998, VLC@COLING/ACL.
[2] Thorsten Brants,et al. TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.
[3] Paul Buitelaar,et al. An Efficient and Flexible Format for Linguistic and Semantic Annotation , 2002, LREC.