Combining Context and Existing Knowledge When Recognizing Biological Entities - Early Results
暂无分享,去创建一个
Entity recognition has been studied for several years with good results. However, as the focus of information extraction (IE) and entity recognition (ER) has been set on biology and bioinformatics, the existing methods do not produce as good results as before. This is mainly due to the complex naming conventions of biological entities. In our information extraction system for biomedical documents called OAT (Ontology Aided Text mining system) we developed our own method for recognizing the biological entities. The difference to the existing methods, which use lexicons, rules and statistics, is that we combine the context of the entity with the existing knowledge about the relationships of the entities. This has produced encouraging preliminary results. This paper describes the approach we are using in our information extraction system for entity recognition.
[1] Russ B. Altman,et al. GAPSCORE: finding gene and protein names one word at a time , 2004, Bioinform..
[2] Daniel Hanisch,et al. Playing Biology's Name Game: Identifying Protein Names in Scientific Text , 2002, Pacific Symposium on Biocomputing.
[3] William R. Hersh,et al. A Survey of Current Work in Biomedical Text Mining , 2005 .
[4] Lorraine K. Tanabe,et al. Tagging gene and protein names in biomedical text , 2002, Bioinform..