In the recent decades, high-throughput screening methods were established, bringing forth major breakthroughs in the fields of molecular biology and biomedicine. Since researchers in these fields need to interpret an enormous quantity of data and the publication rates of scientific articles are exploding, demands on text mining technology are growing with each passing year.
Medie (http://www-tsujii.is.s.u-tokyo.ac.jp/medie/) and Info-pubmed (http://www-tsujii.is.s.u-tokyo.ac.jp/info-pubmed/) were developed as a response to these information needs. Medie is a general-purpose integrated Pubmed search engine and Info-pubmed is a targeted system for finding information about the interactions of key biomedical entities.
In this work, the first update of these systems since their introduction, we present multiple extensions of the systems based on recent advances in biomedical text mining.
[1]
Naoaki Okazaki,et al.
Identifying Sections in Scientific Abstracts using Conditional Random Fields
,
2008,
IJCNLP.
[2]
Jun'ichi Tsujii,et al.
Corpus annotation for mining biomedical events from literature
,
2008,
BMC Bioinformatics.
[3]
Teruyoshi Hishiki,et al.
Extraction of Gene-Disease Relations from Medline Using Domain Dictionaries and Machine Learning
,
2005,
Pacific Symposium on Biocomputing.
[4]
Jun'ichi Tsujii,et al.
A log-linear model with an n-gram reference distribution for accurate HPSG parsing
,
2007,
IWPT.