The problem that the same information need can be expressed in a variety of ways is especially true for scientific literature. Each scientific discipline has its own domain-specific language and vocabulary. This language is coded into documentary tools like thesauri or classifications that are used to document and describe scientific documents. When we think of information retrieval as "fundamentally a linguistic process" (Blair, 2003) users have to be aware of the most relevant search terms - which are the controlled thesauri terms the documents are described with. This can be achieved with so-called search-term-recommenders (STR) that map free search terms of a lay user to controlled vocabulary terms which can then be used as a term suggestion or to do an automatic query expansion (Hienert, Schaer, Schaible, & Mayr, 2011). State-of-the-art repository software systems like DSpace or EPrints already offer some kind of term suggestion features in search or input forms but these implementations only work as simple auto completion mechanisms that don't incorporate any kind of semantic mapping. Such software systems would gain a lot in terms of usability and data consistency if tools like the proposed domain-specific STRs would be freely available. We aim to implement a rich toolbox of web services (like the mentioned domain-specific STRs) to support users and providers of online Digital Library (DL) or repository systems.
[1]
York Sure-Vetter,et al.
Science models as value-added services for scholarly information systems
,
2011,
Scientometrics.
[2]
P. Jaccard.
Distribution de la flore alpine dans le bassin des Dranses et dans quelques régions voisines
,
1901
.
[3]
Susan T. Dumais,et al.
The vocabulary problem in human-system communication
,
1987,
CACM.
[4]
Philipp Mayr,et al.
A Novel Combined Term Suggestion Service for Domain-Specific Digital Libraries
,
2011,
TPDL.
[5]
David C. Blair,et al.
Information retrieval and the philosophy of language
,
1992,
Annu. Rev. Inf. Sci. Technol..
[6]
Ismael Rafols,et al.
A global map of science based on the ISI subject categories
,
2009,
J. Assoc. Inf. Sci. Technol..
[7]
Paul M. B. Vitányi,et al.
Normalized Web Distance and Word Similarity
,
2009,
Handbook of Natural Language Processing.