Integrating explicit semantic analysis for ontology-based resource selection

In federated information systems, deciding whether an information source is relevant for a given query is crucial for its overall performance. Focusing on uncooperative unstructured information sources, we analyze several drawbacks of the popular CORI resource selection algorithm by evaluating it in a federated product information scenario. Based on these results, we propose and describe a novel approach using an ontology-based sampling method, which is used to initialize an Explicit Semantic Analysis index.