Leveraging Elasticsearch to Improve Data Discoverability in Science Gateways
暂无分享,去创建一个
Data discoverability is a challenge in science gateway architectures. As the volume of data managed and shared through a science gateway grows, it is imperative to expose a search functionality which enables users to quickly navigate to files within their own data sets as well as to identify relevant files in shared or public data sets. Desirable qualities in a file search feature include scalability to arbitrary data sizes, rapid and responsive indexing triggered by user activity, and easy maintainability by development teams without specialist knowledge of search algorithms. We describe a search architecture built around Elasticsearch that meets each of these criteria, and which has been successfully implemented at the Texas Advanced Computing Center to enhance data discoverability in several science gateway projects.
[1] Josue Balandrano Coronel,et al. DesignSafe: Using Elasticsearch to Share and Search Data on a Science Web Portal , 2017, PEARC.
[2] Rion Dooley,et al. Software-as-a-Service: The iPlant Foundation API , 2012 .