Data management for high energy physics experiments — Preliminary proposals☆
暂无分享,去创建一个
Abstract Currently HEP experimental data are reduced as they become available. We propose instead a “demand driven” approach to data analysis. Full analysis will be performed only as needed, in response to user queries which specify the subset of events for which reduced data is needed. To support this approach we propose to partition the datasets on the cross product of several trigger inputs, instead of storing the data in chronological order. Queries will be automatically decomposed into a set of requests against several partitions. Indexing, physically clustering the data on the logical partitions, and caching of partitions will be employed for efficiency.
[1] Philippe Flajolet,et al. Partial match retrieval of multidimensional data , 1986, JACM.
[2] Jack A. Orenstein. Multidimensional Tries Used for Associative Searching , 1982, Inf. Process. Lett..
[3] Frank Olken. Physical Database Support for Scientific and Statistical Database Management , 1986, SSDBM.