SFINX: a multisensor fusion and mining system

In a surveillance system, video signals are generated by multiple cameras with or without spatially and temporally overlapping coverage. These signals need to be transmitted, processed, fused, stored, indexed, and then summarized as semantic events to allow efficient and effective querying and mining. SfinX aims to build several core components for multisensor fusion and mining. This paper first depicts SfinX' architecture and its core components, namely, data fusion, event detection, event characterization, event recognition, and storage. In particular, we survey representative methods and discuss plausible research directions for event recognition and storage.