This paper proposes a novel non-linear dimensionality reduction algorithm, named double-layer isometric feature mapping (DLIso), which generates the trajectories for the video sequence containing different kinds of video clips. First, a nearest neighbor based clustering algorithm is utilized to partition the video sequence into a set of data blocks. Second, intra-cluster graphs are constructed based on the individual character of each data block to build the basic layer for DLIso. Third, the inter-cluster graph is constructed by analyzing the interrelation among these isolated data blocks to build the hyper-layer. Finally, all data points are mapped onto a unique low-dimensional feature space while preserving the corresponding relations in the double layers. Experiments on synthetic datasets as well as the real video sequences demonstrate that the low-dimensional trajectories generated by the proposed method correctly represent the semantic information of the data.
[1]
Jorma Laaksonen,et al.
Rushes summarization with self-organizing maps
,
2007,
TVS '07.
[2]
Avideh Zakhor,et al.
Content analysis of video using principal components
,
1998,
IEEE Trans. Circuits Syst. Video Technol..
[3]
Joshua B. Tenenbaum,et al.
Mapping a Manifold of Perceptual Observations
,
1997,
NIPS.
[4]
Robert Pless,et al.
Image spaces and video trajectories: using Isomap to explore video sequences
,
2003,
Proceedings Ninth IEEE International Conference on Computer Vision.
[5]
J. Tenenbaum,et al.
A global geometric framework for nonlinear dimensionality reduction.
,
2000,
Science.
[6]
Maja J. Mataric,et al.
A spatio-temporal extension to Isomap nonlinear dimension reduction
,
2004,
ICML.