A distance metric learning based summarization system for nursery school surveillance video

In this paper, we present a system for summarizing nursery school surveillance video. The system takes full use of a learned distance metric, which can properly measure the similarity between videos. The metric is combined with supervised classification and unsupervised clustering, to categorize raw video materials into individual events. By selecting representative videos for each event, the system produces short video digests as the summarization output. The digests cover and reflect the children's activities on a daily basis. They are not only of interest to the parents, but also provide easy access to the mass quantity of daily surveillance video data. We implemented the proposed system in a real nursery school environment and confirmed its performance through both quantitative experiment and questionnaire survey.

[1]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[2]  Deva Ramanan,et al.  Local distance functions: A taxonomy, new algorithms, and an evaluation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Koichi Miura,et al.  Motion Based Automatic Abstraction of Cooking Videos , 2003 .

[4]  Noboru Babaguchi,et al.  Video Summarization for Large Sports Video Archives , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[5]  Shigeki Yokoi,et al.  A Practical Video Digest Generation System Designed for Nursery Schools , 2011, MVA.

[6]  Mikel D. Rodriguez CRAM: Compact representation of actions in movies , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.