Detection and Synchronization of Video Sequences for Event Reconstruction

With an ever-growing amount of unexpected menaces in crowded places such as terrorist attacks, it is paramount to develop techniques to aid investigators reconstructing all details about an event of interest. To extract reliable information about the event, all kinds of available clues must be jointly exploited. As a matter of fact, today’s sources of information are plenty and varied, as important events affecting many people are typically documented by different sources. Both witnesses’ smartphones and security cameras can provide valuable information coming from multiple viewpoints and time instants — "the eyes of the crowd". In this paper, we focus on the specific problem of automatically detecting and temporally synchronizing videos depicting the same event of interest. Videos can be either near-duplicates (i.e., edited copies of the same original source) or sequences shot by different users from different vantage points. The proposed method relies upon a video fingerprinting technique capable of describing how video semantic content evolves in time. The solution does not assume a priori information about cameras location, and it only exploits visual cues, not relying on audio channels.

[1]  Anderson Rocha,et al.  Vision of the unseen: Current trends and challenges in digital image and video forensics , 2011, CSUR.

[2]  Paolo Bestagini,et al.  Near-duplicate detection and alignment for multi-view videos , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[3]  Paolo Bestagini,et al.  Phylogeny reconstruction for misaligned and compressed video sequences , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[4]  Paolo Bestagini,et al.  NEAR-DUPLICATE VIDEO DETECTION EXPLOITING NOISE RESIDUAL TRACES , 2017 .

[5]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[6]  Nasir D. Memon,et al.  Spatio–Temporal Transform Based Video Hashing , 2006, IEEE Transactions on Multimedia.

[7]  Min Wu,et al.  Information Forensics: An Overview of the First Decade , 2013, IEEE Access.

[8]  A. Piva,et al.  overview paper An overview on video forensics , 2012 .

[9]  John R. Kender,et al.  Fast Near-Duplicate Video Retrieval via Motion Time Series Matching , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[10]  Paolo Bestagini,et al.  Hash-based frame selection for video phylogeny , 2016, 2016 IEEE International Workshop on Information Forensics and Security (WIFS).

[11]  Paolo Bestagini,et al.  Video alignment for phylogenetic analysis , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[12]  Jacques Marescaux,et al.  Multiple video sequences synchronization during minimally invasive surgery , 2016, SPIE Medical Imaging.

[13]  A. Piva An Overview on Image Forensics , 2013 .

[14]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[15]  Paolo Bestagini,et al.  Who is my parent? Reconstructing video sequences from partially matching shots , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[16]  Zi Huang,et al.  Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval , 2013, IEEE Transactions on Multimedia.

[17]  Cordelia Schmid,et al.  Circulant Temporal Encoding for Video Retrieval and Temporal Alignment , 2015, International Journal of Computer Vision.