Aerial video surveillance and exploitation

There is growing interest in performing aerial surveillance using video cameras. Compared to traditional framing cameras, video cameras provide the capability to observe ongoing activity within a scene and to automatically control the camera to track that activity. However, the high data rates and relatively small field of view of video cameras present new technical challenges that must be overcome before such cameras can be widely used. In this paper, we present a framework, and details of its key components, for real-time, automatic exploitation of aerial video for surveillance applications. The framework involves separating an aerial video into the natural components corresponding to the scene. Three major components of the scene are the static background geometry, the moving objects, and the appearance of the static and dynamic parts of the scene. To decompose videos into these scene components, we have developed real-time image-processing techniques for 2-D/3-D frame-to-frame alignment, change detection, camera control, and tracking of independently moving objects in cluttered scenes. The geo-location of the video and of tracked objects is estimated by registering the video to controlled reference imagery, elevation maps, and site models. Finally, static, dynamic, and reprojected mosaics may be constructed for compression, enhanced visualization, and mapping applications.
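
The idea of aligning consecutive frames and then looking for residual change can be illustrated with a small sketch. This is not the method of the paper, which uses 2-D/3-D parametric alignment; it swaps in a much simpler technique, a brute-force search over integer translations, followed by thresholded frame differencing. All names here (`estimate_translation`, `change_mask`, `synthetic_pair`) and the synthetic test scene are assumptions made for illustration only.

```python
import random
from typing import List, Tuple

Frame = List[List[float]]  # grayscale image as rows of pixel intensities


def estimate_translation(prev: Frame, curr: Frame, max_shift: int = 3) -> Tuple[int, int]:
    """Find the integer (dx, dy) that best maps prev onto curr.

    Simplifying assumption: camera motion is a pure translation of at most
    max_shift pixels. Each candidate is scored by the mean absolute
    difference over the region where the two frames overlap.
    """
    h, w = len(prev), len(prev[0])
    best, best_err = (0, 0), float("inf")
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            total, count = 0.0, 0
            for y in range(max(0, dy), min(h, h + dy)):
                for x in range(max(0, dx), min(w, w + dx)):
                    total += abs(curr[y][x] - prev[y - dy][x - dx])
                    count += 1
            if count and total / count < best_err:
                best, best_err = (dx, dy), total / count
    return best


def change_mask(prev: Frame, curr: Frame, shift: Tuple[int, int],
                threshold: float) -> List[List[bool]]:
    """Mark pixels of curr that differ from the aligned prev by more than threshold.

    Pixels outside the overlap region are left False (no evidence either way).
    """
    dx, dy = shift
    h, w = len(prev), len(prev[0])
    mask = [[False] * w for _ in range(h)]
    for y in range(max(0, dy), min(h, h + dy)):
        for x in range(max(0, dx), min(w, w + dx)):
            mask[y][x] = abs(curr[y][x] - prev[y - dy][x - dx]) > threshold
    return mask


def synthetic_pair(size: int = 16, margin: int = 4,
                   shift: Tuple[int, int] = (2, 1), seed: int = 0) -> Tuple[Frame, Frame]:
    """Build two crops of one random textured scene, offset by shift.

    A bright 2x2 "object" is painted into the second frame only, standing in
    for something that changed between frames (hypothetical test data).
    """
    rng = random.Random(seed)
    n = size + 2 * margin
    world = [[rng.random() for _ in range(n)] for _ in range(n)]
    dx, dy = shift

    def crop(ox: int, oy: int) -> Frame:
        return [row[ox:ox + size] for row in world[oy:oy + size]]

    prev = crop(margin, margin)
    curr = crop(margin - dx, margin - dy)
    for y in (8, 9):
        for x in (8, 9):
            curr[y][x] = 5.0
    return prev, curr


if __name__ == "__main__":
    prev, curr = synthetic_pair()
    shift = estimate_translation(prev, curr)
    mask = change_mask(prev, curr, shift, threshold=1.0)
    changed = [(y, x) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    print("estimated shift (dx, dy):", shift)
    print("changed pixels (row, col):", changed)
```

Because the background cancels once the frames are aligned, only the pixels belonging to the new object should survive the threshold. A real system would need sub-pixel, parametric (for example affine or projective) alignment and a robust error measure, since large moving objects can bias a plain mean-difference score.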
