On the semantics of object-oriented landmark recognition

Computer vision is an ever more important means for the navigation of UAVs. Here we propose a landmark recognition system looking for salient man-made infrastructure. An object-oriented structural system is preferred since it can utilize known properties of these objects such as part-of hierarchies, mutual geometric constraints of parts, generalization etc. The structure, available for use as landmark, will vary strongly with the region the UAV is supposed to navigate in. The structural knowledge can lose its meaning in two ways: 1) If the area contains a lot of non-intended structure fulfilling the demands modeled the system will start hallucinating lots of landmarks anywhere. 2) If the landmarks in the area do not fulfill the demands modeled they will not be detected. Up to a certain degree these semantics—or lack of meaning—can be investigated mathematically using probabilistic models. But the results from this are very optimistic. In reality the meaning breaks down much earlier. This contribution reports on an example: Testing a system, designed for a central European country (Germany), for use elsewhere (e.g., Russia or Turkey).

[1]  Makoto Nagao,et al.  A Structural Analysis of Complex Aerial Photographs , 1980, Advanced Applications in Pattern Recognition.

[2]  Zhihai He,et al.  Vision-based UAV flight control and obstacle avoidance , 2006, 2006 American Control Conference.

[3]  E. Baltsavias,et al.  Automatic Extraction of Man-Made Objects from Aerial and Space Images (II) , 1995 .

[4]  Jean-Michel Morel,et al.  From Gestalt Theory to Image Analysis: A Probabilistic Approach , 2007 .

[5]  Enis Bayramoglu,et al.  Visual Navigation for Mobile Robots , 2010 .

[6]  Bruce A. Draper,et al.  The schema system , 1988, International Journal of Computer Vision.

[7]  E. Michaelsen,et al.  An accumulating interpreter for cognitive vision production systems , 2011, Pattern Recognition and Image Analysis.

[8]  Bea Thai,et al.  All source adaptive fusion for aided navigation in non-GPS environment , 2007, SPIE Defense + Commercial Sensing.

[9]  K. Lütjen BPI: Ein Blackboard-basiertes Produktionssystem für die automatische Bildauswertung , 1986, DAGM-Symposium.

[10]  Francisco Bonin-Font,et al.  Visual Navigation for Mobile Robots: A Survey , 2008, J. Intell. Robotic Syst..

[11]  Jean-Michel Morel,et al.  From Gestalt Theory to Image Analysis , 2008 .

[12]  Eckart Michaelsen,et al.  A GOOGLE-earth based test bed for structural image-based UAV navigation , 2009, 2009 12th International Conference on Information Fusion.

[13]  Bernd Meyer,et al.  Visual Language Theory , 2012, Springer New York.

[14]  Uwe Stilla,et al.  Semantic Modelling of Man-Made Objects by Production Nets , 1997 .

[15]  Michael Arens,et al.  Interaction of Control and Knowledge in a Structural Recognition System , 2009, KI.

[16]  Eckart Michaelsen Über Koordinaten Grammatiken zur Bildverarbeitung und Szenenanalyse , 1998 .

[17]  E. Michaelsen,et al.  An accumulating interpreter for cognitive vision production systems , 2012, Pattern Recognition and Image Analysis.

[18]  Song-Chun Zhu,et al.  Modeling Visual Patterns by Integrating Descriptive and Generative Methods , 2004, International Journal of Computer Vision.

[19]  Uwe Soergel,et al.  Extraction of building polygons from SAR images: Grouping and decision-level in the GESTALT System , 2008, 2008 IAPR Workshop on Pattern Recognition in Remote Sensing (PRRS 2008).