Combining image regions and human activity for indirect object recognition in indoor wide-angle views

Traditional methods of object recognition are reliant on shape and so are very difficult to apply in cluttered, wide angle and low detail views such as surveillance scenes. To address this, a method of indirect object recognition is proposed, where human activity is used to infer both the location and identity of objects. No shape analysis is necessary. The concept is dubbed 'interaction signatures', since the premise is that a human interacts with objects in ways characteristic of the function of that object - for example, a person sits in a chair and drinks from a cup. The human-centred approach means that recognition is possible in low detail views and is largely invariant to the shape of objects within the same functional class. This paper implements a Bayesian network for classifying region patches with object labels, building upon our previous work in automatically segmenting and recognising a human's interactions with the objects. Experiments show that interaction signatures can successfully find and label objects in low detail views and are equally effective at recognising test objects that differ markedly in appearance from the training objects.

[1]  Patrick J. Flynn,et al.  A Survey Of Free-Form Object Representation and Recognition Techniques , 2001, Comput. Vis. Image Underst..

[2]  Irfan A. Essa,et al.  Exploiting human actions and object context for recognition tasks , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Linda G. Shapiro,et al.  Image Segmentation Techniques , 1984, Other Conferences.

[4]  Svetha Venkatesh,et al.  Using interaction signatures to find and label chairs and floors , 2004, IEEE Pervasive Computing.

[5]  Tim J. Ellis,et al.  Partial Observation vs. Blind Tracking through Occlusion , 2002, BMVC.

[6]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[7]  Leonard G. C. Hamey,et al.  Object Recognition, A Survey of the Literature , 1991 .

[8]  W. Eric L. Grimson,et al.  Using adaptive tracking to classify and monitor activities in a site , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[9]  M. Teal,et al.  Spatial-Temporal Reasoning Based on Object Motion , 1996, BMVC.

[10]  Svetha Venkatesh,et al.  Robust Recognition and Segmentation of Human Actions Using HMMs with Missing Observations , 2005, EURASIP J. Adv. Signal Process..

[11]  Tim J. Ellis,et al.  Finding Paths in Video Sequences , 2001, BMVC.

[12]  Konrad Tollmar,et al.  Activity Zones for Context-Aware Computing , 2003, UbiComp.