Project-Team PRIMA: Perception, recognition and integration for interactive environments

derivatives computed in the cardinal directions. The weights are provided by the direction cosines of the rotation.

equipped with a microphone array, a fixed wide-angle camera, five steerable cameras, and three "video interaction devices". The microphone array is used as an acoustic sensor to detect, locate and classify acoustic signals for recognizing human activities. The wide-angle camera provides a field of view that covers the entire room, and allows detection and tracking of individuals. Steerable cameras are installed in each of the four corners of the room, and are used to acquire video of activities from any viewing direction. Video interaction devices associate a camera with a video projector to provide new modes of man-machine interaction. Such devices may be used for interaction, presentation or capture of information based on natural activity. Examples include selecting menus and buttons with a finger and capturing drawings from paper or a whiteboard. Fixed video interaction devices in the AME have been constructed for a vertical surface (a wall-mounted whiteboard) and a horizontal desktop workspace.

Recently, a steerable interaction device has been constructed based on a tightly integrated steerable camera-projector pair (SCP). The SCP, described below, allows any surface to be used for interaction with information. It also offers a range of new sensing techniques, including automatic surveillance of an environment to discover the environment topology, as well as the use of structured light for direct sensing of texture-mapped 3D models.

The behavior of this system is specified as a situation graph that is automatically compiled into rules for a Java-based supervisory process. The design process for compiling a situation graph into a rule base for the federation supervisor is described.
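The idea of compiling a situation graph into rules can be illustrated with a minimal sketch. It is written in Python for brevity, not in the Java of the actual supervisor, and everything in it is an assumption made for illustration: the source does not give the graph format, the rule syntax, or the compilation procedure. The sketch assumes that each arc of the graph links a source situation to a target situation through a triggering event, and that each arc becomes one condition-action rule. The names `Arc`, `Rule`, `compile_graph` and `Supervisor`, as well as the example situations and events, are hypothetical.

```python
from dataclasses import dataclass


# Hypothetical data model: the source does not specify the graph format.
@dataclass(frozen=True)
class Arc:
    source: str  # situation the system is currently in
    event: str   # observed event that triggers the transition
    target: str  # situation entered when the event is observed


@dataclass(frozen=True)
class Rule:
    condition: tuple  # (current situation, event)
    action: str       # next situation


def compile_graph(arcs):
    """Turn each arc of the situation graph into one condition-action rule."""
    return [Rule((a.source, a.event), a.target) for a in arcs]


class Supervisor:
    """Minimal rule interpreter standing in for the federation supervisor."""

    def __init__(self, rules, initial):
        self.table = {r.condition: r.action for r in rules}
        self.situation = initial

    def observe(self, event):
        # Fire the matching rule if there is one; otherwise stay in place.
        key = (self.situation, event)
        self.situation = self.table.get(key, self.situation)
        return self.situation


# Invented example situations and events for a meeting room.
arcs = [
    Arc("empty", "person_enters", "occupied"),
    Arc("occupied", "speech_detected", "meeting"),
    Arc("meeting", "room_cleared", "empty"),
]

supervisor = Supervisor(compile_graph(arcs), initial="empty")
for event in ["person_enters", "speech_detected", "finger_on_board"]:
    print(event, "->", supervisor.observe(event))
```

In this sketch an event with no matching rule leaves the current situation unchanged. In a real federation, a rule's action would presumably also reconfigure the observation processes, which the sketch leaves out.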
The process federation, as well as the visual and acoustic observation processes for version 1.0 of the system, were demonstrated to the European Commission in Luxembourg in October 2003, as part of the annual review of project IST FAME. A public demonstration of this system will be held at the World Cultural Forum in Barcelona in July 2004.

FGnet is a thematic network devoted to visual techniques for detection, tracking and recognition of faces and gestures. The aim of this project is to encourage technology development in the area of face and gesture recognition. The precise goals are: (1) to act as a focus for the workers developing face and gesture recognition technology; (2) to create a set of foresight reports defining development roadmaps and future use scenarios for the technology in the medium (5-7 years) and long (10-20 years) term; (3) to specify, develop and supply resources (e.g. image sets) supporting these scenarios; (4) to use these resources to encourage technology development. The use of shared resources and data sets to encourage the development of complex process and recognition systems has been very successful in the speech analysis and recognition field, and in the image analysis field in the specific cases where it has been applied. The basis of the project is that, when properly defined and collected, such resources would also benefit work on wider problems in face and gesture recognition. Project PRIMA is responsible for organizing dissemination workshops for FGnet, as well as contributing to the collection of benchmark data sets for performance evaluation. FGnet has provided resources for organizing the PETS series of workshops (Performance Evaluation for Tracking and Surveillance).
