Joint Attention in Driver-Pedestrian Interaction: from Theory to Practice

One of the major challenges facing autonomous vehicles today is driving in urban environments. This task requires communication between autonomous vehicles and other road users in order to resolve various traffic ambiguities. The interaction between road users is a form of negotiation in which the parties involved have to share their attention regarding a common objective or goal (e.g. crossing an intersection) and coordinate their actions in order to accomplish it. In this literature review, we address the interaction problem between pedestrians and drivers (or vehicles) from a joint attention point of view. More specifically, we discuss the theoretical background behind joint attention, its application to traffic interaction, and practical approaches to implementing joint attention for autonomous vehicles.


[231]  Larry S. Davis,et al.  Multiple vehicle detection and tracking in hard real-time , 1996, Proceedings of Conference on Intelligent Vehicles.

[232]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[233]  Reinhold Behringer,et al.  The seeing passenger car 'VaMoRs-P' , 1994, Proceedings of the Intelligent Vehicles '94 Symposium.

[234]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[235]  Luis Salgado,et al.  Video analysis-based vehicle detection and tracking using an MCMC sampling framework , 2012, EURASIP J. Adv. Signal Process..

[236]  Dariu Gavrila,et al.  UvA-DARE ( Digital Academic Repository ) Pedestrian Path Prediction with Recursive Bayesian Filters : A Comparative Study , 2013 .

[237]  M. Asada,et al.  How does an infant acquire the ability of joint attention?: A Constructive Approach , 2003 .

[238]  Ming-Hsuan Yang,et al.  Top-down visual saliency via joint CRF and dictionary learning , 2012, CVPR.

[239]  Hideki Kozima,et al.  Can a robot empathize with people? , 2004, Artificial Life and Robotics.

[240]  Satish V. Ukkusuri,et al.  Modeling of Motorist-Pedestrian Interaction at Uncontrolled Mid-block Crosswalks , 2003 .

[241]  S. Baron-Cohen Mindblindness: An Essay on Autism and Theory of Mind , 1997 .

[242]  S. Axelrod,et al.  Eye contact as an antecedent to compliant behavior. , 1984, Journal of applied behavior analysis.

[243]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[244]  Kenji Mase,et al.  Activity and Location Recognition Using Wearable Sensors , 2002, IEEE Pervasive Comput..

[245]  Cynthia Breazeal,et al.  Social Robots that Interact with People , 2008, Springer Handbook of Robotics.

[246]  C. Lawrence Zitnick,et al.  Zero-Shot Learning via Visual Abstraction , 2014, ECCV.

[247]  Joan Serrat,et al.  Nighttime Vehicle Detection for Intelligent Headlight Control , 2008, ACIVS.

[248]  Mohan M. Trivedi,et al.  Looking-in and looking-out vision for Urban Intelligent Assistance: Estimation of driver attentive state and dynamic surround for safe merging and braking , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[249]  Dariu Gavrila,et al.  Context-Based Pedestrian Path Prediction , 2014, ECCV.

[250]  Antonio Criminisi,et al.  Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[251]  Nassir Navab,et al.  3D Pictorial Structures for Multiple Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[252]  Richard P. Wildes,et al.  Dynamically encoded actions based on spacetime saliency , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[254]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[255]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[256]  James Hays,et al.  SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[257]  Alex Graves,et al.  Neural Turing Machines , 2014, ArXiv.

[258]  Ling Bao,et al.  Activity Recognition from User-Annotated Acceleration Data , 2004, Pervasive.

[259]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[260]  Xiaogang Wang,et al.  Saliency detection by multi-context deep learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[261]  Rui Zhang,et al.  Top-Down Saliency Detection via Contextual Pooling , 2014, J. Signal Process. Syst..

[262]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[263]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[264]  Charles E. Thorpe,et al.  Vision-based neural network road and intersection detection and traversal , 1995, Proceedings 1995 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human Robot Interaction and Cooperative Robots.

[265]  Dumitru Erhan,et al.  Scalable Object Detection Using Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[266]  Christoph Stiller,et al.  Comparison and evaluation of pedestrian motion models for vehicle safety systems , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[267]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[268]  Dean A. Pomerleau,et al.  Neural Network Vision for Robot Driving , 1997 .

[269]  Alex Mihailidis,et al.  A Survey on Ambient-Assisted Living Tools for Older Adults , 2013, IEEE Journal of Biomedical and Health Informatics.

[270]  Yang Wang,et al.  Recognizing human actions from still images with latent poses , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[271]  Massimo Bertozzi,et al.  Stereo vision-based vehicle detection , 2000, Proceedings of the IEEE Intelligent Vehicles Symposium 2000 (Cat. No.00TH8511).

[272]  Mubarak Shah,et al.  Recognizing 50 human action categories of web videos , 2012, Machine Vision and Applications.

[273]  Feng Jiang,et al.  Pedestrian behavior analysis using 110-car naturalistic driving data in USA , 2013 .

[274]  Christopher R. Baker,et al.  A reasoning framework for autonomous urban driving , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[275]  Karl Rohr,et al.  Incremental recognition of pedestrians from image sequences , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[276]  Ho Gi Jung,et al.  A New Approach to Urban Pedestrian Detection for Automatic Braking , 2009, IEEE Transactions on Intelligent Transportation Systems.

[277]  Michael Goldhammer,et al.  Analysis on termination of pedestrians' gait at urban intersections , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[278]  Amaury Nègre,et al.  Probabilistic Analysis of Dynamic Scenes and Collision Risks Assessment to Improve Driving Safety , 2011, IEEE Intelligent Transportation Systems Magazine.

[279]  Varun Ramakrishna,et al.  Convolutional Pose Machines , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[280]  Guodong Guo,et al.  A survey on still image based human action recognition , 2014, Pattern Recognit..

[281]  Stan Sclaroff,et al.  Learning Activity Progression in LSTMs for Activity Detection and Early Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[282]  Albert Ali Salah,et al.  Head Pose and Neural Network Based Gaze Direction Estimation for Joint Attention Modeling in Embodied Agents , 2009 .

[283]  Luc Van Gool,et al.  You'll never walk alone: Modeling social behavior for multi-target tracking , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[284]  G. P. Bhattacharjee,et al.  Temporal representation and reasoning in artificial intelligence: A review , 2001 .

[285]  Stanley M. Bileschi,et al.  Street Scenes: towards scene understanding in still images , 2006 .

[286]  Mubarak Shah,et al.  Floor Fields for Tracking in High Density Crowd Scenes , 2008, ECCV.

[287]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[288]  Renee L Baillargeon,et al.  Physical Reasoning in Infancy , 2012 .

[289]  Alberto Broggi,et al.  Vision-Based Road Detection in Automotive Systems: A Real-Time Expectation-Driven Approach , 1995, J. Artif. Intell. Res..

[290]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[291]  Greg Mori,et al.  Learning Structured Inference Neural Networks with Label Relations , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[292]  Miguel A. Labrador,et al.  A Survey on Human Activity Recognition using Wearable Sensors , 2013, IEEE Communications Surveys & Tutorials.

[293]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[294]  Hiroshi Murase,et al.  Prediction of driver's pedestrian detectability by image processing adaptive to visual fields of view , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[295]  L. Cosmides The logic of social exchange: Has natural selection shaped how humans reason? Studies with the Wason selection task , 1989, Cognition.

[296]  Jeremy M Wolfe,et al.  Visual Attention , 2020, Computational Models for Cognitive Vision.

[297]  Huimin Ma,et al.  3D Object Proposals for Accurate Object Class Detection , 2015, NIPS.

[298]  S. Gallagher,et al.  Joint attention in joint action , 2013 .

[299]  Kristian Stormo Human Fall Detection Using Distributed Monostatic UWB Radars , 2014 .

[300]  Christoph Stiller,et al.  Path Planning for Autonomous Driving Based on Stereoscopic and Monoscopic Vision Cues , 2006, 2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems.

[301]  Valiallah Monajjemi,et al.  UAV, do you see me? Establishing mutual attention between an uninstrumented human and an outdoor UAV in flight , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[302]  E. D. Dickmanns,et al.  A Curvature-based Scheme for Improving Road Vehicle Guidance by Computer Vision , 1987, Other Conferences.

[303]  Roberto Cipolla,et al.  DEEP-CARVING: Discovering visual attributes by carving deep neural nets , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[304]  Daniel Göhring,et al.  Online vehicle detection using deep neural networks and lidar based preselected image patches , 2016, 2016 IEEE Intelligent Vehicles Symposium (IV).

[305]  I. Walker,et al.  Drivers' gaze fixations during judgements about a bicyclist's intentions , 2007 .

[306]  John K. Tsotsos,et al.  Active object recognition , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[307]  Hans Rudi Fischer,et al.  Abductive Reasoning as a Way of Worldmaking , 2001 .

[308]  Xiaogang Wang,et al.  Deep Learning Strong Parts for Pedestrian Detection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[309]  Larry S. Davis,et al.  Probabilistic template based pedestrian detection in infrared videos , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[310]  L. Carlson,et al.  Spatial Reasoning , 2010 .

[311]  C. J. Radford,et al.  Vehicle detection in open-world scenes using a Hough transform technique , 1989 .

[312]  I. Hyman,et al.  Did you see the unicycling clown? Inattentional blindness while walking and talking on a cell phone , 2009 .

[313]  Mohan M. Trivedi,et al.  Attention estimation by simultaneous analysis of viewer and view , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[314]  Shumeet Baluja,et al.  Evolution of an artificial neural network based autonomous land vehicle controller , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[315]  Klaus Dietmayer,et al.  Pedestrian recognition in urban traffic using a vehicle based multilayer laserscanner , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[316]  Ernest Davis,et al.  Physical Reasoning , 2008, Handbook of Knowledge Representation.

[317]  Andrea Palazzi,et al.  DR(eye)VE: A Dataset for Attention-Based Tasks with Applications to Autonomous and Assisted Driving , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[318]  András Kovács,et al.  Enhancements of V2X communication in support of cooperative autonomous driving , 2015, IEEE Communications Magazine.

[319]  Fabian Kröger,et al.  Automated Driving in Its Social, Historical and Cultural Contexts , 2016 .

[320]  Ankur Agarwal,et al.  3D human pose from silhouettes by relevance vector regression , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[321]  Michael Goldhammer,et al.  Early prediction of a pedestrian's trajectory at intersections , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[322]  Johannes Stallkamp,et al.  The German Traffic Sign Recognition Benchmark: A multi-class classification competition , 2011, The 2011 International Joint Conference on Neural Networks.

[323]  Dariu Gavrila,et al.  Real-time object detection for "smart" vehicles , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[324]  Manfred Tscheligi,et al.  Three Strategies for Autonomous Car-to-Pedestrian Communication: A Survival Guide , 2017, HRI.

[325]  Jana Novovičová,et al.  Road Sign Classification without Color Information , 2000 .

[326]  N. van Nes,et al.  Understanding safety critical interactions between bicycles and motor vehicles in Europe by means of Naturalistic Driving techniques , 2012 .

[327]  Anupam Agrawal,et al.  A survey on activity recognition and behavior understanding in video surveillance , 2012, The Visual Computer.

[328]  Allen Allport,et al.  Visual attention , 1989 .

[329]  Yoshinori Kuno,et al.  Active eye contact for human-robot communication , 2004, CHI EA '04.

[330]  G Underwood,et al.  Visual attention and the transition from novice to advanced driver , 2007, Ergonomics.

[331]  A. Tom,et al.  Gender differences in pedestrian rule compliance and visual search at signalized and unsignalized crossroads. , 2011, Accident; analysis and prevention.

[332]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[333]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[334]  Francisco Madrigal,et al.  Intention-Aware Multiple Pedestrian Tracking , 2014, 2014 22nd International Conference on Pattern Recognition.

[335]  Jitendra Malik,et al.  Learning Rich Features from RGB-D Images for Object Detection and Segmentation , 2014, ECCV.

[336]  Tom Michael Gasser Fundamental and Special Legal Questions for Autonomous Vehicles , 2016 .

[337]  Tsukasa Ogasawara,et al.  A hand-pose estimation for vision-based human interfaces , 2003, IEEE Trans. Ind. Electron..

[338]  Alan Hartley,et al.  Joint Attention is Slowed in Older Adults , 2016, Experimental aging research.

[339]  Arturo de la Escalera,et al.  Traffic sign recognition and analysis for intelligent vehicles , 2003, Image Vis. Comput..

[340]  George Yannis,et al.  A critical assessment of pedestrian behaviour models , 2009 .

[341]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[342]  Bernt Schiele,et al.  2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[343]  Bingbing Ni,et al.  HCP: A Flexible CNN Framework for Multi-Label Image Classification , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[344]  Mike McDonald,et al.  Study of pedestrians' gap acceptance behavior when they jaywalk outside crossing facilities , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[345]  S. Nowicki,et al.  Individual differences in the nonverbal communication of affect: The diagnostic analysis of nonverbal accuracy scale , 1994 .

[346]  Nicolas Guéguen,et al.  A pedestrian’s stare and drivers’ stopping behavior: A field experiment at the pedestrian crossing , 2015 .

[347]  Per Holth,et al.  An Operant Analysis of Joint Attention Skills. , 2005 .

[348]  G. Baird,et al.  Testing joint attention, imitation, and play as infancy precursors to language and theory of mind , 2000 .

[349]  Liang Lin,et al.  Is Faster R-CNN Doing Well for Pedestrian Detection? , 2016, ECCV.

[350]  Sanja Fidler,et al.  Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[351]  Wenyu Liu,et al.  Traffic sign detection and recognition using fully convolutional network guided proposals , 2016, Neurocomputing.

[352]  Alex Pentland,et al.  Invariant features for 3-D gesture recognition , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[353]  Andrea Palazzi,et al.  Learning where to attend like a human driver , 2016, 2017 IEEE Intelligent Vehicles Symposium (IV).

[354]  Dean A. Pomerleau,et al.  Combining artificial neural networks and symbolic processing for autonomous robot guidance , 1991 .

[355]  M. Tomasello,et al.  Joint attention and lexical acquisition style , 1983 .

[356]  Marco Dozza,et al.  Introducing naturalistic cycling data: What factors influence bicyclists’ safety in the real world? , 2014 .

[357]  Patrick Heinemann,et al.  Context-based detection of pedestrian crossing intention for autonomous driving in urban environments , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[358]  Panos D. Bardis Pi Gamma Mu , International Honor Society in Social Sciences Social Interaction and Social Processes , 2013 .

[359]  Anders Lindgren,et al.  Requirements for the Design of Advanced Driver Assistance Systems - The Differences between Swedish and Chinese Drivers , 2008 .

[360]  Mohan M. Trivedi,et al.  Vehicle Detection by Independent Parts for Urban Driver Assistance , 2013, IEEE Transactions on Intelligent Transportation Systems.

[361]  Ying Wang,et al.  Human Activity Recognition Based on R Transform , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[362]  L. A. Zadeh,et al.  Fuzzy logic and approximate reasoning , 1975, Synthese.

[363]  Dariu Gavrila,et al.  Multi-cue pedestrian classification with partial occlusion handling , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[364]  Andrew Zisserman,et al.  Progressive search space reduction for human pose estimation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[365]  T. Kanade,et al.  Toward autonomous driving: the CMU Navlab. I. Perception , 1991, IEEE Expert.

[366]  M. Tomasello,et al.  Shared intentionality. , 2007, Developmental science.

[367]  David González,et al.  A Review of Motion Planning Techniques for Automated Vehicles , 2016, IEEE Transactions on Intelligent Transportation Systems.

[368]  Julius Ziegler,et al.  Making Bertha Drive—An Autonomous Journey on a Historic Route , 2014, IEEE Intelligent Transportation Systems Magazine.

[369]  Dariu Gavrila,et al.  An Experimental Study on Pedestrian Classification , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[370]  Lluís Vila,et al.  A Survey on Temporal Reasoning in Artificial Intelligence , 1994, AI Communications.

[371]  Ulrich Brunsmann,et al.  Autonomous evasive maneuvers triggered by infrastructure-based detection of pedestrian intentions , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).

[372]  C. V. Jawahar,et al.  Has My Algorithm Succeeded? An Evaluator for Human Pose Estimators , 2012, ECCV.

[373]  Mohan M. Trivedi,et al.  Trajectory analysis and prediction for improved pedestrian safety: Integrated framework and evaluations , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[374]  Marilyn N. Abrams,et al.  An Intelligent World Model for Autonomous Off-Road Driving , 2001 .

[375]  Sebastian Thrun,et al.  Self-supervised Monocular Road Detection in Desert Terrain , 2006, Robotics: Science and Systems.

[376]  John M Sullivan,et al.  Differences in geometry of pedestrian crashes in daylight and darkness. , 2011, Journal of safety research.

[377]  John K. Tsotsos,et al.  Understanding Pedestrian Behavior in Complex Traffic Scenes , 2018, IEEE Transactions on Intelligent Vehicles.

[378]  Bin Yang,et al.  Convolutional Channel Features , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[379]  Aude Billard,et al.  Experiments in Learning by Imitation - Grounding and Use of Communication in Robotic Agents , 1999, Adapt. Behav..

[380]  Minoru Asada,et al.  Developmental learning model for joint attention , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[381]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[382]  Jonathan Krause,et al.  3D Object Representations for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[383]  Fan Yang,et al.  Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[384]  Klaus C. J. Dietmayer,et al.  Early detection of the Pedestrian's intention to cross the street , 2012, 2012 15th International IEEE Conference on Intelligent Transportation Systems.

[385]  Nicholas Pennycooke AEVITA : designing biomimetic vehicle-to-pedestrian communication protocols for autonomously operating & parking on-road electric vehicles , 2012 .

[386]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[387]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[388]  David Hsu,et al.  Intention-aware online POMDP planning for autonomous driving in a crowd , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[389]  Luc Van Gool,et al.  Object Classification with Adaptable Regions , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[390]  K. Cave The FeatureGate model of visual selection , 1999, Psychological research.

[391]  Ernst D. Dickmanns,et al.  Distributed Scene Analysis For Autonomous Road Vehicle Guidance , 1987, Other Conferences.

[392]  Ivan Laptev,et al.  ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization , 2016, ECCV.

[393]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[394]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[395]  Tao Deng,et al.  Where Does the Driver Look? Top-Down-Based Saliency Detection in a Traffic Driving Environment , 2016, IEEE Transactions on Intelligent Transportation Systems.

[396]  David W. Payton,et al.  Planning and reasoning for autonomous vehicle control , 1987 .

[397]  Feng Jiang,et al.  Pilot study on pedestrian step frequency in naturalistic driving environment , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).

[398]  Ben Taskar,et al.  Parsing human motion with stretchable models , 2011, CVPR 2011.

[399]  Mohan M. Trivedi,et al.  Attention estimation by simultaneous observation of viewer and view , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[400]  Yehezkel Lamdan,et al.  Object recognition by affine invariant matching , 2011, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[401]  Ali Farhadi,et al.  Actions ~ Transformations , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[402]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[403]  Marek P. Michalowski,et al.  Keepon , 2009, Int. J. Soc. Robotics.

[404]  Ronen Basri,et al.  Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[405]  Nasser Kehtarnavaz,et al.  Traffic sign recognition in noisy outdoor scenes , 1995, Proceedings of the Intelligent Vehicles '95. Symposium.

[406]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[407]  Kenneth D. Forbus,et al.  Solving Everyday Physical Reasoning Problems by Analogy Using Sketches , 2005, AAAI.

[408]  Brian Scassellati Knowing What to Imitate and Knowing When You Succeed , 2007 .

[409]  Ashweeni Beeharee,et al.  Is Joint Attention Detectable at a Distance? Three Automated, Internet-Based Tests. , 2016, Explore.

[410]  Todd Litman,et al.  Autonomous Vehicle Implementation Predictions: Implications for Transport Planning , 2015 .

[411]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[412]  Berthold Färber,et al.  Communication and Communication Problems Between Autonomous Vehicles and Human Drivers , 2016 .

[413]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[414]  Fei-Fei Li,et al.  Towards Viewpoint Invariant 3D Human Pose Estimation , 2016, ECCV.

[415]  Nikolaos Papanikolopoulos,et al.  Vision-based intelligent control of transportation systems , 1995, Proceedings of Tenth International Symposium on Intelligent Control.

[416]  Tomoya Ishikawa,et al.  A method of pedestrian dead reckoning using action recognition , 2010, IEEE/ION Position, Location and Navigation Symposium.

[417]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[418]  A Várhelyi,et al.  Drivers' speed behaviour at a zebra crossing: a case study. , 1998, Accident; analysis and prevention.

[419]  Nicholas J. Ward,et al.  OF VISION ENHANCEMENT SYSTEMS TO IMPROVE DRIVER SAFETY , 2016 .

[420]  Du Tran,et al.  Human Activity Recognition with Metric Learning , 2008, ECCV.

[421]  Thomas A. Dingus,et al.  Driver Inattention: A Contributing Factor to Crashes and Near-Crashes , 2005 .

[422]  Yvonne Barnard,et al.  UDRIVE: the European naturalistic driving study , 2014 .

[423]  Wei Chen,et al.  A research on automatic human fall detection method based on wearable inertial force information acquisition system , 2009, 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[424]  Pinar Duygulu Sahin,et al.  Recognizing actions from still images , 2008, 2008 19th International Conference on Pattern Recognition.

[425]  Matthew W. Crocker,et al.  Visual attention in spoken human-robot interaction , 2009, 2009 4th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[426]  Ian D. Reid,et al.  High Five: Recognising human interactions in TV shows , 2010, BMVC.

[427]  Luis Moreno,et al.  Road traffic sign detection and classification , 1997, IEEE Trans. Ind. Electron..

[428]  D. Yagil Beliefs, motives and situational factors related to pedestrians' self-reported behavior at signal-controlled crossings , 2000 .

[429]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[430]  B. Scassellati Imitation and mechanisms of joint attention: a developmental structure for building social skills on a humanoid robot , 1999 .
