Building Smart Transportation Hubs with 3D Vision and Video Technologies to Improve Servicesto People with Disabilities

1 Large transportation hubs are difficult to navigate, especially for people with disabilities such as those 2 with visual or mobility impairment, Autism Spectrum Disorder (ASD), or simply those with navigation 3 challenges. The primary objective of this research is to design and develop a novel cyber-physical 4 infrastructure that can effectively and efficiently transform existing transportation hubs into smart 5 facilities capable of providing better location-aware services (e.g. finding terminals, improving travel 6 experience, obtaining security alerts). We investigated the integration of a number of novel Internet of 7 Things elements, including video analytics, low-cost Bluetooth beacons, mobile computing, and LiDAR8 scanned 3D semantic models, to provide reliable indoor navigation services to people with traveling 9 challenges, yet requiring minimum infrastructure changes since our approach leverages existing 10 cyberinfrastructures such as surveillance cameras, facility models, and mobile phones, and incorporates a 11 minimum number of new and small devices such as beacons to achieve reliable navigation services. We 12 choose two groups of people for our initial study– those with visual impairment and ASD since both 13 groups face difficulties in a crowded and complex 3D environment. Thus two unique features of our 14 solution are the use of 3D digital semantic models and crowd analysis with surveillance cameras for 15 providing the best available paths. We have started a pilot test with people with disabilities at a multi16 floor building in New York City to demonstrate the effectiveness of our proposed framework. 17 18 19 20 Glossary of Terms: 21 ASD: Autism Spectrum Disorder 22 BLE: Bluetooth Low Energy 23 BoW: Bag of Words 24 BVI: Blind and Visual Impairment 25 CNN: Convolutional Neural Network 26 ConvNet: Convolutional Neural Network 27 DCT: Discrete Cosine Transform 28 GIST: a low dimensional representation of the scene, which does not require any form of segmentation 29 IoT: Internet of things 30 Lidar: Light Detection and Ranging 31 SfM: Structure from Motion 32 33 34 35 36

[1]  Radu Bogdan Rusu,et al.  Semantic 3D Object Maps for Everyday Manipulation in Human Living Environments , 2010, KI - Künstliche Intelligenz.

[2]  Zhigang Zhu,et al.  Mobile Panoramic Vision for Assisting the Blind via Indexing and Localization , 2014, ECCV Workshops.

[3]  Giovanni Fusco,et al.  Towards a Sign-Based Indoor Navigation System for People with Visual Impairments , 2016, ASSETS.

[4]  Xiaogang Wang,et al.  Cross-scene crowd counting via deep convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Andrew Zisserman,et al.  Learning To Count Objects in Images , 2010, NIPS.

[6]  Lakshminarayanan Subramanian,et al.  Mobile Accessibility Tools for the Visually Impaired , 2012 .

[7]  Roberto Manduchi,et al.  Mobile Vision as Assistive Technology for the Blind: An Experimental Study , 2012, ICCHP.

[8]  Torsten Sattler,et al.  Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Roberto Manduchi,et al.  The last meter: blind visual guidance to a target , 2014, CHI.

[10]  Serge J. Belongie,et al.  Recognizing locations with Google Glass: A case study , 2014, IEEE Winter Conference on Applications of Computer Vision.

[11]  Hanghang Tong,et al.  Mobile Crowd Assisted Navigation for the Visually Impaired , 2015, 2015 IEEE 12th Intl Conf on Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom).

[12]  Andrew G. Dempster,et al.  How feasible is the use of magnetic field alone for indoor positioning? , 2012, 2012 International Conference on Indoor Positioning and Indoor Navigation (IPIN).

[13]  James M. Coughlan,et al.  Smartphone-based crosswalk detection and localization for visually impaired pedestrians , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[14]  Aaron Steinfeld,et al.  Incorporating information from trusted sources to enhance urban navigation for blind travelers , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[16]  Shaogang Gong,et al.  Feature Mining for Localised Crowd Counting , 2012, BMVC.

[17]  Giovanni Maria Farinella,et al.  Representing scenes for real-time context classification on mobile devices , 2015, Pattern Recognit..

[18]  Hironobu Takagi,et al.  NavCog: a navigational cognitive assistant for the blind , 2016, MobileHCI.

[19]  Harold Willmington,et al.  What You Need to Know About You , 2010 .

[20]  Niko Sünderhauf,et al.  On the performance of ConvNet features for place recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21]  Sheikh Tahir Bakhsh,et al.  Indoor positioning in Bluetooth networks using fingerprinting and lateration approach , 2011, 2011 International Conference on Information Science and Applications.

[22]  Gérard Lachapelle,et al.  Indoor Positioning System Using Accelerometry and High Accuracy Heading Sensors , 2003 .

[23]  Anant Sahai,et al.  Algorithms for GPS operation indoors and downtown , 2002 .

[24]  R. Varma,et al.  Visual Impairment and Blindness in Adults in the United States: Demographic and Geographic Variations From 2015 to 2050. , 2016, JAMA ophthalmology.

[25]  Tao Chen,et al.  Landmark recognition with compact BoW histogram and ensemble ELM , 2015, Multimedia Tools and Applications.