Paper Information - Efficient Pipelines for Vision-Based Context Sensing

Efficient Pipelines for Vision-Based Context Sensing

Context awareness is an essential part of mobile and ubiquitous computing. Its goal is to unveil situational information about mobile users, such as their locations and activities. The sensed context can enable many services, such as navigation, AR, and smart shopping. Such context can be sensed in different ways, including with visual sensors. Vision sources are increasingly deployed worldwide: cameras can be installed on roadsides, indoors, and on mobile platforms. This trend provides a huge amount of vision data that could be used for context sensing. However, vision data collection and analytics are still highly manual today. It is hard to deploy cameras at large scale for data collection, and organizing and labeling context from the data is also labor intensive. In recent years, advanced vision algorithms and deep neural networks have been used to help analyze vision data, but this approach is limited by data quality, labeling effort, and dependency on hardware resources. In summary, there are three major challenges for today's vision-based context sensing systems: collecting and labeling data at large scale, processing large data volumes efficiently with limited hardware resources, and extracting accurate context from vision data. The thesis explores a design space that consists of three dimensions: sensing task, sensor types, and task locations. Our prior work explores several points in this design space. We make contributions by (1) developing efficient and scalable solutions for different points in the design space of vision-based sensing tasks; (2) achieving state-of-the-art accuracy in those applications; and (3) developing guidelines for designing such sensing systems.

Xiaochen Liu
