Adapted deep feature fusion for person re-identification in aerial images

Person re-identification is the task of matching visual appearances of the same person in image or video data while distinguishing appearances of different persons. With falling hardware costs cameras mounted on unmanned aerial vehicles (UAVs) have become increasingly useful for security and surveillance tasks in recent years. Re-identification approaches have to adapt to the new challenges posed by this type of data, such as unusual and changing viewpoints or camera motion. Furthermore, the characteristics of the data will change between the scenarios the UAV is used in. This requires robust models that can handle a wide range of characteristics. In this work, we train convolutional neural networks for person re-identification. However, datasets of sufficient size for training all consist of data from fixed camera networks. We show that the resulting models, while performing strongly on camera network data, struggle to handle the different characteristics of aerial imagery, likely because of an overfitting to data bias inherent in the training data. To address this issue we combine the deep features with hand-crafted covariance features which introduce a higher degree of invariance into our combined representation. The fusion of both types of features is achieved by including the covariance information into the training process of the deep model. We evaluate the combined representation on a dataset consisting of twelve people moving through a scene recorded by four fixed cameras and one mobile aerial camera. We discuss strengths and weaknesses of the features and show that our combined approach outperforms baselines as well as previous work.

[1]  Qi Tian,et al.  Scalable Person Re-identification: A Benchmark , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Bernt Schiele,et al.  DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model , 2016, ECCV.

[3]  Chunxiao Liu,et al.  Person Re-identification: What Features Are Important? , 2012, ECCV Workshops.

[4]  Xiang Li,et al.  An enhanced deep feature representation for person re-identification , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[5]  Lucas Beyer,et al.  In Defense of the Triplet Loss for Person Re-Identification , 2017, ArXiv.

[6]  Yang Song,et al.  Person re-identification using visual attention , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[7]  Xiaogang Wang,et al.  DeepReID: Deep Filter Pairing Neural Network for Person Re-identification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Alexandre Bernardino,et al.  The HDA+ Data Set for Research on Fully Automated Re-identification Systems , 2014, ECCV Workshops.

[9]  Michael Jones,et al.  An improved deep learning architecture for person re-identification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  P. Meer,et al.  Covariance Tracking using Model Update Based on Means on Riemannian Manifolds , 2005 .

[11]  Alessandro Perina,et al.  Person re-identification by symmetry-driven accumulation of local features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Shiliang Zhang,et al.  Pose-Driven Deep Convolutional Model for Person Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[13]  Vittorio Murino,et al.  Custom Pictorial Structures for Re-identification , 2011, BMVC.

[14]  Horst-Michael Groß,et al.  Evaluation of multi feature fusion at score-level for appearance-based person re-identification , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[15]  Qi Tian,et al.  Query-adaptive late fusion for image search and person re-identification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Xiaogang Wang,et al.  Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jingdong Wang,et al.  Deeply-Learned Part-Aligned Representations for Person Re-identification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[18]  Yi Yang,et al.  Person Re-identification: Past, Present and Future , 2016, ArXiv.

[19]  Chunxiao Liu,et al.  On-the-fly feature importance mining for person re-identification , 2014, Pattern Recognit..

[20]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Arne Schumann,et al.  Person re-identification across aerial and ground-based cameras by deep feature fusion , 2017, Defense + Security.