Human Attribute Recognition: A Comprehensive Survey

Human Attribute Recognition (HAR) is a highly active research field in computer vision and pattern recognition, with applications such as surveillance and fashion. Many approaches have been proposed to tackle the particular challenges of HAR, and these approaches have changed dramatically over the last decade, mainly because of the improvements brought by deep learning. To provide insights for future algorithm design and dataset collection, in this survey we (1) provide an in-depth analysis of existing HAR techniques, focusing on the advances proposed to address HAR's main challenges; (2) provide a comprehensive discussion of the publicly available datasets for developing and evaluating novel HAR approaches; and (3) outline the applications and typical evaluation metrics used in the HAR context.
