Deep Neural Network with Extracted Features for Social Group Detection

Background and Objectives: Video processing has received considerable attention in recent years, and social group detection is one of the central problems in crowd analysis. For human-like robots, detecting groups and the relationships between their members is important. A group consists of two or more people, and moving as a group means that its members move in the same direction and at the same speed.

Methods: The proposed method applies a deep neural network (DNN) to detect social groups, using Euclidean distance, proximity distance, motion causality, trajectory shape, and heat maps as features. First, these features are extracted for every pair of people in the video and assembled into a feature matrix. The DNN then learns social groups from this matrix.

Results: The goal is to detect social groups of two or more individuals. The proposed method, which combines a DNN with the extracted features, detects such groups, and its output is compared with that of other methods.

Conclusion: In recent years, the use of deep neural networks (DNNs) for learning and detection has increased. In this work, we used DNNs with extracted features to detect social groups. The evaluation results and the outputs on videos demonstrate the usefulness of DNNs with extracted features.

Copyright © 2021 The author(s). This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits unrestricted use, distribution, and reproduction in any medium, as long as the original authors and source are cited. No permission is required from the authors or the publishers.
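The pairwise feature extraction described under Methods can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define how each feature is computed, so only the mean Euclidean distance is taken from the feature list, while the speed difference and direction cosine are illustrative stand-ins motivated by the statement that group members move in the same direction and at the same speed. The function names `pair_features` and `feature_matrix`, and the assumption that every trajectory is a NumPy array of shape (T, 2) covering the same T frames, are hypothetical.

```python
from itertools import combinations

import numpy as np


def pair_features(traj_a, traj_b):
    """Return a small feature vector for one pair of (T, 2) trajectories."""
    # Mean Euclidean distance between the two people over all frames.
    mean_dist = np.linalg.norm(traj_a - traj_b, axis=1).mean()

    # Average per-frame displacement of each person (illustrative stand-in).
    vel_a = np.diff(traj_a, axis=0).mean(axis=0)
    vel_b = np.diff(traj_b, axis=0).mean(axis=0)
    speed_a = np.linalg.norm(vel_a)
    speed_b = np.linalg.norm(vel_b)

    # Similar speed and similar heading suggest the pair moves together.
    speed_diff = abs(speed_a - speed_b)
    denom = speed_a * speed_b
    direction_cos = float(vel_a @ vel_b / denom) if denom > 0 else 0.0

    return np.array([mean_dist, speed_diff, direction_cos])


def feature_matrix(trajectories):
    """Build one feature row per unordered pair of people.

    `trajectories` maps a person id to a (T, 2) array and must contain
    at least two people.
    """
    ids = sorted(trajectories)
    pairs = list(combinations(ids, 2))
    feats = np.stack(
        [pair_features(trajectories[i], trajectories[j]) for i, j in pairs]
    )
    return pairs, feats
```

Each row of the returned matrix describes one pair of people, so a classifier such as a DNN could be trained on these rows with a same-group or different-group label per pair. How the pairwise decisions are merged into groups is left open here, as the abstract does not specify it.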
