Deep Neural Networks With Distance Distributions for Gender Recognition of 3D Human Shapes

Automatic human gender recognition is an important and classical problem in artificial intelligence. Most of the previous gender recognition works are based on vision appearance and biometric characteristics. However, there are fewer gender recognition approaches for 3D human shapes. In this article, we propose a novel deep neural network learning method for gender recognition of 3D human shapes. Firstly, we introduce effective descriptors to distinguish male and female of 3D human shapes via probability distributions of biharmonic distances among points. Secondly, the above distances-based low-level descriptors are fed into a fully connected neural network for gender recognition. Furthermore, we construct a larger 3D human shape dataset for evaluation of the proposed gender recognition method by collecting and labeling human shape models. Compared with previous works, our method obtains higher recognition accuracy and has more advantages, such as posture invariant, robust to noises, and no need of landmarks or pre-alignment process.

[1]  Zhongke Wu,et al.  A Scale-Invariant Diffusion Distance for Non-rigid Shape Analysis , 2014, International Conference on Virtual Reality and Visualization.

[2]  Michael J. Black,et al.  SMPL: A Skinned Multi-Person Linear Model , 2023 .

[3]  Yang Liu,et al.  Adaptive O-CNN: A Patch-based Deep Representation of 3D Shapes , 2018 .

[4]  Bo Li,et al.  Shape Retrieval of Non-rigid 3D Human Models , 2014, International Journal of Computer Vision.

[5]  Pierre Vandergheynst,et al.  Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[6]  Wojciech Matusik,et al.  Articulated mesh animation from multi-view silhouettes , 2008, ACM Trans. Graph..

[7]  Daniela Giorgi,et al.  SHape REtrieval Contest 2007: Watertight Models Track , 2007 .

[8]  Sebastian Thrun,et al.  SCAPE: shape completion and animation of people , 2005, SIGGRAPH '05.

[9]  Marc Rioux,et al.  Posture invariant gender classification for 3D human models , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[10]  Ruigang Yang,et al.  Semantic Parametric Reshaping of Human Body Models , 2014, 2014 2nd International Conference on 3D Vision.

[11]  Thomas A. Funkhouser,et al.  Biharmonic distance , 2010, TOGS.

[12]  Kathleen M. Robinette,et al.  Civilian American and European Surface Anthropometry Resource (CAESAR), Final Report. Volume 1. Summary , 2002 .

[13]  Michael J. Black,et al.  Dyna: a model of dynamic human shape in motion , 2015, ACM Trans. Graph..

[14]  Hans-Peter Seidel,et al.  A Statistical Model of Human Pose and Body Shape , 2009, Comput. Graph. Forum.

[15]  Wei Wu,et al.  PointCNN: Convolution On X-Transformed Points , 2018, NeurIPS.

[16]  A. Ben Hamza,et al.  A global geometric framework for 3D shape retrieval using deep learning , 2019, Comput. Graph..

[17]  Michael J. Black,et al.  MoSh: motion and shape capture from sparse markers , 2014, ACM Trans. Graph..

[18]  Maks Ovsjanikov,et al.  Automatic Location of Landmarks used in Manual Anthropometry , 2014, 3DOR@Eurographics.

[19]  Jonathan Masci,et al.  Geometric deep learning , 2016, SIGGRAPH ASIA Courses.

[20]  Alexander M. Bronstein,et al.  Shape Recognition with Spectral Distances , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Mark Meyer,et al.  Discrete Differential-Geometry Operators for Triangulated 2-Manifolds , 2002, VisMath.

[22]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Paul Suetens,et al.  SHREC '11 Track: Shape Retrieval on Non-rigid 3D Watertight Meshes , 2011, 3DOR@Eurographics.

[24]  Giuseppe Patanè,et al.  A Unified Definition and Computation of Laplacian Spectral Distances , 2019, Pattern Recognit..

[25]  Kathleen M. Robinette,et al.  Gender recognition with limited feature points from 3-D human body shapes , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[26]  Yizhou Yu,et al.  A fast modal space transform for robust nonrigid shape retrieval , 2016, The Visual Computer.

[27]  Michael J. Black,et al.  FAUST: Dataset and Evaluation for 3D Mesh Registration , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Bernt Schiele,et al.  Building statistical shape spaces for 3D human modeling , 2015, Pattern Recognit..

[29]  Navarro Pablo,et al.  Gender Recognition using 3D Human Body Scans , 2018, 2018 IEEE Biennial Congress of Argentina (ARGENCON).

[30]  Kathleen M. Robinette,et al.  Gender Recognition Using 3-D Human Body Shapes , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[31]  Bok-Min Goi,et al.  Recognizing Human Gender in Computer Vision: A Survey , 2012, PRICAI.

[32]  Rongrong Ji,et al.  Learning High-Level Feature by Deep Belief Networks for 3-D Model Retrieval and Recognition , 2014, IEEE Transactions on Multimedia.

[33]  Michael J. Black,et al.  Dynamic FAUST: Registering Human Bodies in Motion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Wenyao Xu,et al.  Human Gender Classification: A Review , 2015, Int. J. Biom..

[35]  Wady Naanaa,et al.  Local commute-time guided MDS for 3D non-rigid object retrieval , 2017, Applied Intelligence.

[36]  Alexander M. Bronstein,et al.  Numerical Geometry of Non-Rigid Shapes , 2009, Monographs in Computer Science.

[37]  Guillermo Sapiro,et al.  Three-dimensional point cloud recognition via distributions of geometric distances , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[38]  Michael J. Black,et al.  Learning to Dress 3D People in Generative Clothing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[40]  Ioannis Pratikakis,et al.  Unsupervised human action retrieval using salient points in 3D mesh sequences , 2018, Multimedia Tools and Applications.