Enhancing Fashion Classification with Vision Transformer (ViT) and Developing Recommendation Fashion Systems Using DINOVA2

: As e-commerce platforms grow, consumers increasingly purchase clothes online; however, they often need clarification on clothing choices. Consumers and stores interact through the clothing recommendation system. A recommendation system can help customers to find clothing that they are interested in and can improve turnover. This work has two main goals: enhancing fashion classification and developing a fashion recommendation system. The main ob-jective of fashion classification is to apply a Vision Transformer (ViT) to enhance performance. ViT is a set of transformer blocks; each transformer block consists of two layers: a multi-head self-attention layer and a multilayer perceptron (MLP) layer. The hyperparameters of ViT are configured based on the fashion images dataset. CNN models have different layers, including multi-convolutional layers, multi-max pooling layers, multi-dropout layers, multi-fully connected layers, and batch normalization layers. Furthermore, ViT is compared with different models, i.e., deep CNN models, VGG16, DenseNet-121, Mobilenet, and ResNet50, using different evaluation meth-ods and two fashion image datasets. The ViT model performs the best on the Fashion-MNIST dataset (accuracy = 95.25, precision = 95.20, recall = 95.25, F1-score = 95.20). ViT records the highest performance compared to other models in the fashion product dataset (accuracy = 98.53, precision = 98.42, recall = 98.53, F1-score = 98.46). A recommendation fashion system is developed using Learning Robust Visual Features without Supervision (DINOv2) and a nearest neighbor search that is built in the FAISS library to obtain the top five similarity results for specific images.

[1]  Mohammad Mustafa Taye Theoretical Understanding of Convolutional Neural Network: Concepts, Architectures, Applications, Future Directions , 2023, Comput..

[2]  Filippo Cavallo,et al.  Image Classification Using Multiple Convolutional Neural Networks on the Fashion-MNIST Dataset , 2022, Sensors.

[3]  K. Kwak,et al.  Diagnosis Myocardial Infarction Based on Stacking Ensemble of Convolutional Neural Network , 2022, Electronics.

[4]  Kuan-Hsien Liu,et al.  Clothing Recommendation Based on Deep Learning , 2022, 2022 IEEE International Conference on Consumer Electronics - Taiwan.

[5]  D. Popescu,et al.  Comparative Study of Neural Networks Used in Halyomorpha Halys Detection* , 2022, 2022 30th Mediterranean Conference on Control and Automation (MED).

[6]  P. Natarajan,et al.  FashionVLP: Vision Language Transformer for Fashion Retrieval with Feedback , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yoonseo Park,et al.  A Survey of Recommendation Systems: Recommendation Models, Techniques, and Application Fields , 2022, Electronics.

[8]  Tariq Hussain,et al.  Design and implementation of clothing fashion style recommendation system using deep learning , 2021, Revista Română de Informatică și Automatică.

[9]  Abdul Monem S. Rahma,et al.  An E-Commerce Recommendation System Based on Dynamic Analysis of Customer Behavior , 2021, Sustainability.

[10]  M. Suryanegara,et al.  Evaluation of VGG-16 and VGG-19 Deep Learning Architecture for Classifying Dementia People , 2021, 2021 4th International Conference of Computer and Informatics Engineering (IC2IE).

[11]  Edgar J. Lobaton,et al.  Fashion Recommendation Systems, Models and Methods: A Review , 2021, Informatics.

[12]  Marwa S. Elpeltagy,et al.  Automatic prediction of COVID− 19 from chest images using modified ResNet50 , 2021, Multimedia Tools and Applications.

[13]  A. Dosovitskiy,et al.  MLP-Mixer: An all-MLP Architecture for Vision , 2021, NeurIPS.

[14]  Felix Mödritscher,et al.  Image Classification for the Automatic Feature Extraction in Human Worn Fashion Data , 2021, Mathematics.

[15]  P. Radeva,et al.  Data preparation for artificial intelligence in medical imaging: A comprehensive guide to open-access platforms and tools. , 2021, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.

[16]  Chitra Dadkhah,et al.  Content-based Clothing Recommender System using Deep Neural Network , 2021, 2021 26th International Computer Conference, Computer Society of Iran (CSICC).

[17]  Yakoub Bazi,et al.  Vision Transformers for Remote Sensing Image Classification , 2021, Remote. Sens..

[18]  E Sudarshan,et al.  Personalized fashion recommender system with image based neural networks , 2020, IOP Conference Series: Materials Science and Engineering.

[19]  Yan Chu,et al.  Automatic Image Captioning Based on ResNet50 and LSTM with Soft Attention , 2020, Wirel. Commun. Mob. Comput..

[20]  Malik Loudini,et al.  Classification of Brain Tumor by Combination of Pre-Trained VGG16 CNN , 2020 .

[21]  Thorsten Hoeser,et al.  Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review-Part I: Evolution and Recent Trends , 2020, Remote. Sens..

[22]  Wai Keung Wong,et al.  Knowledge Enhanced Neural Fashion Trend Forecasting , 2020, ICMR.

[23]  Jianping Fan,et al.  Deep embedding of concept ontology for hierarchical fashion recognition , 2020, Neurocomputing.

[24]  R. R. Sedamkar,et al.  Detecting Affect States Using VGG16, ResNet50 and SE-ResNet50 Networks , 2020, SN Computer Science.

[25]  Jan Platos,et al.  An Analysis of Convolutional Neural Network for Fashion Images Classification (Fashion-MNIST) , 2019, Advances in Intelligent Systems and Computing.

[26]  Chang-Hong Lin,et al.  Clothing Recommendation System based on Visual Information Analytics , 2019, 2019 International Automatic Control Conference (CACS).

[27]  Yang Long,et al.  Apparel-based deep learning system design for apparel style recommendation , 2019, International Journal of Clothing Science and Technology.

[28]  Kyung-shik Shin,et al.  Hierarchical convolutional neural networks for fashion image classification , 2019, Expert Syst. Appl..

[29]  Snehasis Mukherjee,et al.  Impact of Fully Connected Layers on Performance of Convolutional Neural Networks for Image Classification , 2019, Neurocomputing.

[30]  Yi Zhou,et al.  Elastic Neural Networks for Classification , 2018, 2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS).

[31]  Dr. Ton J. Cleophas,et al.  Modern Bayesian Statistics in Clinical Research , 2018, Springer International Publishing.

[32]  Pankaj Agarwal,et al.  Personalizing Similar Product Recommendations in Fashion E-commerce , 2018, ArXiv.

[33]  Markus Haltmeier,et al.  Image Based Fashion Product Recommendation with Deep Learning , 2018, LOD.

[34]  Tareq Abed Mohammed,et al.  Understanding of a convolutional neural network , 2017, 2017 International Conference on Engineering and Technology (ICET).

[35]  Turan Arslan,et al.  A Weighted Euclidean Distance based TOPSIS Method for Modeling Public Subjective Judgments , 2017, Asia Pac. J. Oper. Res..

[36]  Jeff Johnson,et al.  Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.

[37]  Adhistya Erna Permanasari,et al.  Cosine similarity to determine similarity measure: Study case in online essay assessment , 2016, 2016 4th International Conference on Cyber and IT Service Management.

[38]  Keiron O'Shea,et al.  An Introduction to Convolutional Neural Networks , 2015, ArXiv.

[39]  Alexander C. Berg,et al.  Runway to Realway: Visual Analysis of Fashion , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[40]  Wai Keung Wong,et al.  A fashion mix-and-match expert system for fashion retailers using fuzzy screening approach , 2009, Expert Syst. Appl..

[41]  K. Zou,et al.  Correlation and simple linear regression. , 2003, Radiology.

[42]  Saleh Albelwi Deep Architecture based on DenseNet-121 Model for Weather Image Recognition , 2022, International Journal of Advanced Computer Science and Applications.

[43]  Amol C. Adamuthe,et al.  CNN Model for Image Classification on MNIST and Fashion-MNIST Dataset , 2020, Journal of scientific research.

[44]  Ekaba Bisong Regularization for Deep Learning , 2019, Building Machine Learning and Deep Learning Models on Google Cloud Platform.

[45]  Luyan Chen Image-based Product Recommendation System with Convolutional Neural Networks , 2017 .