Reliable customer analysis using federated learning and exploring deep-attention edge intelligence

Abstract The Internet of Things (IoT) and smart cities are flourishing with distributed systems in mobile wireless networks. As a result, an enormous amount of data are being generated for devices at the network edge. This results in privacy concerns, sensor data management issues, and data utilization issues. In this research, we propose a collaborative clustering method where the exchange of raw data is not required. The attention-based model used with a federated learning framework. The edge devices compute the model updates using local data and send them to the server for aggregation. Repetition is performed in multiple rounds until a convergence point reached. The transaction data used to train the attention model that gives a low dimensional embedding. Afterwards, we share the convergence model among the client/stores. Then, efficient pattern mining methods known as a clustering-based dynamic method (CBDM) are applied. For experimentation, we used retail store data to cluster the customer based on purchase behaviour. The proposed clustering method used semantic embedding to extract and then cluster them by discovering relevant patterns. The method achieved the 0.75 ROC values for the random distribution and 0.70 for the fixed distribution. The clustering method can help to reduce communication costs while ensuring privacy.

[1]  Somesh Jha,et al.  Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures , 2015, CCS.

[2]  Michael Wainberg,et al.  Deep learning in biomedicine , 2018, Nature Biotechnology.

[3]  Nathan L. Clarke,et al.  Evidence Identification in Heterogeneous Data Using Clustering , 2018, ARES.

[4]  Vitaly Shmatikov,et al.  Privacy-preserving deep learning , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[5]  David A. Patterson,et al.  In-datacenter performance analysis of a tensor processing unit , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).

[6]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[7]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[8]  Margaret Hu Cambridge Analytica’s black box , 2020, Big Data Soc..

[9]  Bruce R. Rosen,et al.  Distributed deep learning networks among institutions for medical imaging , 2018, J. Am. Medical Informatics Assoc..

[10]  Ibrahim Khalil,et al.  Privacy Preserving Distributed Machine Learning with Federated Learning , 2021, Comput. Commun..

[11]  Ladislav Hluchý,et al.  Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey , 2019, Artificial Intelligence Review.

[12]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[13]  Tianjian Chen,et al.  Federated Machine Learning: Concept and Applications , 2019 .

[14]  Yang Song,et al.  The iNaturalist Species Classification and Detection Dataset , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Philip S. Yu,et al.  A Survey of Utility-Oriented Pattern Mining , 2018, IEEE Transactions on Knowledge and Data Engineering.

[16]  George Kurian,et al.  Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[17]  Hamido Fujita,et al.  A survey of incremental high‐utility itemset mining , 2018, WIREs Data Mining Knowl. Discov..

[18]  Peter Rindal,et al.  ABY3: A Mixed Protocol Framework for Machine Learning , 2018, IACR Cryptol. ePrint Arch..

[19]  ForestierGermain,et al.  Deep learning for time series classification , 2019 .

[20]  Ali Dehghantanha,et al.  A survey on security and privacy of federated learning , 2021, Future Gener. Comput. Syst..

[21]  Djamel Djenouri,et al.  Deep Learning Versus Traditional Solutions for Group Trajectory Outliers , 2020, IEEE Transactions on Cybernetics.

[22]  Daguang Xu,et al.  Federated Learning for Breast Density Classification: A Real-World Implementation , 2020, DART/DCL@MICCAI.

[23]  Jiasen Lu,et al.  Hierarchical Question-Image Co-Attention for Visual Question Answering , 2016, NIPS.

[24]  Gautam Srivastava,et al.  Attention-Based Deep Entropy Active Learning Using Lexical Algorithm for Mental Health Treatment , 2021, Frontiers in Psychology.

[25]  Niranjan Lal,et al.  K-Mean Clustering Algorithm Approach for Data Mining of Heterogeneous Data , 2018 .

[26]  Vivienne Sze,et al.  Efficient Processing of Deep Neural Networks: A Tutorial and Survey , 2017, Proceedings of the IEEE.

[27]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[28]  Miin-Shen Yang,et al.  Unsupervised K-Means Clustering Algorithm , 2020, IEEE Access.

[29]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[30]  István Hegedüs,et al.  Decentralized learning works: An empirical comparison of gossip learning and federated learning , 2021, J. Parallel Distributed Comput..

[31]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[32]  Gautam Srivastava,et al.  A Machine Learning Model for Data Sanitization , 2021, Comput. Networks.

[33]  Olga Ohrimenko,et al.  Contamination Attacks and Mitigation in Multi-Party Machine Learning , 2018, NeurIPS.

[34]  Unil Yun,et al.  ASRNN: A recurrent neural network with an attention model for sequence labeling , 2020, Knowl. Based Syst..

[35]  Martin Jägersand,et al.  Deep semantic segmentation for automated driving: Taxonomy, roadmap and challenges , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[36]  Xuanjing Huang,et al.  Incorporating Corporation Relationship via Graph Convolutional Neural Networks for Stock Price Prediction , 2018, CIKM.

[37]  Gautam Srivastava,et al.  A Deep Q-Learning Sanitization Approach for Privacy Preserving Data Mining , 2020, ICDCN.

[38]  Jianghui Cai,et al.  A Novel Clustering Algorithm Based on DPC and PSO , 2020, IEEE Access.

[39]  Philip S. Yu,et al.  HUOPM: High-Utility Occupancy Pattern Mining , 2018, IEEE Transactions on Cybernetics.

[40]  Gautam Srivastava,et al.  Agent architecture of an intelligent medical system based on federated learning and blockchain technology , 2021, J. Inf. Secur. Appl..