Performance of Raspberry Pi microclusters for Edge Machine Learning in Tourism

While a range of computing equipment has been developed or proposed for use to solve machine learning problems in edge computing, one of the least-explored options is the use of clusters of low-resource devices, such as the Raspberry Pi. Although such hardware configurations have been discussed in the past, their performance for ML tasks remains unexplored. In this paper, we discuss the performance of a Raspberry Pi micro-cluster, configured with industry-standard platforms, using Hadoop for distributed file storage and Spark for machine learning. Using the latest Raspberry Pi 4 model (quad core 1.5GHz, 4Gb RAM), we find encouraging results for use of such micro-clusters both for local training of ML models and execution of ML-based predictions. Our aim is to use such computing resources in a distributed architecture to serve tourism applications through the analysis of big data.

[1]  Chanwit Kaewkasi,et al.  A study of big data processing constraints on a low-power Hadoop cluster , 2014, 2014 International Computer Science and Engineering Conference (ICSEC).

[2]  Matheus A. Souza,et al.  A Low-Cost Energy-Efficient Raspberry Pi Cluster for Data Mining Algorithms , 2016, Euro-Par Workshops.

[3]  Sven Helmer,et al.  A Container-Based Edge Cloud PaaS Architecture Based on Raspberry Pi Clusters , 2016, 2016 IEEE 4th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[4]  Abdurrachman Mappuji,et al.  Study of Raspberry Pi 2 quad-core Cortex-A7 CPU cluster as a mini supercomputer , 2016 .

[5]  Fung Po Tso,et al.  The Glasgow Raspberry Pi Cloud: A Scale Model for Cloud Computing Infrastructures , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops.

[6]  Jian Zhang,et al.  Learning Cluster Computing by Creating a Raspberry Pi Cluster , 2017, ACM Southeast Regional Conference.

[7]  Baskoro Adi Pratomo,et al.  SQL injection detection and prevention system with raspberry Pi honeypot cluster for trapping attacker , 2014, 2014 International Symposium on Technology Management and Emerging Technologies.

[8]  Sven Helmer,et al.  Affordable and Energy-Efficient Cloud Computing Clusters: The Bolzano Raspberry Pi Cloud Cluster Experiment , 2013, 2013 IEEE 5th International Conference on Cloud Computing Technology and Science.

[9]  Suzanne J. Matthews,et al.  Investigating a Raspberry Pi cluster for detecting anomalies in the smart grid , 2017, 2017 IEEE MIT Undergraduate Research Technology Conference (URTC).

[10]  David Toth A Portable Cluster for Each Student , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[11]  Enrico Valdani,et al.  A Practical Approach to Big Data in Tourism: A Low Cost Raspberry Pi Cluster , 2015, ENTER.