Visual computing resources distribution and balancing by multimodal cat swarm optimization

Abstract According to the fast development of Internet technologies, nowadays more and more web applications require accessing and processing massive-scale visual data, such as content image search and automatic navigation. Practically, the computing power of a single personal PC is insufficient for handling such massive-scale visual elements. Owing to the advancement of cloud platforms, this problem can be well handled. In this work, we study the problem of how to optimally distribute the P visual processing tasks to Q computing resources. Specifically, we proposed an enhanced multimodal cat swarm optimization (MCSO) algorithm to fulfill this task. Given a huge number of images/videos, we first extract color, texture, and semantic channel to represent the visual content of each image/video. Afterward, we develop a multi-view feature learning algorithm to intelligently combine the multiple features into a descriptive one, wherein the weights of different feature channels are adjusted automatically. Subsequently, we use the CSO algorithm to assign each image/video to different remote servers. The CSO algorithm mimics the conventional cat hunting process, that is, a set of cats is divided into two groups, one for searching and the other for tracking. The two patterns are interacted by the mixture ratio, which indicates how many cats will conduct tracking in the next round. Based on the output of CSO, each image/video will be assigned to the most optimal remote sensor. Finally, based on the optimal assignment, the image/video processing can be conducted with the minimal time consumption. Comprehensive experimental comparisons have demonstrated the advantages of our method, that is, our MCSO can achieve a visual processing time three times faster than the second best performer.

[1]  Chao Zhang,et al.  Weakly Supervised Complets Ranking for Deep Image Quality Modeling , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Songcan Chen,et al.  Multi-view kernel machine on single-view data , 2009, Neurocomputing.

[3]  Joachim M. Buhmann,et al.  Multi-assignment clustering for Boolean data , 2009, ICML '09.

[4]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Hajime Kita,et al.  Adaptation to a Changing Environment by Means of the Thermodynamical Genetic Algorithm , 1999 .

[6]  Rajni V. Patel,et al.  HMFP-DBRNN: Real-Time Hand Motion Filtering and Prediction via Deep Bidirectional RNN , 2019, IEEE Robotics and Automation Letters.

[7]  Carlos A. Coello Coello,et al.  Particle Swarm Optimization in Non-stationary Environments , 2004, IBERAMIA.

[8]  Ariel Shamir,et al.  Improved seam carving for video retargeting , 2008, SIGGRAPH 2008.

[9]  Balasubramanian Raman,et al.  A novel colour- and texture-based image retrieval technique using multi-resolution local extrema peak valley pattern and RGB colour histogram , 2015, Pattern Analysis and Applications.

[10]  Agostinho C. Rosa,et al.  Self-adjusting the intensity of assortative mating in genetic algorithms , 2008, Soft Comput..

[11]  Debashis De,et al.  Entropy-aware ambient IoT analytics on humanized music information fusion , 2019, Journal of Ambient Intelligence and Humanized Computing.

[12]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.