Video Big Data Analytics in the Cloud: Research Issues and Challenges

On the rise of distributed computing technologies, video big data analytics in the cloud have attracted researchers and practitioners' attention. The current technology and market trends demand an efficient framework for video big data analytics. However, the current work is too limited to provide an architecture on video big data analytics in the cloud, including managing and analyzing video big data, the challenges, and opportunities. This study proposes a service-oriented layered reference architecture for intelligent video big data analytics in the cloud. Finally, we identify and articulate several open research issues and challenges, which have been raised by the deployment of big data technologies in the cloud for video big data analytics. This paper provides the research studies and technologies advancing video analyses in the era of big data and cloud computing. This is the first study that presents the generalized view of the video big data analytics in the cloud to the best of our knowledge.

[1]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[2]  Nikhil Ketkar,et al.  Introduction to PyTorch , 2021, Deep Learning with Python.

[3]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Anand Deshpande,et al.  Artificial Intelligence for Big Data: Complete guide to automating Big Data solutions using Artificial Intelligence techniques , 2018 .

[6]  Shah Khalid,et al.  Social question and answer sites: the story so far , 2017, Program.

[7]  Pengtao Xie,et al.  Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters , 2017, USENIX Annual Technical Conference.

[8]  Nathan Marz,et al.  Big Data: Principles and best practices of scalable realtime data systems , 2015 .

[9]  Yang Yang,et al.  Deep Learning Scaling is Predictable, Empirically , 2017, ArXiv.

[10]  Jeffrey F. Naughton,et al.  Model Selection Management Systems: The Next Frontier of Advanced Analytics , 2016, SGMD.

[11]  Young-Koo Lee,et al.  TORNADO: Intermediate Results Orchestration Based Service-Oriented Data Curation Framework for Intelligent Video Big Data Analytics in the Cloud , 2020, Sensors.

[12]  Iyiola E. Olatunji,et al.  Dynamic Threshold for Resource Tracking in Observed Scenes , 2018, 2018 9th International Conference on Information, Intelligence, Systems and Applications (IISA).

[13]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[14]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[15]  Young-Koo Lee,et al.  IntelliBVR - Intelligent Large-Scale Video Retrieval for Objects and Events Utilizing Distributed Deep-Learning and Semantic Approaches , 2020, 2020 IEEE International Conference on Big Data and Smart Computing (BigComp).

[16]  Ameet Talwalkar,et al.  MLlib: Machine Learning in Apache Spark , 2015, J. Mach. Learn. Res..

[17]  Albert Gordo,et al.  Rosetta: Large Scale System for Text Detection and Recognition in Images , 2018, KDD.

[18]  Young-Koo Lee,et al.  FALKON: Large-Scale Content-Based Video Retrieval Utilizing Deep-Features and Distributed In-memory Computing , 2020, 2020 IEEE International Conference on Big Data and Smart Computing (BigComp).

[19]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Matthew J. Hausknecht,et al.  Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Yang Wang,et al.  BigDL: A Distributed Deep Learning Framework for Big Data , 2018, SoCC.

[22]  Lexing Xie,et al.  Event Mining in Multimedia Streams , 2008, Proceedings of the IEEE.

[23]  Seif Haridi,et al.  Apache Flink™: Stream and Batch Processing in a Single Engine , 2015, IEEE Data Eng. Bull..

[24]  Scott Shenker,et al.  Discretized streams: fault-tolerant streaming computation at scale , 2013, SOSP.

[25]  日経BP社,et al.  Amazon Web Services完全ソリューションガイド , 2016 .

[26]  Jingkuan Song,et al.  Learning in high-dimensional multimedia data: the state of the art , 2015, Multimedia Systems.

[27]  Jay Kreps,et al.  Kafka : a Distributed Messaging System for Log Processing , 2011 .

[28]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Young-Koo Lee,et al.  Feature Fusion of Deep Spatial Features and Handcrafted Spatiotemporal Features for Human Action Recognition , 2019, Sensors.

[30]  Marc'Aurelio Ranzato,et al.  Large Scale Distributed Deep Networks , 2012, NIPS.

[31]  Ruben Mayer,et al.  Scalable Deep Learning on Distributed Infrastructures: Challenges, Techniques and Tools , 2019 .

[32]  M. N. Vora,et al.  Hadoop-HBase for large-scale data , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[33]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..