Large Scale Open Source Video Recommender Tool Using Metadata Surrogates

Video and multi-media sharing is a significant activity on social media platforms. Learning patterns of activities using raw video data is computationally intensive and impractical, and manual inspection is not scalable and prohibitively expensive. An alternate strategy is to learn information about video content using far less compute intensive metadata surrogates. This paper describes a video recommender tool implemented in GovCloud using a novel approach of using lightweight video metadata to learn and classify video content. In contrast to popular video recommender systems that use consumption models for classification, the new approach used in our tool is based solely on the video metadata along with domain expertise used to truth a relatively small subset of relevant video content. The tool is very user-friendly and captures practical knowledge of the user resulting in good learning model. The architecture and implementation specifics of the tool is outlined in this paper. The classifier performance using metadata from tens of thousands of real postings exceeds 90% for both recall and ROC metrics. This tool has shown promise in providing a console for aggregating social media videos for analysts to train the system consistent with the context and task at hand.

[1]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[2]  CARLOS A. GOMEZ-URIBE,et al.  The Netflix Recommender System , 2015, ACM Trans. Manag. Inf. Syst..

[3]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[4]  George Mathew Architectural considerations for highly scalable computing to support on-demand video analytics , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[5]  Sharath Pankanti,et al.  Smart Video Surveillance , 2005 .

[6]  George Mathew The Challenges and Solutions for Building an Integrated Video Analytics Platform , 2017, 2017 IEEE International Conference on Information Reuse and Integration (IRI).

[7]  Jason Thornton,et al.  Feedback-based social media filtering tool for improved situational awareness , 2016, 2016 IEEE Symposium on Technologies for Homeland Security (HST).

[8]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[9]  Leah R. Gerber,et al.  The Use of Surrogate Data in Demographic Population Viability Analysis: A Case Study of California Sea Lions , 2015, PloS one.

[10]  Luca Faes,et al.  Surrogate data analysis for assessing the significance of the coherence function , 2004, IEEE Transactions on Biomedical Engineering.

[11]  Chia Feng Lin,et al.  A Framework for Scalable Cloud Video Recorder System in Surveillance Environment , 2012, 2012 9th International Conference on Ubiquitous Intelligence and Computing and 9th International Conference on Autonomic and Trusted Computing.