Data analysis system for online short video comments

Online short video has become a very popular form of video nowadays. The video providers involve small ones, like a stand-alone video maker, a UGC studio, or some big video producers. They focus on providing the content, and they are eager to learn whether their videos are popular or not. Most of them hope that some widely accepted third party institution provide the benchmark for the short videos industry among the country. For their convenience, a data analysis system has been setup specialized for this purpose. Firstly, it can fetch short video comments information from the internet, either mobile or not. Then it analyzes the fetched information with machine learning algorithms. Finally it can provide the result to the video providers, for the utilization of product improvement. Sometimes this information can even help to decide the awards for the video provider's association. This article makes an introduction to the system, especially for the data fetching and data analyzing technologies and their utility.

[1]  Chun Ying Kang DOM-Based Web Pages to Determine the Structure of the Similarity Algorithm , 2009, 2009 Third International Symposium on Intelligent Information Technology Application.

[2]  Carlo Aliprandi,et al.  Sentiment Analysis on Social Media , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[3]  Liu Lu,et al.  Empirical study of sentiment classification for Chinese microblog based on machine learning , 2012 .

[4]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[5]  Michael Gamon,et al.  Automatic Identification of Sentiment Vocabulary: Exploiting Low Association with Known Sentiment Terms , 2005, ACL 2005.

[6]  Wen Shi,et al.  Sentiment Classification for Movie Reviews in Chinese by Improved Semantic Oriented Approach , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[7]  Cornelia Gyorodi,et al.  A comparative study: MongoDB vs. MySQL , 2015, 2015 13th International Conference on Engineering of Modern Electric Systems (EMES).

[8]  Li Guangli,et al.  Design of a Distributed Spiders System Based on Web Service , 2009, 2009 Second Pacific-Asia Conference on Web Mining and Web-based Application.

[9]  Michael L. Littman,et al.  Unsupervised Learning of Semantic Orientation from a Hundred-Billion-Word Corpus , 2002, ArXiv.

[10]  Luis Pedro Coelho,et al.  Building machine learning systems with Python : master the art of machine learning with Python and build effective machine learning systems with this intensive hands-on guide , 2013 .

[11]  Kamal Nigam,et al.  Towards a Robust Metric of Opinion , 2004 .

[12]  Xiao Sun,et al.  Chinese microblog sentiment classification based on convolution neural network with content extension method , 2015, 2015 International Conference on Affective Computing and Intelligent Interaction (ACII).

[13]  Xiangyu Kong,et al.  Sentiment Classification for Microblog by Machine Learning , 2012, 2012 Fourth International Conference on Computational and Information Sciences.

[14]  Jian Huang,et al.  The research and implementation of Web Spider in Search Engine , 2010, The 2010 International Conference on Apperceiving Computing and Intelligence Analysis Proceeding.

[15]  Li Jie,et al.  A programmable controller architecture for regular expression string matching , 2012, Proceedings of the 31st Chinese Control Conference.