Framework for Social Media Big Data Quality Analysis

Unlimited amount of unstructured data is being captured and analyzed over social media. The paper highlights the issue of lack of standard quality control approaches that could be utilized for all social media sites. This is due to the variety of formats of big data acceptable over these sites. The issue reveals a challenge not only in the capture of big data but also in the analysis and yield of valuable data, which affect decision-making. The paper reviews a collection of archived documents in the field of big data and social media. This paper presents a framework identifying the issues of quality analysis of big data on social media, examining current techniques used by social media companies to capture and analyze big data, and mapping social media sites and the appropriate combinations of big data capture and analysis techniques with the data quality control requirements.

[1]  Ehud Gudes,et al.  Security Issues in NoSQL Databases , 2011, 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications.

[2]  Marc A. Smith,et al.  NodeXL: Simple network analysis for social media , 2013, 2013 International Conference on Collaboration Technologies and Systems (CTS).

[3]  S. Biruntha,et al.  Techniques on text mining , 2012, 2012 IEEE International Conference on Advanced Communication Control and Computing Technologies (ICACCCT).

[4]  Ben Shneiderman,et al.  Do You Know the Way to SNA?: A Process Model for Analyzing and Visualizing Social Media Network Data , 2012, 2012 International Conference on Social Informatics.

[5]  Yadira Espinal Viktor Mayer-Schonberger and Kenneth Cukier, Big Data: A Revolution That Will Transform How We Live, Work and Think , 2013 .

[6]  Samir Akrouf,et al.  Social Network Analysis and Information Propagation: A Case Study Using Flickr and YouTube Networks , 2013 .

[7]  Florin Radulescu,et al.  MongoDB vs Oracle -- Database Comparison , 2012, 2012 Third International Conference on Emerging Intelligent Data and Web Technologies.

[8]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..

[9]  Heather Small,et al.  What Your Tweets Tell Us About You: Identity, Ownership and Privacy of Twitter Data , 2012, Int. J. Digit. Curation.

[10]  Sathiamoorthy Manoharan,et al.  A performance comparison of SQL and NoSQL databases , 2013, 2013 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM).

[11]  Sam Shah,et al.  Hourglass: A library for incremental processing on Hadoop , 2013, 2013 IEEE International Conference on Big Data.

[12]  Ben Shneiderman,et al.  First Steps to Netviz Nirvana: Evaluating Social Network Analysis with NodeXL , 2009, 2009 International Conference on Computational Science and Engineering.

[13]  Ralph Deters,et al.  RSenter: terms mining tool from unstructured data sources , 2013, Int. J. Bus. Process. Integr. Manag..

[14]  Xin Chen,et al.  Mining Social Media Data for Understanding Students’ Learning Experiences , 2014, IEEE Transactions on Learning Technologies.

[15]  Viktor Mayer-Schnberger,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2013 .

[16]  Bi-Ru Dai,et al.  Opinion Mining on Social Media Data , 2013, 2013 IEEE 14th International Conference on Mobile Data Management.

[17]  Ieee Staff 2013 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM) , 2013 .

[18]  Huan Liu,et al.  Twitter Data Analytics , 2013, SpringerBriefs in Computer Science.

[19]  M. Gold Debates in the Digital Humanities , 2012 .

[20]  Bernhard Rieder,et al.  Studying Facebook via data extraction: the Netvizz application , 2013, WebSci.

[21]  Xin Chen,et al.  A Web-Based Tool for Collaborative Social Media Data Analysis , 2013, 2013 International Conference on Cloud and Green Computing.

[22]  George Danezis,et al.  Prying Data out of a Social Network , 2009, 2009 International Conference on Advances in Social Network Analysis and Mining.