Multimedia Social Big Data: Mining

The rapid evolution and adoption of the SMAC (Social media, Mobile, Analytics and Cloud) technology paradigm has generated massive volumes of human-centric, real-time, multimodal, heterogeneous data. Human-sourced information from social networks, process-mediated data from business systems and machine-generated data from the Internet of Things are the three primary sources of big data, and together they define the richness and scale of the multimedia content available. With the proliferation of social networks (Twitter, Tumblr, Google+, Facebook, Instagram, Snapchat, YouTube, etc.), users can post and share all kinds of multimedia content (text, image, audio, video) in a social setting over the Internet without much knowledge of the Web’s client-server architecture or network topology. This offers novel opportunities and challenges for leveraging high-diversity multimedia data alongside the huge amount of social data. In recent years, multimedia analytics as a technology-based solution has attracted considerable attention from both researchers and practitioners. The opportunities to analyze, model and discover knowledge from social web applications and services are not restricted to text-based big data, but extend to the partially unknown, complex structures of image, audio and video. Moreover, big data is estimated to be 90% unstructured, which makes it crucial to tap and analyze this information using contemporary tools. This work presents an extensive and organized overview of multimedia social big data mining and its applications. It provides comprehensive coverage of the taxonomy, types and techniques of multimedia social big data mining. A SWOT analysis is also presented to assess the feasibility and scope of social multimedia content and big data analytics. Recent applications and suitable directions for future research are identified, which validate and endorse this correlation of multimedia to big data for mining social data.
