Combining Behaviors and Demographics to Segment Online Audiences: Experiments with a YouTube Channel

Social media channels with audiences in the millions are increasingly common. Efforts at segmenting audiences for populations of these sizes can result in hundreds of audience segments, as the compositions of the overall audiences tend to be complex. Although understanding audience segments is important for strategic planning, tactical decision making, and content creation, it is unrealistic for human decision makers to effectively utilize hundreds of audience segments in these tasks. In this research, we present efforts at simplifying the segmentation of audience populations to increase their practical utility. Using millions of interactions with hundreds of thousands of viewers with an organization’s online content collection, we first isolate the maximum number of audience segments, based on behavioral profiling, and then demonstrate a computational approach of using non-negative matrix factorization to reduce this number to 42 segments that are both impactful and representative segments of the overall population. Initial results are promising, and we present avenues for future research leveraging our approach.

[1]  Bernard J. Jansen,et al.  Findings of a User Study of Automatically Generated Personas , 2018, CHI Extended Abstracts.

[2]  Shuguang Han,et al.  Understanding and modeling behavior patterns in cross‐device web search , 2017, ASIST.

[3]  Barbara Stern A revised communication model for advertising: Multiple dimensions of the source, the message , 1994 .

[4]  Virgílio A. F. Almeida,et al.  Characterizing Videos, Audience and Advertising in Youtube Channels for Kids , 2017, SocInfo.

[5]  A. Miller,et al.  Communicating with Key Publics in Crisis Communication: The Synthetic Approach to the Public Segmentation in Caps (Communicative Action in Problem Solving) , 2016 .

[6]  Bernard J. Jansen,et al.  Classifying web queries by topic and user intent , 2010, CHI Extended Abstracts.

[7]  Bernard J. Jansen,et al.  Classifying ecommerce information sharing behaviour by youths on social networking sites , 2011, J. Inf. Sci..

[8]  Bernard J. Jansen,et al.  Generating Cultural Personas from Social Data: A Perspective of Middle Eastern Users , 2017, 2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[9]  John Sweller,et al.  Cognitive Load During Problem Solving: Effects on Learning , 1988, Cogn. Sci..

[10]  J. Burkell,et al.  Could we do better? Behavioural tracking on recommended consumer health websites. , 2015, Health information and libraries journal.

[11]  Sharyn Rundle-Thiele,et al.  To segment or not? That is the question , 2018 .

[12]  Lesly Alejandra Gonzalez Camacho,et al.  Social network data to alleviate cold-start in recommender system: A systematic review , 2018, Inf. Process. Manag..

[13]  J. L. Nelson And Deliver Us to Segmentation , 2018, Reimagining Journalism and Social Order in a Fragmented Media World.

[14]  Vasant Dhar,et al.  Editorial - Big Data, Data Science, and Analytics: The Opportunity and Challenge for IS Research , 2014, Inf. Syst. Res..

[15]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[16]  Bernard J. Jansen,et al.  Personas for Content Creators via Decomposed Aggregate Audience Statistics , 2017, 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[17]  Chengzhi Zhang,et al.  Detecting dietary preference of social media users in China via sentiment analysis , 2017, ASIST.

[18]  Chirag Shah,et al.  Evaluating user search trails in exploratory search tasks , 2017, Inf. Process. Manag..

[19]  Wendell R. Smith Product Differentiation and Market Segmentation as Alternative Marketing Strategies , 1956 .

[20]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[21]  Jisun An,et al.  Multidimensional Analysis of the News Consumption of Different Demographic Groups on a Nationwide Scale , 2017, SocInfo.

[22]  Petros Ieromonachou,et al.  Big data analytics in supply chain management: A state-of-the-art literature review , 2017, Comput. Oper. Res..

[23]  Mahmoud Al-Ayyoub,et al.  Paraphrase identification and semantic text similarity analysis in Arabic news tweets using lexical, syntactic, and semantic features , 2017, Inf. Process. Manag..

[24]  Minh Le Nguyen,et al.  Multilingual opinion mining on YouTube - A convolutional N-gram BiLSTM word embedding , 2018, Inf. Process. Manag..

[25]  Frank Schweitzer,et al.  Evaluative Patterns and Incentives in YouTube , 2017, SocInfo.

[26]  Bernard J. Jansen,et al.  Classifying web search queries to identify high revenue generating customers , 2012, J. Assoc. Inf. Sci. Technol..

[27]  John S. Edwards,et al.  Using Knowledge Management to Give Context to Analytics and Big Data and Reduce Strategic Risk , 2016 .

[28]  Giselle A. Auger,et al.  Extrovert and engaged? Exploring the connection between personality and involvement of stakeholders and the perceived relationship investment of nonprofit organizations , 2017 .

[29]  David Cornforth,et al.  Ranking of high-value social audiences on Twitter , 2016, Decis. Support Syst..

[30]  R. Nielsen,et al.  Are News Audiences Increasingly Fragmented? A Cross-National Comparative Analysis of Cross-Platform News Audience Fragmentation and Duplication , 2017 .

[31]  Lene Nielsen,et al.  Personas is applicable: a study on the use of personas in Denmark , 2014, CHI.

[32]  Bernard J. Jansen,et al.  Questioner or question: Predicting the response rate in social question and answering on Sina Weibo , 2018, Inf. Process. Manag..

[33]  Bernard J. Jansen,et al.  Persona Generation from Aggregated Social Media Data , 2017, CHI Extended Abstracts.

[34]  Tracy L. Tuten,et al.  Creative Strategies in Social Media Marketing: An Exploratory Study of Branded Social Content and Consumer Engagement , 2015 .

[35]  Bernard J. Jansen,et al.  From 2, 772 segments to five personas: Summarizing a diverse online audience by generating culturally adapted personas , 2018, First Monday.

[36]  Bernard J. Jansen,et al.  Viewed by too many or viewed too little: Using information dissemination for audience segmentation , 2017, ASIST.