Automated Delineation of Subgroups in Web Video: A Medical Activism Case Study

Web 2.0 tools in general, and Web video in particular, provide new ways for activists to express their viewpoints to a broad audience. In this paper we deployed tools that have been used to find subgroups automatically in social networks and applied them to the problem of distinguishing between two sides of a controversial issue based on patterns of online interaction. We explored the problem of distinguishing between anti- and pro-vaccination activists based on a social network of videos and associated comments posted on YouTube. Videos for the analysis were selected by submitting the term “vaccination” to a search on YouTube. A content analysis of the selected videos was then performed (Keelan et al, 2007) to classify videos as pro- or anti-vaccination. Then, a modified version of the SCAN method (Chin and Chignell, 2008) for identifying cohesive subgroups in social networks was applied to the social network inferred from the discussions about the videos. Results showed that a cohesive subgroup of anti-vaccination people existed in discussions around anti-vaccination videos, whereas discussions around pro-vaccination videos included both anti-vaccination and pro-vaccination people. Implications of the method and results for more general delineation of types of medical activism and the opposing camps within those camps are discussed.

[1]  Yan Zhao,et al.  Visualization of Communication Patterns in Collaborative Innovation Networks - Analysis of Some W3C Working Groups , 2003, CIKM '03.

[2]  Ravi Kumar,et al.  Trawling the Web for Emerging Cyber-Communities , 1999, Comput. Networks.

[3]  Sara E. Sterling Aggregation Techniques to Characterize Social Networks , 2012 .

[4]  R. M. Wolfe,et al.  Vaccine Criticism on the World Wide Web , 2005, Journal of medical Internet research.

[5]  Caroline Haythornthwaite,et al.  Studying Online Social Networks , 2006, J. Comput. Mediat. Commun..

[6]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[7]  R. Alba A graph‐theoretic definition of a sociometric clique† , 1973 .

[8]  S. Chapman,et al.  Antivaccination activists on the world wide web , 2002, Archives of disease in childhood.

[9]  Funda Meric-Bernstam,et al.  Efficacy of Quality Criteria to Identify Potentially Harmful Information: A Cross-sectional Survey of Complementary and Alternative Medicine Web Sites , 2004, Journal of medical Internet research.

[10]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Mads Melbye,et al.  Association Between Thimerosal-Containing Vaccine and Autism , 2003 .

[12]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Bernardo A. Huberman,et al.  E-Mail as Spectroscopy: Automated Discovery of Community Structure within Organizations , 2005, Inf. Soc..

[14]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[15]  G D Lundberg,et al.  Assessing, controlling, and assuring the quality of medical information on the Internet: Caveant lector et viewor--Let the reader and viewer beware. , 1997, JAMA.

[16]  Lee Rainie,et al.  The online health care revolution: how the web helps americans take better care of themselves , 2000 .

[17]  David R. Karger,et al.  Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.

[18]  Janice Singer,et al.  New Visual Media and Gender: A Content, Visual, and Audience Analysis of YouTube Vlogs , 2008 .

[19]  Thomas W. Valente,et al.  The stability of centrality measures when networks are sampled , 2003, Soc. Networks.

[20]  V. Latora,et al.  Centrality measures in spatial networks of urban streets. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Martin Halvey,et al.  Exploring social dynamics in online media sharing , 2007, WWW '07.

[22]  Mark H. Chignell,et al.  A social hypertext model for finding community in blogs , 2006, HYPERTEXT '06.

[23]  Janice Singer,et al.  Exploring the Gender Divide on YouTube: An Analysis of the Creation and Reception of Vlogs , 2008 .

[24]  Hsinchun Chen,et al.  Automated Identification of Web Communities for Business Intelligence Analysis , 2005 .

[25]  W. Hueston,et al.  Virtual Medical Care: How Are Our Patients Using Online Health Information? , 2006, Journal of Community Health.

[26]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[27]  L Nasir,et al.  Reconnoitering the antivaccination web sites: news from the front. , 2000, The Journal of family practice.

[28]  Bin Wu,et al.  Community detection in large-scale social networks , 2007, WebKDD/SNA-KDD '07.

[29]  Manish Latthe,et al.  Accuracy of information on apparently credible websites: survey of five common health topics , 2002, BMJ : British Medical Journal.

[30]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[31]  Jiangchuan Liu,et al.  Understanding the Characteristics of Internet Short Video Sharing: YouTube as a Case Study , 2007, ArXiv.

[32]  Mark H. Chignell,et al.  Automatic detection of cohesive subgroups within social hypertext: A heuristic approach , 2008, New Rev. Hypermedia Multim..

[33]  Patricia G. Lange Publicly Private and Privately Public: Social Networking on YouTube , 2007, J. Comput. Mediat. Commun..

[34]  Caroline Haythornthwaite,et al.  Social networks and Internet connectivity effects , 2005 .

[35]  Mark H. Chignell,et al.  Identifying subcommunities using cohesive subgroups in social hypertext , 2007, HT '07.

[36]  Setsuo Ohsuga,et al.  INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES , 1977 .

[37]  Ravi Kumar,et al.  Discovering Large Dense Subgraphs in Massive Graphs , 2005, VLDB.

[38]  R. M. Wolfe,et al.  Vaccination or Immunization? The Impact of Search Terms on the Internet , 2005, Journal of health communication.

[39]  R. M. Wolfe,et al.  Content and design attributes of antivaccination web sites. , 2002, JAMA.

[40]  Kumanan Wilson,et al.  Association of autistic spectrum disorder and the measles, mumps, and rubella vaccine: a systematic review of current epidemiological evidence. , 2003, Archives of pediatrics & adolescent medicine.

[41]  O. Daescu,et al.  Centrality Measures for the Human Red Blood Cell Interactome , 2007, 2007 IEEE Dallas Engineering in Medicine and Biology Workshop.

[42]  Danyel Fisher,et al.  Visualizing the Signatures of Social Roles in Online Discussion Groups , 2007, J. Soc. Struct..

[43]  Yoram M. Kalman,et al.  Pauses and Response Latencies: A Chronemic Analysis of Asynchronous CMC , 2006, J. Comput. Mediat. Commun..

[44]  Jun Li,et al.  Autism and measles, mumps, and rubella vaccine: no epidemiological evidence for a causal association , 1999, The Lancet.

[45]  Sergiy Butenko,et al.  Clique Relaxations in Social Network Analysis: The Maximum k-Plex Problem , 2011, Oper. Res..

[46]  Daniel Lewis,et al.  What is web 2.0? , 2006, CROS.

[47]  Thierry Chanier,et al.  How Social Network Analysis can help to Measure Cohesion in Collaborative Distance-Learning , 2003, CSCL.

[48]  A. Richardsen,et al.  Cohesion as a Basic Bond in Groups , 1983 .

[49]  Sallie Bernard Association between thimerosal-containing vaccine and autism. , 2004, JAMA.

[50]  An-Ping Zeng,et al.  The Connectivity Structure, Giant Strong Component and Centrality of Metabolic Networks , 2003, Bioinform..

[51]  Gary Geisler,et al.  Tagging video: conventions and strategies of the YouTube community , 2007, JCDL '07.

[52]  Tanya Y. Berger-Wolf,et al.  A framework for community identification in dynamic social networks , 2007, KDD '07.

[53]  Danyel Fisher,et al.  Using egocentric networks to understand communication , 2005, IEEE Internet Computing.

[54]  Caroline Haythornthwaite,et al.  Automated Discovery and Analysis of Social Networks from Threaded Discussions , 2008 .

[55]  Nan Zheng,et al.  Issue Publics on the Web: Applying Network Theory to the War Blogosphere , 2006, J. Comput. Mediat. Commun..

[56]  M. Hardey Doctor in the house: the Internet as a source of lay health knowledge and the challenge to expertise , 1999 .

[57]  Mark H. Chignell,et al.  Identifying active subgroups in online communities , 2007, CASCON.

[58]  G. Tomlinson,et al.  YouTube as a source of information on immunization: a content analysis. , 2007, JAMA.

[59]  Cameron A. Marlow Audience, structure and authority in the weblog community , 2004 .