Installing computational social science: Facing the challenges of new information and communication technologies in social science

Today’s world allows people to connect over larger distances and in shorter intervals than ever before, widely monitored by massive online data sources. Ongoing worldwide computerization has led to completely new opportunities for social scientists to conceive human interactions and relations in unknown precision and quantities. However, the large data sets require techniques that are more likely to be found in computer and natural sciences than in the established fields of social relations. In order to facilitate the participation of social scientists in an emerging interdisciplinary research branch of “computational social science,” we propose in this article the usage of the Python programming language. First, we carve out its capacity to handle “Big Data” in suitable formats. Second, we introduce programming libraries to analyze large networks and big text corpora, conduct simulations, and compare their performance to their counterparts in the R environment. Furthermore, we highlight practical tools implemented in Python for operational tasks like preparing presentations. Finally, we discuss how the process of writing code may help to exemplify theoretical concepts and could lead to empirical applications that gain a better understanding of the social processes initiated by the truly global connections of the Internet era.

[1]  Ben Jann,et al.  Reputation Formation and the Evolution of Cooperation in Anonymous Online Markets , 2014 .

[2]  J. French A formal theory of social power. , 1956, Psychology Review.

[3]  Dirk Helbing Introduction: The FuturICT knowledge accelerator towards a more resilient and sustainable future , 2013, ArXiv.

[4]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[5]  Roger Burrows,et al.  The Coming Crisis of Empirical Sociology , 2007, Sociology.

[6]  Ronald S. Burt,et al.  Structural Holes: The Social Structure of Competition. , 1994 .

[7]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[8]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Raphael H. Heiberger,et al.  Collective Attention and Stock Prices: Evidence from Google Trends Data on Standard and Poor's 100 , 2015, PloS one.

[10]  Linton C. Freeman,et al.  The Development of Social Network Analysis—with an Emphasis on Recent Events , 2011 .

[11]  Wiebke Wagner,et al.  Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit , 2010, Lang. Resour. Evaluation.

[12]  Bruce Edmonds,et al.  Sociology and Social Theory in Agent Based Social Simulation: A Symposium , 2001, Comput. Math. Organ. Theory.

[13]  Xingming Zhao,et al.  Computational Systems Biology , 2013, TheScientificWorldJournal.

[14]  Kevin Lewis,et al.  Beyond and Below Racial Homophily: ERG Models of a Friendship Network Documented on Facebook1 , 2010, American Journal of Sociology.

[15]  Ajay Mehra The Development of Social Network Analysis: A Study in the Sociology of Science , 2005 .

[16]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[17]  Marco Gonzalez,et al.  Author's Personal Copy Social Networks Tastes, Ties, and Time: a New Social Network Dataset Using Facebook.com , 2022 .

[18]  C. Bail The Fringe Effect , 2012 .

[19]  H. Simon Models of Bounded Rationality: Empirically Grounded Economic Reason , 1997 .

[20]  A. Pentland,et al.  Computational Social Science , 2009, Science.

[21]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[22]  Roger Burrows,et al.  After the crisis? Big Data and the methodological challenges of empirical sociology , 2014 .

[23]  Thomas C. Herndon,et al.  Does high public debt consistently stifle economic growth? A critique of Reinhart and Rogoff , 2014 .

[24]  Todd A. Brun,et al.  Quantum Computing , 2011, Computer Science, The Hardware, Software and Heart of It.

[25]  Alan G. Isaac Simulating Evolutionary Games: A Python-Based Introduction , 2008, J. Artif. Soc. Soc. Simul..

[26]  F. Kalter,et al.  Rational Choice Theory and Empirical Research: Methodological and Theoretical Contributions in Europe , 2012 .

[27]  Susan Leigh Star,et al.  The Structure of Ill-Structured Solutions: Boundary Objects and Heterogeneous Distributed Problem Solving , 1989, Distributed Artificial Intelligence.

[28]  M. Lutter Do Women Suffer from Network Closure? The Moderating Effect of Social Capital on Gender Inequality in a Project-Based Labor Market, 1929 to 2010 , 2015 .

[29]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[30]  Linton C. Freeman,et al.  Interpersonal Proximity in Social and Cognitive Space , 1994 .

[31]  Allen B. Downey Think Perl 6: How to Think Like a Computer Scientist , 2017 .

[32]  C. J. Campell,et al.  The coming crisis , 1995 .

[33]  C. Bail The cultural environment: measuring culture with big data , 2014, Theory and Society.

[34]  Bruce D. McCullough,et al.  Econometric Computing with “R” , 2010 .

[35]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[36]  Kenneth Rogoff,et al.  The Forgotten History of Domestic Debt , 2008 .

[37]  George C. Homans Human Group , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[38]  Yann Chevaleyre,et al.  A Short Introduction to Computational Social Choice , 2007, SOFSEM.

[39]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[40]  Jari Saramäki,et al.  Small But Slow World: How Network Topology and Burstiness Slow Down Spreading , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  Jim Giles,et al.  Computational social science: Making the links , 2012, Nature.

[42]  N. Luhmann Essays On Self-Reference , 1990 .

[43]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[44]  G. King,et al.  Ensuring the Data-Rich Future of the Social Sciences , 2011, Science.

[45]  Balazs Vedres,et al.  Game Changer: The Topology of Creativity1 , 2015, American Journal of Sociology.

[46]  Kurt Hornik,et al.  Text Mining Infrastructure in R , 2008 .

[47]  S. Dumais Latent Semantic Analysis. , 2005 .

[48]  John Scott,et al.  The SAGE Handbook of Social Network Analysis , 2011 .

[49]  Percy S. Cohen,et al.  Modern Social Theory , 1968 .

[50]  Ulrich Ellwanger,et al.  The Theory of Fields , 2012 .

[51]  H. Maturana,et al.  Autopoiesis and Cognition : The Realization of the Living (Boston Studies in the Philosophy of Scie , 1980 .

[52]  Raphael Heiko Heiberger,et al.  U.S. and Whom? Structures and Communities of International Economic Research , 2015, J. Soc. Struct..

[53]  Harry Eugene Stanley,et al.  Catastrophic cascade of failures in interdependent networks , 2009, Nature.

[54]  Terry D. Clark,et al.  Predicting the trajectory of the evolving international cyber regime: Simulating the growth of a social network , 2015, Soc. Networks.

[55]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[56]  Pádraig Cunningham,et al.  The influence of network structures of Wikipedia discussion pages on the efficiency of WikiProjects , 2015, Soc. Networks.

[57]  D. Watts Common Sense and Sociological Explanations1 , 2014, American Journal of Sociology.

[58]  E Ray Dorsey,et al.  The coming crisis , 2013, Neurology.

[59]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[60]  Scott A. Golder,et al.  Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures , 2011 .

[61]  J. Nadal,et al.  Manifesto of computational social science , 2012 .

[62]  D. Watts The “New” Science of Networks , 2004 .

[63]  Pierre Bourdieu,et al.  Outline of a Theory of Practice , 2020, On Violence.

[64]  Stephen Kalberg,et al.  Max Weber's Types of Rationality: Cornerstones for the Analysis of Rationalization Processes in History , 1980, American Journal of Sociology.

[65]  George Sugihara,et al.  Complex systems: Ecology for bankers , 2008, Nature.

[66]  D. Ruths,et al.  Social media for large studies of behavior , 2014, Science.

[67]  Raphael H. Heiberger,et al.  Stock network stability in times of crisis , 2014 .

[68]  H. Foerster Understanding Understanding , 2002, Springer New York.

[69]  Frantisek Kalvas Introduction to Computational Social Science: Principles and Applications (Texts in Computer Science) by Claudio Cioffi-Revilla , 2015, J. Artif. Soc. Soc. Simul..

[70]  Kenneth D. Bailey,et al.  Sociology and the new systems theory : toward a theoretical synthesis , 1994 .