The # BTW 17 Twitter Dataset – Recorded Tweets of the Federal Election Campaigns of 2017 for the 19 th German Bundestag

The German Bundestag elections are the most important elections in Germany. This dataset comprises Twitter interactions related to German politicians of the most important political parties over several months in the (pre-)phase of the German federal election campaigns in 2017. The Twitter accounts of more than 360 politicians were followed for four months. The collected data comprise a sample of approximately 10 GB of Twitter raw data, and they cover more than 120,000 active Twitter users and more than 1,200,000 recorded tweets. Even without sophisticated data analysis techniques, it was possible to deduce a likely political party proximity for more than half of these accounts simply by looking at the re-tweet behavior. This might be of interest for innovative data-driven party campaign strategists in the future. Furthermore, it is observable, that, in Germany, supporters and politicians of populist parties make use of Twitter much more intensively and aggressively than supporters of other parties. Furthermore, established left-wing parties seem to be more active on Twitter than established conservative parties. The dataset can be used to study how political parties, their followers and supporters make use of social media channels in political election campaigns and what kind of content is shared. Data Set: https://doi.org/10.5281/zenodo.835735 Data Set License: CC BY 4.0

[1]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[2]  Pablo Barberá,et al.  Understanding the Political Representativeness of Twitter Users , 2015 .

[3]  Panagiotis Takis Metaxas,et al.  Limits of Electoral Predictions Using Twitter , 2011, ICWSM.

[4]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[5]  Jason Baldridge,et al.  Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph , 2011, ULNLP@EMNLP.

[6]  Yazhe Wang,et al.  Should We Use the Sample? Analyzing Datasets Sampled from Twitter’s Stream API , 2015, ACM Trans. Web.

[7]  Shrikanth S. Narayanan,et al.  A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle , 2012, ACL.

[8]  Jacob R. Straus Social Media in Congress: The Impact of Electronic Media on Member Communications , 2016 .

[9]  Johan Bos,et al.  Predicting the 2011 Dutch Senate Election Results with Twitter , 2012 .

[10]  Andreas Jungherr,et al.  Tweets and votes, a special relationship: the 2009 federal election in germany , 2013, PLEAD '13.

[11]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.

[12]  Mohand Boughanem,et al.  Uncovering Like-minded Political Communities on Twitter , 2017, ICTIR.

[13]  J. Cook,et al.  Twitter Adoption and Activity in U.S. Legislatures: A 50-State Study , 2017 .

[14]  Orestis Papakyriakopoulos,et al.  Social Media und Microtargeting in Deutschland , 2017, Informatik-Spektrum.

[15]  Jorge Ferraz de Abreu,et al.  From Live TV Events to Twitter Status Updates - a Study on Delays , 2016, jAUTI.

[16]  Silvio Waisbord,et al.  Populist communication by digital means: presidential Twitter in Latin America , 2017 .