A calibrated measure to compare fluctuations of different entities across timescales

A common way to learn about a system’s properties is to analyze temporal fluctuations in associated variables. However, conclusions based on fluctuations from a single entity can be misleading when used without proper reference to other comparable entities or when examined only on one timescale. Here we introduce a method that uses predictions from a fluctuation scaling law as a benchmark for the observed standard deviations. Differences from the benchmark (residuals) are aggregated across multiple timescales using Principal Component Analysis to reduce data dimensionality. The first component score is a calibrated measure of fluctuations—the reactivity RA of a given entity. We apply our method to activity records from the media industry using data from the Event Registry news aggregator—over 32M articles on selected topics published by over 8000 news outlets. Our approach distinguishes between different news outlet reporting styles: high reactivity points to activity fluctuations larger than expected, reflecting a bursty reporting style, whereas low reactivity suggests a relatively stable reporting style. Combining our method with the political bias detector Media Bias/Fact Check we quantify the relative reporting styles for different topics of mainly US media sources grouped by political orientation. The results suggest that news outlets with a liberal bias tended to be the least reactive while conservative news outlets were the most reactive.

[1]  Gregor Leban,et al.  Modelling of temporal fluctuation scaling in online news network with independent cascade model , 2018, Physica A: Statistical Mechanics and its Applications.

[2]  H. F. Smith An empirical law describing heterogeneity in the yields of agricultural crops , 1938, The Journal of Agricultural Science.

[3]  Vu Duong,et al.  Human activity under high pressure: A case study on fluctuation scaling of air traffic controller’s communication behaviors , 2016 .

[4]  Steven Skiena,et al.  MediaRank: Computational Ranking of Online News Sources , 2019, KDD.

[5]  Lawrence W. Sheppard,et al.  Proximate determinants of Taylor's law slopes , 2019, The Journal of animal ecology.

[6]  Bent Jørgensen,et al.  Taylor's power law and fluctuation scaling explained by a central-limit-like convergence. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Johan Bollen,et al.  A Principal Component Analysis of 39 Scientific Impact Measures , 2009, PloS one.

[8]  U. Fano Ionization Yield of Radiations. II. The Fluctuations of the Number of Ions , 1947 .

[9]  F. Shaffer,et al.  An Overview of Heart Rate Variability Metrics and Norms , 2017, Front. Public Health.

[10]  Markus Ringnér,et al.  What is principal component analysis? , 2008, Nature Biotechnology.

[11]  Stefan Rotter,et al.  Measurement of variability dynamics in cortical spike trains , 2008, Journal of Neuroscience Methods.

[12]  Jan Choloniewski,et al.  Key courses of academic curriculum uncovered by data mining of students' grades , 2016, 1604.07074.

[13]  Amos Maritan,et al.  Zipf's and Taylor's laws , 2018, Physical Review E.

[14]  Quentin S. Hanley,et al.  Fluctuation Scaling, Taylor’s Law, and Crime , 2014, PloS one.

[15]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[16]  Rosario N. Mantegna,et al.  Book Review: An Introduction to Econophysics, Correlations, and Complexity in Finance, N. Rosario, H. Mantegna, and H. E. Stanley, Cambridge University Press, Cambridge, 2000. , 2000 .

[17]  James Glass,et al.  FAKTA: An Automatic End-to-End Fact Checking System , 2019, NAACL.

[18]  I. Prigogine,et al.  Book Review: Modern Thermodynamics: From Heat Engines to Dissipative Structures , 1998 .

[19]  Hernán A. Makse,et al.  CUNY Academic Works , 2022 .

[20]  Sune Lehmann,et al.  Accelerating dynamics of collective attention , 2019, Nature Communications.

[21]  Misako Takayasu,et al.  Statistical properties of fluctuations of time series representing the appearance of words in nationwide blog data and their applications: An example of observations and the modelling of fluctuation scalings of nonstationary time series , 2016, Physical review. E.

[22]  Peter M. A. Sloot,et al.  Categorical and Geographical Separation in Science , 2013, Scientific Reports.

[23]  Preslav Nakov,et al.  Predicting the Topical Stance of Media and Popular Twitter Users , 2019, ArXiv.

[24]  Jack Dongarra,et al.  LAPACK: a portable linear algebra library for high-performance computers , 1990, SC.

[25]  Randi Naes,et al.  Order Book Characteristics and the Volume-Volatility Relation: Empirical Evidence from a Limit Order Market , 2004 .

[26]  P. Allison Measures of Inequality , 1978 .

[27]  Eduardo G. Altmann,et al.  Scaling laws and fluctuations in the statistics of word frequencies , 2014, ArXiv.

[28]  P. Cizeau,et al.  Statistical properties of the volatility of price fluctuations. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[29]  Preslav Nakov,et al.  Proppy: Organizing the news based on their propagandistic content , 2019, Inf. Process. Manag..

[30]  L. R. Taylor,et al.  Aggregation, Variance and the Mean , 1961, Nature.

[31]  J. Kwapień,et al.  Physical approach to complex systems , 2012 .

[32]  Preslav Nakov,et al.  Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata Information , 2019, INTERSPEECH.

[33]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[34]  A. Vulpiani,et al.  Fluctuation-dissipation: Response theory in statistical physics , 2008, 0803.0719.

[35]  D.sc. THE PROBLEM OF LONG-TERM STORAGE IN RESERVOIRS , 1956 .

[36]  J. Galen Buckwalter,et al.  Heart Rate Variability as an Index of Resilience. , 2020, Military medicine.

[37]  P. Linsley,et al.  Risk reporting by the largest UK companies: readability and lack of obfuscation , 2007 .

[38]  Boleslaw K. Szymanski,et al.  Impact of Attributes on Group Formation , 2018, 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[39]  W. Kruskal,et al.  Use of Ranks in One-Criterion Variance Analysis , 1952 .

[40]  H. Stanley,et al.  Multifractal properties of price fluctuations of stocks and commodities , 2003, cond-mat/0308012.

[41]  Bela Gipp,et al.  Automated identification of media bias in news articles: an interdisciplinary literature review , 2018, International Journal on Digital Libraries.

[42]  Marko Grobelnik,et al.  Event registry: learning about world events from news , 2014, WWW.

[43]  Gen Sakoda,et al.  Tracking Poisson Parameter for Non-Stationary Discontinuous Time Series with Taylor’s Abnormal Fluctuation Scaling , 2019, Stats.

[44]  Preslav Nakov,et al.  Predicting Factuality of Reporting and Bias of News Media Sources , 2018, EMNLP.

[45]  Nathan Pike,et al.  Using false discovery rates for multiple comparisons in ecology and evolution , 2011 .

[46]  Mahesan Niranjan,et al.  Genomic Analysis of the Function of the Transcription Factor gata3 during Development of the Mammalian Inner Ear , 2009, PloS one.

[47]  Huan Liu,et al.  Beyond News Contents: The Role of Social Context for Fake News Detection , 2017, WSDM.

[48]  Darinka Dentcheva,et al.  On the price of risk in a mean-risk optimization model , 2018 .

[49]  Wei Chen,et al.  A correlation-matrix-based hierarchical clustering method for functional connectivity analysis , 2012, Journal of Neuroscience Methods.

[50]  Gustavo Deco,et al.  Neural Network Mechanisms Underlying Stimulus Driven Variability Reduction , 2012, PLoS Comput. Biol..

[51]  Kristina Lerman,et al.  Who Falls for Online Political Manipulation? , 2018, WWW.

[52]  A. Kappas,et al.  Temporal Taylor's scaling of facial electromyography and electrodermal activity in the course of emotional stimulation , 2016, 1601.01649.

[53]  J. Kertész,et al.  Fluctuation scaling in complex systems: Taylor's law and beyond , 2007, 0708.2053.

[54]  Nitesh V. Chawla,et al.  ONE-M: Modeling the Co-evolution of Opinions and Network Connections , 2018, ECML/PKDD.

[55]  Maksim Terpilowski,et al.  scikit-posthocs: Pairwise multiple comparison tests in Python , 2019, J. Open Source Softw..

[56]  Nitesh V. Chawla,et al.  Influence of Personal Preferences on Link Dynamics in Social Networks , 2017, Complex..

[57]  Joel E. Cohen,et al.  Synchrony affects Taylor’s law in theory and data , 2017, Proceedings of the National Academy of Sciences.