Is a Sampled Network a Good Enough Descriptor? Missing Links and Appropriate Choice of Representation

The spread of infectious diseases, between animals as well as between humans, is a topic often in focus. In recent decades, outbreaks of diseases such as foot-and-mouth disease, avian influenza, and swine influenza have led to an increasing interest in modelling infectious diseases, since such models can be used to elucidate disease transmission and to evaluate the impact of different control strategies. Different kinds of modelling techniques can be used, e.g. individual-based disease modelling, Bayesian analysis, Markov chain Monte Carlo simulations, and network analysis. The topic of this thesis is network analysis, since it is a useful method for studying the spread of infectious diseases. Its usefulness lies in the fact that a network describes potential transmission routes, and knowledge of their structure is valuable in predicting the spread of diseases. This thesis contains both a method for generating a wide range of theoretical networks and an examination and discussion of the usefulness of network analysis as a tool for analysing transmission of infectious animal diseases between farms in a spatial context. In addition to the theoretical networks, Swedish animal transport networks are used as empirical examples.
To answer questions about the effect of the proportion of contacts in networks, the effect of missing links, and the usefulness of network measures, we needed to generate networks with a wide range of structures. It was therefore necessary to develop a network generating algorithm. Papers I and II describe that algorithm, SpecNet, which creates spatial networks. The aim was to develop an algorithm able to generate a wide range of network structures. The performance of the algorithm was evaluated using a set of network measures. In the first study, Paper I, the algorithm succeeded in generating a wide range of values for most of the investigated network measures. Paper II improves the algorithm so that it can produce networks with low negative assortativity, by adding two classes of nodes instead of one. Besides generating theoretical networks from scratch, a network generating algorithm should also be able to regenerate a network with given, specific structures. We therefore tested regenerating two Swedish animal transport networks according to their structures. SpecNet mimicked the two empirical networks well, whereas a non-spatial network generating algorithm was less successful in regenerating the requested structures.
Sampled empirical networks are rarely complete, since contacts are often missed during sampling, e.g. because they are difficult to sample or because the sampling time window is too short. Paper III focuses on how the number of contacts in the network affects disease transmission, and on the reliability of predictions made from networks with a small proportion of missing links. Attention is also given to the spatial distribution of animal holdings in the landscape and to the effect of this distribution on the resulting disease transmission between holdings. Our results indicate that, assuming weighted contacts, it may be risky to make predictions about disease transmission from a single network replicate with as low a proportion of contacts as in most empirical animal transport networks.
In the event of a disease outbreak, it would be valuable to use network measures as predictors of the progress and extent of disease transmission. This requires a reliable network, and network measures that have the potential to make reasonable predictions about the epidemic. In Paper IV we investigate whether network measures are useful as predictors of eventual disease transmission. We also analyse whether some measures correlate better with disease transmission than others. Disease transmission simulations are performed in networks with different structures to mimic diverse spatial conditions, and the simulation results are then compared with the values of the network structures.
