K-anonymity model for privacy-preserving soccer fitness data publishing

With the development of data mining technology, more and more researchers use the soccer fitness data to analyse the ranking of soccer athletes' and professional training. However, the direct release of soccer fitness data may leak the personal privacy of soccer athletes, so how to ensure the utility of soccer fitness data and the privacy of soccer player has become an issue. In this paper, we point out the linking attack existing in soccer fitness data, which the attackers can use the auxiliary demographic data as background information to attack the published physical data. So the attackers will map the privacy data and the athlete together. At the same time, we apply the partitioning-based and k-means clustering-based two k-anonymity algorithms to the soccer fitness data publishing to trade-offs the data utility and the personal privacy. Experimental results showed that the performance of methods is convincing.

[1]  Latanya Sweeney,et al.  Achieving k-Anonymity Privacy Protection Using Generalization and Suppression , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[2]  Latanya Sweeney,et al.  k-Anonymity: A Model for Protecting Privacy , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[3]  David J. DeWitt,et al.  Incognito: efficient full-domain K-anonymity , 2005, SIGMOD '05.

[4]  Philip S. Yu,et al.  Privacy-preserving data publishing: A survey of recent developments , 2010, CSUR.

[5]  Luca Pappalardo,et al.  Effective injury forecasting in soccer with GPS training data and machine learning , 2017, PloS one.

[6]  Roberto J. Bayardo,et al.  Data privacy through optimal k-anonymization , 2005, 21st International Conference on Data Engineering (ICDE'05).

[7]  David J. DeWitt,et al.  Mondrian Multidimensional K-Anonymity , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[8]  Jian Pei,et al.  Utility-based anonymization using local recoding , 2006, KDD '06.