Paper Information - Speeding Up Secure Computations via Embedded Caching

Speeding Up Secure Computations via Embedded Caching

Most existing work on Privacy-Preserving Data Mining (PPDM) focuses on enabling conventional data mining algorithms to run securely in a multi-party setting. Although various data mining algorithms have been enhanced with secure mechanisms for preserving data privacy, their computational cost is far too high for them to be practically useful. This is especially true for algorithms that rely on common cryptosystems. In this paper, we address the efficiency of PPDM algorithms by proposing to cache result data that are used more than once by secure computations. To make this possible, we carefully examine the micro steps of secure computations to identify their repetitive or iterative portions, and reduce the overall computational cost by caching intermediate results. We have applied this technique to decision tree induction, association rule mining, and k-means clustering, which make use of secure building blocks such as secure multi-party sum, secure matrix multiplication, and secure inverse of matrix sum. We show empirically that the computational costs of secure computations can be reduced without, in general, affecting the quality of the data mining result. Our experiments show that the caching technique generalizes to common data mining algorithms and that the efficiency of PPDM algorithms can be greatly improved without compromising data privacy.
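
To make the caching idea in the abstract concrete, the following is a minimal sketch and not the authors' protocol. It assumes a textbook additive secret-sharing version of secure multi-party sum, simulated in a single process. The names `MODULUS`, `share`, `secure_sum`, and `CachedSecureSum` are hypothetical, and the cache key stands in for whatever identifies a repeated sub-computation (for example, a count requested again during decision tree induction).

```python
import secrets

# Assumed public modulus; it must exceed any true sum the parties compute.
MODULUS = 2**61 - 1


def share(value, n_parties):
    """Split value into n additive shares that sum to value mod MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(n_parties - 1)]
    shares.append((value - sum(shares)) % MODULUS)
    return shares


def secure_sum(private_values):
    """Simulate a secure sum in which only random-looking shares are exchanged."""
    n = len(private_values)
    # Row i holds the shares that party i sends out, one per party.
    distributed = [share(v, n) for v in private_values]
    # Party j adds the j-th share from every party and publishes the partial sum.
    partials = [sum(row[j] for row in distributed) % MODULUS for j in range(n)]
    return sum(partials) % MODULUS


class CachedSecureSum:
    """Memoize secure-sum results so a repeated query skips the protocol."""

    def __init__(self):
        self._cache = {}
        self.protocol_runs = 0  # counts how often the costly protocol executes

    def query(self, key, private_values):
        if key not in self._cache:
            self.protocol_runs += 1
            self._cache[key] = secure_sum(private_values)
        return self._cache[key]


if __name__ == "__main__":
    cache = CachedSecureSum()
    local_counts = [12, 7, 30]  # one private count per party (made-up values)
    first = cache.query(("outlook", "sunny"), local_counts)
    second = cache.query(("outlook", "sunny"), local_counts)
    print(first, second, cache.protocol_runs)
```

In this sketch the second query returns the stored result, so the protocol runs only once. Such a cache is only valid while the parties' inputs for a given key stay unchanged. Because it stores only values the protocol has already revealed, it reveals nothing new.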

S. Han | K. Zhai | W. K. Ng | A. R. Herianto
