Privacy Preserving C4.5 Algorithm over Vertically Distributed Datasets
暂无分享,去创建一个
It is a primary task in the privacy-preserving data mining in the distributed environment how to protect privacy and at the same time acquire accurate data relation. This paper shows how two parties built a decision tree collaboratively without revealing privacy when datasets is vertically distributed, including a PPC4.5 algorithm for privacy preserving via C4.5 over vertically distributed datasets and an algorithm of the best split attribute and the information gain ratio of the node. Further, the secure scalar product protocol and the x¿(x) protocol are used in collaborative computing, which can protect privacy effectively.
[1] Yiqun Huang,et al. A method of security improvement for privacy preserving association rule mining over vertically partitioned data , 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).
[2] Bart Kuijpers,et al. Privacy Preserving ID3 over Horizontally, Vertically and Grid Partitioned Data , 2008, ArXiv.
[3] Wenliang Du,et al. Building decision tree classifier on private data , 2002 .
[4] Yehuda Lindell,et al. Privacy Preserving Data Mining , 2002, Journal of Cryptology.