Application of Feature Selection for Unsupervised Learning in Prosecutors' Office

Feature selection is effective in removing irrelevant data. However, the result of feature selection in unsupervised learning is not as satisfying as that in supervised learning. In this paper, we propose a novel methodology ULAC (Feature Selection for Unsupervised Learning Based on Attribute Correlation Analysis and Clustering Algorithm) to identify important features for unsupervised learning. We also apply ULAC into prosecutors' office to solve the real world application for unsupervised learning.

[1]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[2]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[3]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[4]  Carla E. Brodley,et al.  Feature Selection for Unsupervised Learning , 2004, J. Mach. Learn. Res..

[5]  Huan Liu,et al.  Feature Selection with Selective Sampling , 2002, International Conference on Machine Learning.