论文信息 - Weighted A* Algorithms for Unsupervised Feature Selection with Provable Bounds on Suboptimality

Weighted A* Algorithms for Unsupervised Feature Selection with Provable Bounds on Suboptimality

Identifying a small number of features that can represent the data is believed to be NP-hard. Previous approaches exploit algebraic structure and use randomization. We propose an algorithm based on ideas similar to the Weighted A* algorithm in heuristic search. Our experiments show this new algorithm to be more accurate than the current state of the art.

Ke Xu | Haim Schweitzer | Crystal Maung | Hiromasa Arai

[1] Luis Rademacher,et al. Efficient Volume Sampling for Row/Column Subset Selection , 2010, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science.

[2] Gene H. Golub,et al. Matrix computations , 1983 .

[3] Christos Boutsidis,et al. An improved approximation algorithm for the column subset selection problem , 2008, SODA.

[4] Ali Çivril,et al. Column Subset Selection Problem is UG-hard , 2014, J. Comput. Syst. Sci..

[5] Ke Xu,et al. Unsupervised Feature Selection by Heuristic Search with Provable Bounds on Suboptimality , 2016, AAAI.

[6] Haim Schweitzer,et al. Optimal Column Subset Selection by A-Star Search , 2015, AAAI.

[7] Anirban Dasgupta,et al. Feature selection methods for text classification , 2007, KDD '07.