Partial example acquisition in cost-sensitive learning

It is often expensive to acquire data in real-world data mining applications. Most previous data mining and machine learning research, however, assumes that a fixed set of training examples is given. In this paper, we propose an online cost-sensitive framework that allows a learner to dynamically acquire examples as it learns and to decide the ideal number of examples needed to minimize the total cost. We also propose a new strategy for Partial Example Acquisition (PAS), in which the learner can acquire examples with only a subset of attribute values in order to reduce the data acquisition cost. Experiments on UCI datasets show that the new PAS strategy is effective in reducing the total cost of data acquisition.
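As an illustration only, the sketch below shows one way such an online acquisition loop could be organized: training examples are purchased in batches, and acquisition stops once buying more no longer lowers the estimated total cost (acquisition cost paid so far plus expected misclassification cost). The choice of classifier, the batch size, the validation-based cost estimate, and the stopping rule are assumptions made for this example and are not the authors' exact procedure.

```python
# Minimal sketch of an online example-acquisition loop (illustrative only).
# Assumptions (not from the paper): total cost = acquisition cost paid so far
# + misclassification cost of the current model on a held-out validation set;
# acquisition stops as soon as the last batch fails to reduce that total.

import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import confusion_matrix

def total_cost(model, X_val, y_val, cost_matrix, acquisition_cost):
    """Acquisition cost plus misclassification cost on validation data."""
    y_pred = model.predict(X_val)
    cm = confusion_matrix(y_val, y_pred)          # counts of (true, predicted)
    return acquisition_cost + float((cm * cost_matrix).sum())

def acquire_until_cost_rises(pool_X, pool_y, X_val, y_val,
                             cost_matrix, cost_per_example, batch=10):
    """Grow the training set one batch at a time; stop when total cost stops falling."""
    X_train, y_train = pool_X[:batch], pool_y[:batch]
    spent = batch * cost_per_example
    model = GaussianNB().fit(X_train, y_train)
    best = total_cost(model, X_val, y_val, cost_matrix, spent)

    for start in range(batch, len(pool_X), batch):
        X_new, y_new = pool_X[start:start + batch], pool_y[start:start + batch]
        X_train = np.vstack([X_train, X_new])
        y_train = np.concatenate([y_train, y_new])
        spent += len(X_new) * cost_per_example
        model = GaussianNB().fit(X_train, y_train)
        cost = total_cost(model, X_val, y_val, cost_matrix, spent)
        if cost >= best:      # further acquisition no longer pays off
            break
        best = cost
    return model, best
```

Under the partial-acquisition idea described in the abstract, the same loop could instead purchase only a subset of attribute values per example, charging for each attribute acquired rather than for whole examples; that variant is not shown here.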
