Active Learning with Disagreement Graphs

We present two novel enhancements of an online importance-weighted active learning algorithm IWAL, using the properties of disagreements among hypotheses. The first enhancement, IWALD, prunes the hypothesis set with a more aggressive strategy based on the disagreement graph. We show that IWAL-D improves the generalization performance and the label complexity of the original IWAL, and quantify the improvement in terms of a disagreement graph coefficient. The second enhancement, IZOOM, further improves IWAL-D by adaptively zooming into the current version space and thus reducing the best-in-class error. We show that IZOOM admits favorable theoretical guarantees with the changing hypothesis set. We report experimental results on multiple datasets and demonstrate that the proposed algorithms achieve better test performances than IWAL given the same amount of labeling budget.

[1]  Maria-Florina Balcan,et al.  Active and passive learning of linear separators under log-concave distributions , 2012, COLT.

[2]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[3]  Steve Hanneke,et al.  Theory of Disagreement-Based Active Learning , 2014, Found. Trends Mach. Learn..

[4]  Chicheng Zhang,et al.  Efficient active learning of sparse halfspaces , 2018, COLT.

[5]  John Langford,et al.  Agnostic active learning , 2006, J. Comput. Syst. Sci..

[6]  Maria-Florina Balcan,et al.  Margin Based Active Learning , 2007, COLT.

[7]  Kamalika Chaudhuri,et al.  Beyond Disagreement-Based Agnostic Active Learning , 2014, NIPS.

[8]  Sanjoy Dasgupta,et al.  A General Agnostic Active Learning Algorithm , 2007, ISAIM.

[9]  John Langford,et al.  Importance weighted active learning , 2008, ICML '09.

[10]  Maria-Florina Balcan,et al.  The Power of Localization for Efficiently Learning Linear Separators with Noise , 2013, J. ACM.

[11]  Maria-Florina Balcan,et al.  Efficient Learning of Linear Separators under Bounded Noise , 2015, COLT.

[12]  Steve Hanneke,et al.  A bound on the label complexity of agnostic active learning , 2007, ICML '07.

[13]  John Shawe-Taylor,et al.  Covering numbers for support vector machines , 1999, COLT '99.

[14]  John Langford,et al.  Agnostic Active Learning Without Constraints , 2010, NIPS.

[15]  Claudio Gentile,et al.  Region-Based Active Learning , 2019, AISTATS.

[16]  Adam Tauman Kalai,et al.  Analysis of Perceptron-Based Active Learning , 2009, COLT.