Doubly Robust Policy Evaluation and Learning
暂无分享,去创建一个
[1] D. Horvitz,et al. A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .
[2] C. Cassel,et al. Some results on generalized difference estimation and generalized regression estimation for finite populations , 1976 .
[3] J. Robins,et al. Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .
[4] J. Robins,et al. Semiparametric Efficiency in Multivariate Regression Models with Missing Data , 1995 .
[5] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[6] J. Lunceford,et al. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.
[7] Joseph Kang,et al. Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2958.
[8] Diane Lambert,et al. More bang for their bucks: assessing new features for online advertisers , 2007, SKDD.
[9] Marie Davidian,et al. Comment: Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data. , 2008, Statistical science : a review journal of the Institute of Mathematical Statistics.
[10] J. Langford,et al. The Epoch-Greedy algorithm for contextual multi-armed bandits , 2007, NIPS 2007.
[11] A. Beygelzimer. Multiclass Classification with Filter Trees , 2007 .
[12] John Langford,et al. Exploration scavenging , 2008, ICML '08.
[13] Ian H. Witten,et al. The WEKA data mining software: an update , 2009, SKDD.
[14] Neil D. Lawrence,et al. Dataset Shift in Machine Learning , 2009 .
[15] John Langford,et al. The offset tree for learning with partial labels , 2008, KDD.
[16] Tamir Hazan,et al. Direct Loss Minimization for Structured Prediction , 2010, NIPS.
[17] Lihong Li,et al. Learning from Logged Implicit Exploration Data , 2010, NIPS.
[18] Rong Ge,et al. Evaluating online ad campaigns in a pipeline: causal models at scale , 2010, KDD.
[19] Elad Hazan,et al. Better Algorithms for Benign Bandits , 2009, J. Mach. Learn. Res..