Kai Wang
发表
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning
pdf
Finale Doshi-Velez,
Milind Tambe,
Andrew Perrault,
2021,
ArXiv.
Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games
pdf
Milind Tambe,
Michael K. Reiter,
Lily Xu,
2021,
AAAI.
Milind Tambe,
Bistra Dilkina,
Bryan Wilder,
2021,
ArXiv.