Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?
暂无分享,去创建一个
[1] Vladimir Kolmogorov,et al. Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[2] Hal Daumé,et al. Structured Prediction via Learning to Search under Bandit Feedback , 2017, SPNLP@EMNLP.
[3] G. Evans,et al. Learning to Optimize , 2008 .
[4] Sergey Levine,et al. Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic , 2016, ICLR.
[5] Le Song,et al. Discriminative Embeddings of Latent Variable Models for Structured Data , 2016, ICML.
[6] William Yang Wang,et al. Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning , 2018, ACL.
[7] Tomás Werner,et al. Revisiting the Linear Programming Relaxation Approach to Gibbs Energy Minimization and Weighted Constraint Satisfaction , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[8] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[9] Ullrich Köthe,et al. The Lazy Flipper: Efficient Depth-Limited Exhaustive Search in Discrete Graphical Models , 2012, ECCV.
[10] Xiaoxiao Li,et al. Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Vaibhava Goel,et al. Self-Critical Sequence Training for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Michail G. Lagoudakis,et al. Learning to Select Branching Rules in the DPLL Procedure for Satisfiability , 2001, Electron. Notes Discret. Math..
[13] Ofer Meshi,et al. Asynchronous Parallel Coordinate Minimization for MAP Inference , 2017, NIPS.
[14] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.
[15] Kyunghyun Cho,et al. Task-Oriented Query Reformulation with Reinforcement Learning , 2017, EMNLP.
[16] Yunguan Fu,et al. Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization , 2018, ArXiv.
[17] Marc Pollefeys,et al. Globally Convergent Parallel MAP LP Relaxation Solver using the Frank-Wolfe Algorithm , 2014, ICML.
[18] Stephen Gould,et al. Accelerated dual decomposition for MAP inference , 2010, ICML.
[19] Samy Bengio,et al. Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.
[20] Le Song,et al. 2 Common Formulation for Greedy Algorithms on Graphs , 2018 .
[21] Wenhan Xiong,et al. DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning , 2017, EMNLP.
[22] Dale Schuurmans,et al. Reward Augmented Maximum Likelihood for Neural Structured Prediction , 2016, NIPS.
[23] Andrew Blake,et al. "GrabCut" , 2004, ACM Trans. Graph..
[24] Regina Barzilay,et al. Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning , 2016, EMNLP.
[25] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.
[26] Ofer Meshi,et al. Deep Structured Prediction with Nonlinear Output Transformations , 2018, NeurIPS.
[27] D. Greig,et al. Exact Maximum A Posteriori Estimation for Binary Images , 1989 .
[28] Ofer Meshi,et al. Smooth and Strong: MAP Inference with Linear Convergence , 2015, NIPS.
[29] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[30] Le Song,et al. Learning to Branch in Mixed Integer Programming , 2016, AAAI.
[31] Vladimir Kolmogorov,et al. What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[32] Tommi S. Jaakkola,et al. Fixing Max-Product: Convergent Message Passing Algorithms for MAP LP-Relaxations , 2007, NIPS.
[33] Eric P. Xing,et al. An Augmented Lagrangian Approach to Constrained MAP Inference , 2011, ICML.
[34] Alexander G. Schwing,et al. Graph Structured Prediction Energy Networks , 2019, NeurIPS.
[35] Marie-Pierre Jolly,et al. Interactive Graph Cuts for Optimal Boundary and Region Segmentation of Objects in N-D Images , 2001, ICCV.
[36] Eric V. Denardo,et al. Flows in Networks , 2011 .
[37] Jitendra Malik,et al. Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[38] Luca Maria Gambardella,et al. Fast image scanning with deep max-pooling convolutional neural networks , 2013, 2013 IEEE International Conference on Image Processing.
[39] Vibhav Vineet,et al. Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[40] Tommi S. Jaakkola,et al. Tightening LP Relaxations for MAP using Message Passing , 2008, UAI.
[41] Marc Pollefeys,et al. Distributed message passing for large scale graphical models , 2011, CVPR 2011.
[42] Olga Veksler,et al. Markov random fields with efficient approximations , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).
[43] Chen Liang,et al. Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision , 2016, ACL.
[44] Endre Boros,et al. A graph cut algorithm for higher-order Markov Random Fields , 2011, 2011 International Conference on Computer Vision.
[45] Joelle Pineau,et al. An Actor-Critic Algorithm for Sequence Prediction , 2016, ICLR.
[46] Marc'Aurelio Ranzato,et al. Sequence Level Training with Recurrent Neural Networks , 2015, ICLR.
[47] Marc Pollefeys,et al. Globally Convergent Dual MAP LP Relaxation Solvers using Fenchel-Young Margins , 2012, NIPS.
[48] Martin J. Wainwright,et al. On the Optimality of Tree-reweighted Max-product Message-passing , 2005, UAI.
[49] Olga Veksler,et al. Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.
[50] Vladimir Kolmogorov,et al. An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[51] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.
[52] Nando de Freitas,et al. Learning Compositional Neural Programs with Recursive Tree Search and Planning , 2019, NeurIPS.
[53] Chen Liang,et al. Memory Augmented Policy Optimization for Program Synthesis with Generalization , 2018, ArXiv.
[54] Raquel Urtasun,et al. Fully Connected Deep Structured Networks , 2015, ArXiv.
[55] Alan L. Yuille,et al. Statistical cues for domain specific image segmentation with performance analysis , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).
[56] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[57] Horst Samulowitz,et al. Learning to Solve QBF , 2007, AAAI.
[58] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[59] Martin J. Wainwright,et al. Tree-reweighted belief propagation algorithms and approximate ML estimation by pseudo-moment matching , 2003, AISTATS.
[60] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[61] Jason K. Johnson,et al. Convex relaxation methods for graphical models: Lagrangian and maximum entropy approaches , 2008 .
[62] Martin J. Wainwright,et al. Message-passing for Graph-structured Linear Programs: Proximal Methods and Rounding Schemes , 2010, J. Mach. Learn. Res..
[63] Iasonas Kokkinos,et al. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.
[64] George Papandreou,et al. Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.
[65] Christoph Schnörr,et al. A bundle approach to efficient MAP-inference by Lagrangian relaxation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[66] Philip H. S. Torr,et al. Higher Order Conditional Random Fields in Deep Neural Networks , 2015, ECCV.
[67] Tomás Werner,et al. A Linear Programming Approach to Max-Sum Problem: A Review , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[68] He He,et al. Learning to Search in Branch and Bound Algorithms , 2014, NIPS.
[69] David A. Forsyth,et al. Structural Consistency and Controllability for Diverse Colorization , 2018, ECCV.
[70] Xiaogang Wang,et al. Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[71] Geoffrey Zweig,et al. Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning , 2017, ACL.
[72] Jannis Bulian,et al. Ask the Right Questions: Active Question Reformulation with Reinforcement Learning , 2017, ICLR.
[73] Andrew W. Moore,et al. Learning Evaluation Functions to Improve Optimization by Local Search , 2001, J. Mach. Learn. Res..
[74] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[75] Seunghoon Hong,et al. Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[76] Andrew V. Goldberg,et al. A new approach to the maximum flow problem , 1986, STOC '86.
[77] Subhransu Maji,et al. Semantic contours from inverse detectors , 2011, 2011 International Conference on Computer Vision.
[78] Andreas Geiger,et al. MOTS: Multi-Object Tracking and Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[79] Alan L. Yuille,et al. Learning Deep Structured Models , 2014, ICML.
[80] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[81] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.
[82] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[83] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.
[84] Richard Socher,et al. A Deep Reinforced Model for Abstractive Summarization , 2017, ICLR.
[85] Tie-Yan Liu,et al. Dual Learning for Machine Translation , 2016, NIPS.
[86] Judea Pearl,et al. Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach , 1982, AAAI.
[87] Antonio Criminisi,et al. TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.
[88] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.
[89] Xiaoxiao Li,et al. Semantic Image Segmentation via Deep Parsing Network , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[90] Joseph Naor,et al. Approximation algorithms for the metric labeling problem via a new linear programming formulation , 2001, SODA '01.
[91] Ofer Meshi,et al. An Alternating Direction Method for Dual MAP LP Relaxation , 2011, ECML/PKDD.
[92] Miguel Á. Carreira-Perpiñán,et al. Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..
[93] Matthew J. Hausknecht,et al. Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis , 2018, ICLR.