论文信息 - Relational Reinforcement Learning - 字舞流文

Relational Reinforcement Learning

De | M Bruynooghe

[1] Tim Oates,et al. Learning in Worlds with Objects , 2017, Encyclopedia of Machine Learning and Data Mining.

[2] Marcel Abendroth,et al. Data Mining Practical Machine Learning Tools And Techniques With Java Implementations , 2016 .

[3] U. Rieder,et al. Markov Decision Processes , 2010 .

[4] Dimitri P. Bertsekas,et al. Neuro-Dynamic Programming , 2009, Encyclopedia of Optimization.

[5] Jens Vygen,et al. The Book Review Column1 , 2020, SIGACT News.

[6] Abhijit Gosavi,et al. Self-Improving Factory Simulation using Continuous-time Average-Reward Reinforcement Learning , 2007 .

[7] Thomas Gärtner,et al. Graph kernels and Gaussian processes for relational reinforcement learning , 2006, Machine Learning.

[8] David W. Aha,et al. Instance-Based Learning Algorithms , 1991, Machine Learning.

[9] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10] Saso Dzeroski,et al. Integrating Guidance into Relational Reinforcement Learning , 2004, Machine Learning.

[11] Luc De Raedt,et al. Bellman goes relational , 2004, ICML.

[12] M. van Otterlo. Reinforcement Learning for Relational MDPs , 2004 .

[13] Erik D. Demaine,et al. Tetris is hard, even to approximate , 2002, Int. J. Comput. Geom. Appl..

[14] Hector J. Levesque,et al. On the Semantics of Deliberation in IndiGolog — from Theory to Implementation , 2002, Annals of Mathematics and Artificial Intelligence.

[15] Hannu Toivonen,et al. Discovery of frequent DATALOG patterns , 1999, Data Mining and Knowledge Discovery.

[16] Luc De Raedt,et al. Scaling Up Inductive Logic Programming by Learning from Interpretations , 1999, Data Mining and Knowledge Discovery.

[17] Paul E. Utgoff,et al. Decision Tree Induction Based on Efficient Tree Restructuring , 1997, Machine Learning.

[18] Andrew W. Moore,et al. Locally Weighted Learning , 1997, Artificial Intelligence Review.

[19] Gerald Tesauro,et al. Practical issues in temporal difference learning , 1992, Machine Learning.

[20] DIMITRIOS PIERRAKOS,et al. User Modeling and User-Adapted Interaction , 2004, User Modeling and User-Adapted Interaction.

[21] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.

[22] Dirk Ormoneit,et al. Kernel-Based Reinforcement Learning , 2004, Machine Learning.

[23] Carl E. Rasmussen,et al. Gaussian Processes in Reinforcement Learning , 2003, NIPS.

[24] Robert Givan,et al. Approximate Policy Iteration with a Policy Language Bias , 2003, NIPS.

[25] Kurt Driessens,et al. Relational Instance Based Regression for Relational Reinforcement Learning , 2003, ICML.

[26] Carlos Guestrin,et al. Generalizing plans to new environments in relational MDPs , 2003, IJCAI 2003.

[27] Thomas Gärtner,et al. A survey of kernels for structured data , 2003, SKDD.

[28] Mike Barley,et al. Intelligent Agents and Multi-Agent Systems , 2003, Lecture Notes in Computer Science.

[29] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..

[30] Luc De Raedt,et al. Logical Markov Decision Programs , 2003 .

[31] Eduardo F. Morales,et al. Scaling Up Reinforcement Learning with a Relational Representation , 2003 .

[32] Maurice Bruynooghe,et al. Aggregation versus selection bias, and relational neural networks , 2003 .

[33] Thomas Gärtner,et al. On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[34] Thomas Gärtner,et al. Kernels for structured data , 2008, Series in Machine Perception and Artificial Intelligence.

[35] Robert Givan,et al. Inductive Policy Selection for First-Order MDPs , 2002, UAI.

[36] Tim Oates,et al. The Thing that we Tried Didn't Work very Well: Deictic Representation in Reinforcement Learning , 2002, UAI.

[37] van Martijn Otterlo. Relational Representations in Reinforcement Learning: Review and Open Problems , 2002, ICML 2002.

[38] Saso Dzeroski,et al. Integrating Experimentation and Guidance in Relational Reinforcement Learning , 2002, ICML.

[39] Wim Van Laer. From Propositional to First Order Logic in Machine Learning and Data Mining - Induction of first ord , 2002 .

[40] Michail G. Lagoudakis,et al. Least-Squares Methods in Reinforcement Learning for Control , 2002, SETN.

[41] Hisashi Kashima,et al. Kernels for graph classification , 2002 .

[42] Saso Dzeroski,et al. On using guidance in relational reinforcement learning , 2002 .

[43] Jan Ramon,et al. Clustering and instance based learning in first order logic , 2002, AI Communications.

[44] Jeffrey M. Forbes,et al. Representations for learning control policies , 2002 .

[45] Stefan Kramer,et al. Inducing classification and regression trees in first order logic , 2001 .

[46] Kurt Driessens,et al. Learning digger using hierarchical reinforcement learning for concurrent goals , 2001 .

[47] Hendrik Blockeel,et al. From Shell Logs to Shell Scripts , 2001, ILP.

[48] Kurt Driessens,et al. Speeding Up Relational Reinforcement Learning through the Use of an Incremental First Order Decision Tree Learner , 2001, ECML.

[49] Maurice Bruynooghe,et al. A polynomial time computable metric between point sets , 2001, Acta Informatica.

[50] Craig Boutilier,et al. Symbolic Dynamic Programming for First-Order MDPs , 2001, IJCAI.

[51] Ross D. Shachter,et al. Using background knowledge to speed reinforcement learning in physical agents , 2001, AGENTS '01.

[52] Ashwin Srinivasan,et al. Warmr: a data mining tool for chemical data , 2001, J. Comput. Aided Mol. Des..

[53] Michael Collins,et al. Convolution Kernels for Natural Language , 2001, NIPS.

[54] Xin Wang,et al. Batch Value Function Approximation via Support Vectors , 2001, NIPS.

[55] John K. Slaney,et al. Blocks World revisited , 2001, Artif. Intell..

[56] S. Džeroski,et al. Relational Data Mining , 2001, Springer Berlin Heidelberg.

[57] Nello Cristianini,et al. Classification using String Kernels , 2000 .

[58] Bart Demoen,et al. Executing Query Packs in ILP , 2000, ILP.

[59] Leslie Pack Kaelbling,et al. Practical Reinforcement Learning in Continuous Spaces , 2000, ICML.

[60] Stefan Schaal,et al. Real-time robot learning with locally weighted statistical learning , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[61] W. Imrich,et al. Product Graphs: Structure and Recognition , 2000 .

[62] Nello Cristianini,et al. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[63] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..

[64] K. R. Dixon,et al. Incorporating Prior Knowledge and Previously Learned Information into Reinforcement Learning Agents , 2000 .

[65] Gunnar Rätsch,et al. Engineering Support Vector Machine Kerneis That Recognize Translation Initialion Sites , 2000, German Conference on Bioinformatics.

[66] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[67] Craig Boutilier,et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..

[68] Andrew McCallum,et al. Using Reinforcement Learning to Spider the Web Efficiently , 1999, ICML.

[69] Marco Wiering,et al. Explorations in efficient reinforcement learning , 1999 .

[70] Hendrik Blockeel,et al. Top-Down Induction of First Order Logical Decision Trees , 1998, AI Commun..

[71] Luc Dehaspe. Frequent Pattern Discovery in First-Order Logic , 1999, AI Commun..

[72] David Haussler,et al. Convolution kernels on discrete structures , 1999 .

[73] Luc De Raedt,et al. Top-Down Induction of Clustering Trees , 1998, ICML.

[74] Wim Van Laer,et al. A methodology for first order learning: a case study , 1998 .

[75] D. Mackay,et al. Introduction to Gaussian processes , 1998 .

[76] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.

[77] Michèle Sebag,et al. Distance Induction in First Order Logic , 1997, ILP.

[78] Russell Greiner,et al. Why Experimentation can be better than "Perfect Guidance" , 1997, ICML.

[79] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.

[80] Dietrich Wettschereck,et al. Relational Instance-Based Learning , 1996, ICML.

[81] Ivan Bratko,et al. Learning Models of Control Skills: Phenomena, Results and Problems , 1996 .

[82] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[83] Melanie. Mitchell,et al. An introduction to genetic algorithms [electronic resource] , 1996 .

[84] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .

[85] Mark Humphrys. W-learning: Competition among selfish Q-learners , 1995 .

[86] Xuemei Wang,et al. Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition , 1995, ICML.

[87] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[88] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[89] Pat Langley,et al. Elements of Machine Learning , 1995 .

[90] Luc De Raedt,et al. First-Order jk-Clausal Theories are PAC-Learnable , 1994, Artif. Intell..

[91] Michael I. Jordan,et al. MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .

[92] Andrew G. Barto,et al. Monte Carlo Matrix Inversion and Reinforcement Learning , 1993, NIPS.

[93] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[94] David W. Aha,et al. Instance‐based prediction of real‐valued attributes , 1989, Comput. Intell..

[95] C. Watkins. Learning from delayed rewards , 1989 .

[96] David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[97] Stephen Barnett,et al. Matrix Methods for Engineers and Scientists , 1982 .

[98] Nils J. Nilsson,et al. Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[99] Michael J. Fischer,et al. The String-to-String Correction Problem , 1974, JACM.

[100] Richard Fikes,et al. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[101] Donald Michie,et al. Man-Machine Co-operation on a Learning Task , 1969 .

[102] Frank Harary,et al. Graph Theory , 2016 .

[103] Richard Bellman,et al. Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[104] N. Aronszajn. Theory of Reproducing Kernels. , 1950 .