论文信息 - Autonomous skill discovery with quality-diversity and unsupervised descriptors

Autonomous skill discovery with quality-diversity and unsupervised descriptors

Quality-Diversity optimization is a new family of optimization algorithms that, instead of searching for a single optimal solution to solving a task, searches for a large collection of solutions that all solve the task in a different way. This approach is particularly promising for learning behavioral repertoires in robotics, as such a diversity of behaviors enables robots to be more versatile and resilient. However, these algorithms require the user to manually define behavioral descriptors, which is used to determine whether two solutions are different or similar. The choice of a behavioral descriptor is crucial, as it completely changes the solution types that the algorithm derives. In this paper, we introduce a new method to automatically define this descriptor by combining Quality-Diversity algorithms with unsupervised dimensionality reduction algorithms. This approach enables robots to autonomously discover the range of their capabilities while interacting with their environment. The results from two experimental scenarios demonstrate that robot can autonomously discover a large range of possible behaviors, without any prior knowledge about their morphology and environment. Furthermore, these behaviors are deemed to be similar to hand-crafted solutions that uses domain knowledge and significantly more diverse than when using existing unsupervised methods.

Antoine Cully | Antoine Cully

[1] R. Miikkulainen,et al. Learning Behavior Characterizations for Novelty Search , 2016, GECCO.

[2] J. MacQueen. Some methods for classification and analysis of multivariate observations , 1967 .

[3] Antoine Cully,et al. Evolving a Behavioral Repertoire for a Walking Robot , 2013, Evolutionary Computation.

[4] Jean-Baptiste Mouret,et al. Using Centroidal Voronoi Tessellations to Scale Up the Multidimensional Archive of Phenotypic Elites Algorithm , 2016, IEEE Transactions on Evolutionary Computation.

[5] Kenneth O. Stanley,et al. Quality Diversity: A New Frontier for Evolutionary Computation , 2016, Front. Robot. AI.

[6] Stéphane Doncieux,et al. Beyond black-box optimization: a review of selective pressures for evolutionary robotics , 2014, Evol. Intell..

[7] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[8] Antoine Cully,et al. Robots that can adapt like animals , 2014, Nature.

[9] Jean-Baptiste Mouret,et al. Illuminating search spaces by mapping elites , 2015, ArXiv.

[10] Julian Togelius,et al. Talakat: bullet hell generation through constrained map-elites , 2018, GECCO.

[11] Simon Haykin,et al. GradientBased Learning Applied to Document Recognition , 2001 .