Learning to soar in turbulent environments

Significance Thermals are ascending currents that typically extend from the ground up to the base of the clouds. Birds and gliders piggyback thermals to fly with a reduced expenditure of energy, for example, during migration, and to extend their flying range. Flow in the thermals is highly turbulent, which poses the challenge of the orientation in strongly fluctuating environments. We combine numerical simulations of atmospheric flow with reinforcement learning methods to identify strategies of navigation that can cope with and even exploit turbulent fluctuations. Specifically, we show how the strategies evolve as the level of turbulent fluctuations increase, and we identify those sensorimotor cues that are effective at directing turbulent navigation. Birds and gliders exploit warm, rising atmospheric currents (thermals) to reach heights comparable to low-lying clouds with a reduced expenditure of energy. This strategy of flight (thermal soaring) is frequently used by migratory birds. Soaring provides a remarkable instance of complex decision making in biology and requires a long-term strategy to effectively use the ascending thermals. Furthermore, the problem is technologically relevant to extend the flying range of autonomous gliders. Thermal soaring is commonly observed in the atmospheric convective boundary layer on warm, sunny days. The formation of thermals unavoidably generates strong turbulent fluctuations, which constitute an essential element of soaring. Here, we approach soaring flight as a problem of learning to navigate complex, highly fluctuating turbulent environments. We simulate the atmospheric boundary layer by numerical models of turbulent convective flow and combine them with model-free, experience-based, reinforcement learning algorithms to train the gliders. For the learned policies in the regimes of moderate and strong turbulence levels, the glider adopts an increasingly conservative policy as turbulence levels increase, quantifying the degree of risk affordable in turbulent environments. Reinforcement learning uncovers those sensorimotor cues that permit effective control over soaring in turbulent environments.

[1]  N. Malik,et al.  Kinematic simulation of homogeneous turbulence by unsteady random Fourier modes , 1992, Journal of Fluid Mechanics.

[2]  S. Popinet Gerris: a tree-based adaptive solver for the incompressible Euler equations in complex geometries , 2003 .

[3]  D. Lenschow,et al.  The role of thermals in the convective boundary layer , 1980 .

[4]  John H. Cochrane,et al.  MacCREADY THEORY WITH UNCERTAIN LIFT AND LIMITED ALTITUDE , 1999 .

[5]  Michael J. Allen,et al.  Guidance and Control of an Autonomous Soaring UAV , 2013 .

[6]  Detlef Lohse,et al.  Heat transfer and large scale dynamics in turbulent Rayleigh-Bénard convection , 2008, 0811.0471.

[7]  George S. Young,et al.  Convection in the atmospheric boundary layer , 1988 .

[8]  John Valasek,et al.  Autonomous Soaring Using Reinforcement Learning for Trajectory Generation , 2014 .

[9]  Gerald Tesauro,et al.  Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..

[10]  U. Frisch Turbulence: The Legacy of A. N. Kolmogorov , 1996 .

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  John David Anderson,et al.  Introduction to Flight , 1985 .

[13]  I. Newton The Migration Ecology of Birds , 2007 .

[14]  R. V. Mises Theory of Flight , 1959 .

[15]  Roberto Verzicco,et al.  Prandtl number effects in convective turbulence , 1999 .

[16]  Roberto Verzicco,et al.  Numerical experiments on strongly turbulent thermal convection in a slender cylindrical cell , 2003, Journal of Fluid Mechanics.

[17]  Ran Nathan,et al.  The gliding speed of migrating birds: slow and safe or fast and risky? , 2014, Ecology letters.

[18]  Eric D. Siggia,et al.  Scalar turbulence , 2000, Nature.

[19]  C. Pennycuick Thermal Soaring Compared in Three Dissimilar Tropical Bird Species, Fregata Magnificens, Pelecanus Occidentals and Coragyps Atratus , 1983 .

[20]  Nicholas R. J. Lawrance,et al.  Long Endurance Autonomous Flight for Unmanned Aerial Vehicles , 2014 .

[21]  Tamás Vicsek,et al.  Comparing bird and human soaring strategies , 2008, Proceedings of the National Academy of Sciences.

[22]  John Harte,et al.  The Birder's Handbook: A Field Guide to the Natural History of North American Birds , 1989 .

[23]  G. Grötzbach,et al.  Spatial resolution requirements for direct numerical simulation of the Rayleigh-Bénard convection , 1983 .

[24]  M. Vergassola,et al.  Particles and fields in fluid turbulence , 2001, cond-mat/0105199.

[25]  Tamás Vicsek,et al.  Thermal soaring flight of birds and unmanned aerial vehicles , 2010, Bioinspiration & biomimetics.