Optimizing Crop Management with Reinforcement Learning and Imitation Learning

Crop management, including nitrogen (N) fertilization and irrigation management, has a significant impact on the crop yield, economic profit, and the environment. Although man- agement guidelines exist, it is challenging to find the optimal management practices given a specific planting environ- ment and a crop. Previous work used reinforcement learning (RL) and crop simulators to solve the problem, but the trained policies either have limited performance or are not deployable in the real world. In this paper, we present an intelligent crop management system which optimizes the N fertilization and irrigation simultaneously via RL, imitation learning (IL), and crop simulations using the Decision Support System for Agrotechnology Transfer (DSSAT). We first use deep RL, in particular, deep Q-network, to train management policies that require all state information from the simulator as observa- tions (denoted as full observation). We then invoke IL to train management policies that only need a limited amount of state information that can be easily obtained or measured in the real world (denoted as partial observation) by mimicking the actions of the previously RL-trained policies under full observation. We conduct experiments on a case study using maize in Florida, and compare trained policies with a maize management guideline in simulations. Our trained policies under both full and partial observations achieve better outcomes, re-sulting in a higher profit or a similar profit with a smaller en- vironmental impact. Moreover, the partial-observation management policies are directly deployable in the real world as they use readily available information.

[1]  Emilio J. Padrón,et al.  gym-DSSAT: a crop model turned into a Reinforcement Learning environment , 2022, ArXiv.

[2]  N. Hovakimyan,et al.  Optimizing Nitrogen Management with Deep Reinforcement Learning and Crop Simulations , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  S. Archontoulis,et al.  Exploring Trade-Offs Between Profit, Yield, and the Environmental Footprint of Potential Nitrogen Fertilizer Regulations in the US Midwest , 2022, Frontiers in Plant Science.

[4]  Kiran Karra,et al.  Machine Learning aided Crop Yield Optimization , 2021, ArXiv.

[5]  P. deVoil,et al.  Application, adoption and opportunities for improving decision support systems in irrigated agriculture: A review , 2021, Agricultural Water Management.

[6]  Ioannis N. Athanasiadis,et al.  CropGym: a Reinforcement Learning Environment for Crop Management , 2021, ArXiv.

[7]  Marcin Andrychowicz,et al.  Solving Rubik's Cube with a Robot Hand , 2019, ArXiv.

[8]  Patrice Dumas,et al.  Creating a sustainable food future: A menu of solutions to feed nearly 10 billion people by 2050. Final report , 2019 .

[9]  Daniel Wallach,et al.  A SIMPLE crop model , 2019, European Journal of Agronomy.

[10]  Vladlen Koltun,et al.  Deep Drone Racing: Learning Agile Flight in Dynamic Environments , 2018, CoRL.

[11]  Pieter Abbeel,et al.  An Algorithmic Perspective on Imitation Learning , 2018, Found. Trends Robotics.

[12]  Marcin Andrychowicz,et al.  Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Jiang Hu,et al.  Reinforcement Learning Control for Water-Efficient Agricultural Irrigation , 2017, 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC).

[14]  John M. Antle,et al.  Brief history of agricultural systems modeling , 2017, Agricultural systems.

[15]  Wojciech Zaremba,et al.  Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[16]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[17]  Kyunghyun Cho,et al.  Query-Efficient Imitation Learning for End-to-End Autonomous Driving , 2016, ArXiv.

[18]  Sergey Levine,et al.  Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[19]  S. Duiker,et al.  Influence of Precipitation on Maize Yield in the Eastern United States , 2015 .

[20]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  James W. Jones,et al.  How do various maize crop models vary in their responses to climate change factors? , 2014, Global change biology.

[23]  C. Tétard-Jones,et al.  The influence of organic and conventional fertilisation and crop protection practices, preceding crop, harvest year and weather conditions on yield and quality of potato ( Solanum tuberosum ) in a long-term management trial , 2013 .

[24]  James W. Jones,et al.  Uncertainty in Simulating Wheat Yields Under Climate Change , 2013 .

[25]  Michael Blümmel,et al.  Crop management factors influencing yield and quality of crop residues , 2003 .

[26]  James W. Jones,et al.  The DSSAT cropping system model , 2003 .

[27]  P. Pienaar Water for agriculture , 1987 .