FlowMind: Automatic Workflow Generation with LLMs

The rapidly evolving field of Robotic Process Automation (RPA) has made significant strides in automating repetitive processes, yet its effectiveness diminishes when users demand spontaneous or unpredictable tasks. This paper introduces FlowMind, a novel approach that leverages the capabilities of Large Language Models (LLMs), such as the Generative Pre-trained Transformer (GPT), to address this limitation and create an automatic workflow generation system. In FlowMind, we propose a generic lecture prompt recipe that helps ground LLM reasoning in reliable Application Programming Interfaces (APIs). With this, FlowMind not only mitigates the common issue of hallucinations in LLMs, but also eliminates direct interaction between LLMs and proprietary data or code, thus ensuring the integrity and confidentiality of information, a cornerstone in financial services. FlowMind further simplifies user interaction by presenting high-level descriptions of auto-generated workflows, enabling users to inspect them and provide feedback effectively. We also introduce NCEN-QA, a new finance dataset for benchmarking question-answering tasks on N-CEN reports about funds. We use NCEN-QA to evaluate the workflows generated by FlowMind against baselines and ablation variants of FlowMind. We demonstrate the success of FlowMind, the importance of each component in the proposed lecture recipe, and the effectiveness of user interaction and feedback in FlowMind.
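
As a rough illustration of the idea of a lecture prompt grounded in APIs, the sketch below assembles a prompt from a task context and a catalogue of trusted API descriptions. It is a minimal sketch under stated assumptions, not the recipe used in the paper: the `ApiSpec` type, the `build_lecture_prompt` function, the `get_report` API, and the exact prompt wording are all hypothetical. Only API descriptions are placed in the prompt, never the underlying data, which is the property the abstract highlights.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ApiSpec:
    """Description of one trusted API exposed to the LLM (hypothetical type)."""
    name: str
    signature: str
    description: str


def build_lecture_prompt(context: str, apis: list[ApiSpec]) -> str:
    """Assemble a lecture-style prompt: context, API catalogue, coding instruction.

    The wording is an assumption made for illustration; the LLM only ever
    sees API signatures and descriptions, not proprietary data or code.
    """
    api_lines = [f"- {api.signature}: {api.description}" for api in apis]
    return "\n".join([
        f"Context: {context}",
        "You may only use the following APIs:",
        *api_lines,
        "When the user asks a question, write Python code that answers it "
        "by calling these APIs. Do not assume access to any other data.",
    ])


# Hypothetical API catalogue, for illustration only.
EXAMPLE_APIS = [
    ApiSpec(
        name="get_report",
        signature="get_report(fund_name: str) -> dict",
        description="returns the N-CEN report fields for the named fund",
    ),
]

prompt = build_lecture_prompt("Questions about fund N-CEN reports.", EXAMPLE_APIS)
print(prompt)
```

In a setup like this, the generated code would be executed by the host system against the real APIs, so the model's output can be inspected as a workflow before any data is touched.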
