Character-LLM: A Trainable Agent for Role-Playing

Large language models (LLMs) can be used to serve as agents to simulate human behaviors, given the powerful ability to understand human instructions and provide high-quality generated texts. Such ability stimulates us to wonder whether LLMs can simulate a person in a higher form than simple human behaviors. Therefore, we aim to train an agent with the profile, experience, and emotional states of a specific person instead of using limited prompts to instruct ChatGPT API. In this work, we introduce Character-LLM that teach LLMs to act as specific people such as Beethoven, Queen Cleopatra, Julius Caesar, etc. Our method focuses on editing profiles as experiences of a certain character and training models to be personal simulacra with these experiences. To assess the effectiveness of our approach, we build a test playground that interviews trained agents and evaluates whether the agents \textit{memorize} their characters and experiences. Experimental results show interesting observations that help build future simulacra of humankind.

[1]  Yuke Zhu,et al.  Voyager: An Open-Ended Embodied Agent with Large Language Models , 2023, Trans. Mach. Learn. Res..

[2]  Can Xu,et al.  WizardLM: Empowering Large Language Models to Follow Complex Instructions , 2023, ArXiv.

[3]  Michael S. Bernstein,et al.  Generative Agents: Interactive Simulacra of Human Behavior , 2023, UIST.

[4]  Henrique Pondé de Oliveira Pinto,et al.  GPT-4 Technical Report , 2023, 2303.08774.

[5]  Jinyu Li,et al.  Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling , 2023, ArXiv.

[6]  Naman Goyal,et al.  LLaMA: Open and Efficient Foundation Language Models , 2023, ArXiv.

[7]  Jinyu Li,et al.  Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers , 2023, ArXiv.

[8]  Noah A. Smith,et al.  Self-Instruct: Aligning Language Models with Self-Generated Instructions , 2022, ACL.

[9]  Ronan Le Bras,et al.  SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization , 2022, EMNLP.

[10]  Gaurav Singh Tomar,et al.  Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence , 2022, EMNLP.

[11]  Michael S. Bernstein,et al.  Social Simulacra: Creating Populated Prototypes for Social Computing Systems , 2022, UIST.

[12]  Tom B. Brown,et al.  Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback , 2022, ArXiv.

[13]  Carrie J. Cai,et al.  PromptChainer: Chaining Large Language Model Prompts through Visual Programming , 2022, CHI Extended Abstracts.

[14]  Ryan J. Lowe,et al.  Training language models to follow instructions with human feedback , 2022, NeurIPS.

[15]  Pascale Fung,et al.  Survey of Hallucination in Natural Language Generation , 2022, ACM Comput. Surv..

[16]  Dale Schuurmans,et al.  Chain of Thought Prompting Elicits Reasoning in Large Language Models , 2022, NeurIPS.

[17]  Carrie J. Cai,et al.  AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts , 2021, CHI.

[18]  Andreas Vlachos,et al.  A Survey on Automated Fact-Checking , 2021, TACL.

[19]  Michael S. Bernstein,et al.  On the Opportunities and Risks of Foundation Models , 2021, ArXiv.

[20]  Rongrong Ji,et al.  HifiFace: 3D Shape and Semantic Prior Guided High Fidelity Face Swapping , 2021, IJCAI.

[21]  Wolfgang Effelsberg,et al.  Procedural Generation of Interactive Stories using Language Models , 2020, FDG.

[22]  Tom B. Brown,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[23]  Richard Socher,et al.  Evaluating the Factual Consistency of Abstractive Text Summarization , 2019, EMNLP.

[24]  Quoc Viet Hung Nguyen,et al.  Deep learning for deepfakes creation and detection: A survey , 2019, Comput. Vis. Image Underst..

[25]  Xiao Zeng,et al.  Developing game AI agent behaving like human by mixing reinforcement learning and supervised learning , 2017, 2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD).

[26]  Vadim Bulitko,et al.  Interactive Narrative: A Novel Application of Artificial Intelligence for Computer Games , 2012, AAAI.

[27]  Neil Madden,et al.  Collaborative narrative generation in persistent virtual environments , 2009, AAAI Fall Symposium: Intelligent Narrative Technologies.

[28]  Robert Michael Young,et al.  An Objective Character Believability Evaluation Procedure for Multi-agent Story Generation Systems , 2005, IVA.

[29]  John E. Laird,et al.  Human-Level AI's Killer Application: Interactive Computer Games , 2000, AI Mag..

[30]  Joseph Bates,et al.  The role of emotion in believable agents , 1994, CACM.

[31]  S. Papson,et al.  “Model” , 1981 .

[32]  Samantha Godbey It's a Good Idea , 1951 .

[33]  R. Brooks,et al.  The cog project: building a humanoid robot , 1999 .

[34]  F. Thomas,et al.  Disney Animation: The Illusion of Life , 1981 .

[35]  J. Margolis That all Men are Created Equal , 1955 .