AgInteracT: LLM-based Human-AI Teaming Simulation Environment
By Maarten Sap
To showcase AgInteracT’s usefulness towards studying human-AI teams, we propose to use our simulation environment to investigate how team composition and teammate characteristics of digital twins affect team performance in two types of military-inspired tasks: (1) Cooperative tasks, in which teammates of different backgrounds, cultures, and expertise must cooperate towards a shared goal (e.g., deciding what to do in a search and rescue mission). (2) Semi-cooperative tasks, in which teammates with different characteristics have to negotiate to achieve their own goals (e.g., job offer negotiations, game of Diplomacy).
Within these tasks, we will investigate the following research questions: (1) How can we best create digital twins that are realistic and faithful to their human counterpart? This will help us develop more faithful and realistic human digital twins. (2) How can we most accurately evaluate the performance of agents and teams using automatic methods? This will help us build better evaluation models for agents and teams, human and AI alike. (3) How do team size, composition, specific agent characteristics, and team communication strategies influence the outcomes of tasks? This will help us answer social science inspired questions to help teams better achieve their goals.