Predictability Awareness for Efficient and Robust Multi-Agent Coordination

Dept. of Cognitive Robotics, TU Delft
Pre-Print

Figure shows the results of a swapping task for predictability weights \( \lambda=0 \) and \( \lambda=5 \) to the left and right respectively. Solving a swapping task requires agents to coordinate on a passing side. Sequential planning agents don't account for how their actions impact other agents beliefs. On the other hand, agents that account for predictability use a prediction model to approximate each other's beliefs and modify their trajectories to avoid surprising others. In this manner the prediction model fosters a 'soft social convention', which uniformly biases one of the passing sides accross all agents. From the perspective of an agent this results in reduced uncertainty about the environment (more accurate predictions), facilitating coordination.

Abstract

To safely and efficiently solve motion planning problems in multi-agent settings, most approaches attempt to solve a joint optimization that explicitly accounts for the responses triggered in other agents. This often results in solutions with an exponential computational complexity, making these methods intractable for complex scenarios with many agents. While sequential predict-and-plan approaches are more scalable, they tend to perform poorly in highly interactive environments. This paper proposes a method to improve the interactive capabilities of sequential predict-and-plan methods in multi-agent navigation problems by introducing predictability as an optimization objective. We interpret predictability through the use of general prediction models, by allowing agents to predict themselves and estimate how they align with these external predictions. We formally introduce this behavior through the free-energy of the system, which reduces (under appropriate bounds) to the Kullback-Leibler divergence between plan and prediction, and use this as a penalty for unpredictable trajectories. The proposed interpretation of predictability allows agents to more robustly leverage prediction models, and fosters a ‘soft social convention’ that accelerates agreement on coordination strategies without the need of explicit high level control or communication. We show how this predictability-aware planning leads to lower-cost trajectories and reduces planning effort in a set of multi-robot problems, including autonomous driving experiments with human driver data, where we show that the benefits of considering predictability apply even when only the ego-agent uses this strategy.

Methodology

Single-Agent Experiment

Swapping Tasks

Robot-Robot Traffic Experiments

Experiments with Human Driver Data