We study the new class of urban routing games, where fleets of collaborative autonomous vehicles (CAVs) learn to make better route choice decisions in mixed urban traffic systems.
- RouteRL: Multi-Agent Reinforcement Learning framework for modeling and simulating the collective route choices of humans and autonomous vehicles - SoftwareX, Docs
- URB - Urban Routing Benchmark: Benchmarking MARL algorithms on the fleet routing tasks - NeurIPS 2025, Website, Leaderboard
In the town of Nemours inhabited only by human drivers, at some point, a given share of drivers mutate to CAVs and delegate their routing decisions to algorithms. Then, for a period of time, the CAV agents develop routing strategies to minimize their delay (e.g. using MARL). This process (both learning and new state) affects traffic and all its users (human and autonomous vehicles).
RouteRL can run this task for an arbitrary city with arbitrary demand (most likely from predefined case studies) and configuration. You may use some algorithm (own or from TorchRL) and analyze results to draw conclusions.
Then, you may compete in URB to dominate the official leaderboard with your best-performing algorithm tested across variety of tasks.
🏃♀️ In the typical use-case:
- You import road network of a given urban areas from
Open Street Map.- You generate a demand pattern, where each of agents is specified with own traits and travel demans
$(o_i, d_i, \tau_i$ ).- You control your experiment with a
.jsonfile and specify details of conducted experiment (or set of experiments).- You specify your human behaviour models to accurately reproduce how human drivers select routes.
- You generate choice set of paths for each agent to select from.
- You connect with
SUMOtraffic simulator to be used as environment to compute travel costs.- You run
$n$ days of human learning (SUMO days), hoping the system will stabilize in proximity of Wardrop User Equilibrium.- You introduce mutation and replace some human agents with
CAVs.- You determine
reinforcement learningalgorithm for each agent by defining rewards, observations and hyperparameters.- You
trainyour algorithms until it finds suitablepolicy.- You roll-out the trained policy and observe impact of new routing on the system.
- You further allow humans to adapt to actions of
CAVsand allowCAVsto refine its policies.
Complete list of available software (work-in-progress, sandboxes, discontinued projects, or side quests) is:
1. JanuX — Tool for generating a set of path options in directed graphs. Designed for efficient routing and creating path options for custom requirements.
2. GenTTP — Leading to optimal assigment by approximating SUMO with ML methods.
3. demandify — Reproduce real-world traffic congestion with synthetic demand calibration for agent-based traffic scenarios using genetic algorithms.
4. OpenURB — A benchmark for testing MARL algorithms for CAV route choice under dynamic CAV-HDV changes.
5. Coalition formation — We demonstrate (for the first time) that CAVs may form exclusive routing coalitions in traffic.
6. General Decision Model — Framework to simulate the decision process of humans that can join CAV fleet.
7. RoutingZOO — A simulation platform where virtual drivers experiment with routing strategies to navigate from origins to destinations in dense urban networks.
8. Wardropian Cycles — A concept bridging between System Optimum and User Equilibrium Assignment in a day-to-day context.
9. parcour — An early prototype version of RouteRL by Onur Akman.
10. BottleCOEX — Lightweight Simulation of coexistence of CAVs and human drivers in two-route bottleneck scenarios with a macroscopic traffic model.
🔖 For the overview of scientific contributions and societal impact see the COeXISTENCE group web page.
🫵 To collaborate mail us or see contribution guidelines at repsective repositories.
👩🎓 Prospective students, PhDs or visiting scholars welcomed - please mail Rafał Kucharski.
This project is developed within the COeXISTENCE project
(ERC Starting Grant, grant agreement No. 101075838), based at Jagiellonian University in Kraków, Poland.











