Article-Journal

Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning
Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning
Jaxmarl: Multi-agent rl environments in jax
Trust-region-free policy optimization for stochastic policies
Generalization in cooperative multi-agent systems
Lift: Reinforcement learning in computer systems by learning from demonstrations