Publications

(2024). Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning. Advances in Neural Information Processing Systems.

PDF Cite Code Project

(2024). Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning. arXiv preprint arXiv:2402.16801.

PDF Cite Code Project

(2023). Trust-region-free policy optimization for stochastic policies. RLDM 2022.

PDF Cite

(2023). Jaxmarl: Multi-agent rl environments in jax. arXiv preprint arXiv:2311.10090.

PDF Cite Code Project

(2022). Generalization in cooperative multi-agent systems. arXiv preprint arXiv:2202.00104.

PDF Cite

(2018). Lift: Reinforcement learning in computer systems by learning from demonstrations. arXiv preprint arXiv:1808.07903.

Cite