Research Papers

Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization

T. Pierrot | V. Macé | F. Chalumeau | A. Flajolet | G. Cideron | K. Beguir | A. Cully | O. Sigaud | N. Perrin-Gilbert

ICLR, GECCO Apr 2022

Autoregressive neural-network wavefunctions for ab initio quantum chemistry

Dr Thomas Barrett | Prof A. I. Lvovsky | Aleksei Malyshev

Nature Machine Intelligence Apr 2022

Robust and Scalable SDE Learning: a Functional Perspective

Scott Cameron | Tyron Cameron | Arnu Pretorius | Stephen Roberts

ICLR Jan 2022

One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning

C. Bonnet | P. Caron | T. Barrett | I. Davies | A. Laterre

NeurIPS Workshop 2021 Dec 2021

On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

I.S. Yusuf | K. Tessera | T. Tumiel | Z. Slim | A. Kerkeni | S. Nevo | A. Pretorius

NeurIPS Workshop 2021 Dec 2021

Causal Multi-Agent Reinforcement Learning: Review and Open Problems

S.J. Grimbly | J. Shock | A. Pretorius

NeurIPS Workshop 2021 Dec 2021

Mava: A new Framework for Distributed Multi-Agent Reinforcement Learning

A. Pretorius | K. Tessera | A.P. Smit | C. Formanek | S.J. Grimbly | K. Eloff | S. Danisa | L. Francis | J. Shock | H. Kamper | W. Brink | H. Engelbrecht | A. Laterre | K. Beguir

Jul 2021

Scaling Properties of Deep Residual Networks

A-S. Cohen | R. Cont | A. Rossier | R. Xu

ICML May 2021

Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning

M. J. Skwark | N. L. Carranza | T. Pierrot | J. Phillips | S. Said | A. Laterre | A. Kerkeni | U. Sahin | K. Beguir

NeurIPS Dec 2020

Offline Reinforcement Learning Hands-On

Louis Monier | Jakub Kmec | Alexandre Laterre | Thomas Pierrot | Valentin Courgeau | Olivier Sigaud | Karim Beguir

NeurIPS Workshop Dec 2020

A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning

A. Pretorius | S. Cameron | E. Van Biljon | T. Makkink | S. Mawjee | J. Du Plessis | J. Shock | A. Laterre | K. Beguir

NeurIPS 2020 Sep 2020

AlphaNPI-X: Learning Compositional Neural Programs for Continuous Control

T. Pierrot | N. Perrin | F. Behbahani | A. Laterre | O. Sigaud | K. Beguir | N. De Freitas

Jul 2020