Mehdi Mirza

2022

Retrieval-Augmented Reinforcement Learning

A. Goyal, A. Friesen, A. Banino, T. Weber, N. R. Ke, A. P. Badia, A. Guez, M. Mirza, K. Konyushkova, M. Valko, S. Osindero, T. Lillicrap, N. Heess, C. Blundell

ICML 2022

Evaluating Model-Based Planning and Planner Amortization for Continuous Control

A. Byravan, L. Hasenclever, P. Trochim, M. Mirza, A. D. Ialongo, Y. Tassa, J. T. Springenberg, A. Abdolmaleki, N. Heess, J. Merel, M. Riedmiller

ICLR 2022

2020

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

M. Mirza, A. Jaegle, J. J. Hunt, A. Guez, S. Tunyasuvunakool, A. Muldal, T. Weber, P. Karkus, S. Racanière, L. Buesing, T. Lillicrap, N. Heess

arXiv 2020

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

P. Karkus, M. Mirza, A. Guez, A. Jaegle, T. Lillicrap, L. Buesing, N. Heess, T. Weber

arXiv 2020

Generative Adversarial Networks

I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio

Communications of the ACM, 63(11), 2020

2019

An Investigation of Model-Free Planning

A. Guez, M. Mirza, K. Gregor, R. Kabra, S. Racanière, T. Weber, D. Raposo, A. Santoro, L. Orseau, T. Eccles, G. Wayne, D. Silver, T. Lillicrap

ICML 2019

2018

Unsupervised Predictive Memory in a Goal-Directed Agent

G. Wayne, C.-C. Hung, D. Amos, M. Mirza, A. Ahuja, A. Grabska-Barwinska, J. Rae, P. Mirowski, J. Leibo, A. Santoro, M. Gemici, et al.

arXiv 2018

Optimizing Agent Behavior over Long Time Scales by Transporting Value

C.-C. Hung, T. Lillicrap, J. Abramson, Y. Wu, M. Mirza, F. Carnevale, A. Ahuja, G. Wayne

arXiv 2018

2016

Asynchronous Methods for Deep Reinforcement Learning

V. Mnih, A. P. Badia, M. Mirza, A. Graves, T. Lillicrap, T. Harley, D. Silver, K. Kavukcuoglu

ICML 2016

2014

Conditional Generative Adversarial Nets

M. Mirza, S. Osindero

arXiv 2014

For a complete list, see Google Scholar.