Adrià Puigdomènech Badia
DeepMind
Verified email at google.com
Title
Cited by
Year
Asynchronous methods for deep reinforcement learning
V Mnih, A Puigdomenech Badia, M Mirza, A Graves, T Lillicrap, T Harley, ...
International conference on machine learning, 1928-1937, 2016
Cited by: 7818, Year: 2016
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
Cited by: 1550, Year: 2016
Imagination-augmented agents for deep reinforcement learning
S Racanière, T Weber, D Reichert, L Buesing, A Guez, ...
Advances in neural information processing systems 30, 2017
Cited by: 554*, Year: 2017
Agent57: Outperforming the atari human benchmark
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...
International Conference on Machine Learning, 507-517, 2020
Cited by: 343, Year: 2020
Neural episodic control
A Pritzel, B Uria, S Srinivasan, AP Badia, O Vinyals, D Hassabis, ...
International Conference on Machine Learning, 2827-2836, 2017
Cited by: 295, Year: 2017
Never give up: Learning directed exploration strategies
AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
Cited by: 157, Year: 2020
Proceedings of the 33rd International Conference on Machine Learning
V Mnih, AP Badia, M Mirza, A Graves, T Lillicrap, T Harley, D Silver, ...
PMLR 48, 1928-1937, 2016
Cited by: 86, Year: 2016
Memory-based parameter adaptation
P Sprechmann, SM Jayakumar, JW Rae, A Pritzel, AP Badia, B Uria, ...
arXiv preprint arXiv:1802.10542, 2018
Cited by: 80, Year: 2018
Generalization of reinforcement learners with working and episodic memory
M Fortunato, M Tan, R Faulkner, S Hansen, A Puigdomènech Badia, ...
Advances in neural information processing systems 32, 2019
Cited by: 40, Year: 2019
Memo: A deep network for flexible combination of episodic memories
A Banino, AP Badia, R Köster, MJ Chadwick, V Zambaldi, D Hassabis, ...
arXiv preprint arXiv:2001.10913, 2020
Cited by: 28, Year: 2020
Asynchronous deep reinforcement learning
V Mnih, AP Badia, AB Graves, TJA Harley, D Silver, K Kavukcuoglu
US Patent 10,936,946, 2021
Cited by: 11, Year: 2021
Retrieval-augmented reinforcement learning
A Goyal, A Friesen, A Banino, T Weber, NR Ke, AP Badia, A Guez, ...
International Conference on Machine Learning, 7740-7765, 2022
Cited by: 6, Year: 2022
Beyond fine-tuning: Transferring behavior in reinforcement learning
V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ...
arXiv preprint arXiv:2102.13515, 2021
Cited by: 6, Year: 2021
Coverage as a principle for discovering transferable behavior in reinforcement learning
V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ...
Cited by: 6, Year: 2020
The CLRS Algorithmic Reasoning Benchmark.
P Velickovic, AP Badia, D Budden, R Pascanu, A Banino, M Dashevskiy, ...
CoRR, 2022
Cited by: 3, Year: 2022
Agent57: Outperforming the Atari Human Benchmark
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ...
arXiv preprint arXiv:2003.13350, 2020
Cited by: 3, Year: 2020
The CLRS Algorithmic Reasoning Benchmark
P Veličković, AP Badia, D Budden, R Pascanu, A Banino, M Dashevskiy, ...
arXiv preprint arXiv:2205.15659, 2022
Cited by: 1, Year: 2022
Asynchronous deep reinforcement learning
V Mnih, AP Badia, AB Graves, TJA Harley, D Silver, K Kavukcuoglu
US Patent 11,334,792, 2022
Cited by: 1, Year: 2022
Jointly learning exploratory and non-exploratory action selection policies
AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ...
US Patent App. 16/881,180, 2020
Cited by: 1, Year: 2020
Neural episodic control
B Uria-Martínez, A Pritzel, C Blundell, AP Badia
US Patent 10,664,753, 2020
Cited by: 1, Year: 2020
Articles 1–20