Gabriel Barth-Maron

Получаване на мой собствен потребителски профил

Позовавания

	Всички	От 2019
Позовавания	4318	4135
h-индекс	14	13
i10-индекс	15	15

1500

750

375

1125

2017201820192020202120222023202419 136 250 362 501 616 892 1500

Публичен достъп

Преглед на всички

1 статия

0 статии

налични

неналични

Въз основа на изисквания при финансирането

Съавтори

Dan HorganGoogle DeepMindПотвърден имейл адрес: google.com
Matthew W. HoffmanGoogle DeepMindПотвърден имейл адрес: google.com
Nando de FreitasCIFAR & DeepMindПотвърден имейл адрес: google.com
Scott ReedResearch Scientist, NVIDIA ResearchПотвърден имейл адрес: google.com
Konrad ŻołnaResearch Scientist, DeepMindПотвърден имейл адрес: google.com
David AbelResearch Scientist, DeepMindПотвърден имейл адрес: deepmind.com
Tom Le PaineStaff Research Scientist at Google DeepMindПотвърден имейл адрес: google.com
Bobak ShahriariDeepMindПотвърден имейл адрес: google.com
Caglar GulcehreAI Researcher, Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMindПотвърден имейл адрес: google.com

Следене

Gabriel Barth-Maron

Google DeepMind

Потвърден имейл адрес: google.com

Artificial Intelligence Deep Learning Variational Inference Reinforcement Learning


Заглавие Сортиране по цитати Сортиране по година Сортиране по заглавие	Позовавания Позовавания	Година
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Distributed prioritized experience replay D Horgan, J Quan, D Budden, G Barth-Maron, M Hessel, H Van Hasselt, ... arXiv preprint arXiv:1803.00933, 2018	875	2018
A generalist agent S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 2022	773	2022
Distributed distributional deterministic policy gradients G Barth-Maron, MW Hoffman, D Budden, W Dabney, D Horgan, D Tb, ... arXiv preprint arXiv:1804.08617, 2018	626	2018
Data-efficient deep reinforcement learning for dexterous manipulation I Popov, N Heess, T Lillicrap, R Hafner, G Barth-Maron, M Vecerik, ... arXiv preprint arXiv:1704.03073, 2017	307	2017
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	239	2020
Observe and look further: Achieving consistent performance on atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018	137	2018
Making efficient use of demonstrations to solve hard exploration problems TL Paine, C Gulcehre, B Shahriari, M Denil, M Hoffman, H Soyer, ... arXiv preprint arXiv:1909.01387, 2019	91	2019
Goal-based action priors D Abel, D Hershkowitz, G Barth-Maron, S Brawner, K O'Farrell, ... Proceedings of the International Conference on Automated Planning and …, 2015	59	2015
One-shot high-fidelity imitation: Training large-scale deep nets with rl TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ... arXiv preprint arXiv:1810.05017, 2018	27	2018
Reverb: A framework for experience replay A Cassirer, G Barth-Maron, E Brevdo, S Ramos, T Boyd, T Sottiaux, ... arXiv preprint arXiv:2102.04736, 2021	24	2021
QuaRL: Quantization for fast and environmentally sustainable reinforcement learning S Krishnan, M Lam, S Chitlangia, Z Wan, G Barth-Maron, A Faust, ... arXiv preprint arXiv:1910.01055, 2019	22	2019
Launchpad: A programming model for distributed machine learning research F Yang, G Barth-Maron, P Stańczyk, M Hoffman, S Liu, M Kroiss, A Pope, ... arXiv preprint arXiv:2106.04516, 2021	21	2021
Toward affordance-aware planning D Abel, G Barth-Maron, J MacGlashan, S Tellex First Workshop on Affordances: Affordances in Vision for Cognitive Robotics, 2014	16	2014
Data-efficient reinforcement learning for continuous control tasks M Riedmiller, R Hafner, M Vecerik, TP Lillicrap, T Lampe, I Popov, ... US Patent 10,664,725, 2020	13	2020
Reinforcement learning using distributed prioritized replay D Budden, G Barth-Maron, J Quan, DG Horgan US Patent 11,625,604, 2023	9	2023
Diego de Las Casas, Andreas Fidjeland, Tim Green, Adrià Puigdomènech, Sébastien Racanière, Jack Rae, and Fabio Viola. Open sourcing Sonnet-a new library for constructing neural … M Reynolds, G Barth-Maron, F Besse	9	2017
Distributional reinforcement learning for continuous control tasks D Budden, MW Hoffman, G Barth-Maron US Patent 11,481,629, 2022	8	2022
Affordances as transferable knowledge for planning agents G Barth-Maron, D Abel, J MacGlashan, S Tellex 2014 AAAI Fall Symposium Series, 2014	7	2014
Quantized reinforcement learning (quarl) Z Wan	5	2019

Системата не може да изпълни операцията сега. Опитайте отново по-късно.

Статии 1–20

Позовавания годишно

Дублирани описания

Обединени библиографски описания

Добавяне на съавториСъавтори

Следене

Позовавания

Съавтори