Joel Z Leibo

Получаване на мой собствен потребителски профил

Позовавания

	Всички	От 2019
Позовавания	13452	11524
h-индекс	41	36
i10-индекс	66	56

2700

1350

675

2025

20132014201520162017201820192020202120222023202462 84 92 155 435 890 1300 1743 2130 2262 2696 1373

Публичен достъп

Преглед на всички

10 статии

1 статия

налични

неналични

Въз основа на изисквания при финансирането

Съавтори

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLПотвърден имейл адрес: ucl.ac.uk
TOMASO POGGIOMcDermott Professor in Brain Sciences, MITПотвърден имейл адрес: ai.mit.edu
Edward HughesStaff Research Engineer, DeepMindПотвърден имейл адрес: google.com
Marc LanctotResearch Scientist, Google DeepMindПотвърден имейл адрес: google.com
Edgar A. Duéñez-GuzmánGoogle DeepMindПотвърден имейл адрес: oeb.harvard.edu
Karl TuylsFounder at H company, ex-Google DeepMind, Prof at University of LiverpoolПотвърден имейл адрес: hcompany.ai
Wojciech Marian Czarnecki.Потвърден имейл адрес: google.com
Matthew BotvinickGoogle DeepMind, Yale Law School, University College LondonПотвърден имейл адрес: google.com
Charlie BeattieSoftware Engineer, DeepMindПотвърден имейл адрес: google.com
Peter SunehagGoogle - DeepMindПотвърден имейл адрес: google.com
Tom SchaulSenior Staff Scientist, DeepMindПотвърден имейл адрес: nyu.edu
Kevin R. McKeeStaff Research Scientist, Google DeepMindПотвърден имейл адрес: deepmind.com
Raphael KösterGoogle DeepMindПотвърден имейл адрес: google.com
Audrūnas GruslysПотвърден имейл адрес: gruslys.com
Jane X. WangStaff Research Scientist, DeepMindПотвърден имейл адрес: google.com
Max JaderbergChief AI Scientist, Isomorphic LabsПотвърден имейл адрес: robots.ox.ac.uk
Fabio AnselmiAssistant professor at University of Trieste, MIT affiliateПотвърден имейл адрес: units.it
Vinicius ZambaldiGoogle DeepmindПотвърден имейл адрес: google.com
Dharshan KumaranGoogle DeepMindПотвърден имейл адрес: fil.ion.ucl.ac.uk
Zeb Kurth-NelsonDeepMind, UCLПотвърден имейл адрес: google.com

Следене

Joel Z Leibo

Research scientist

Потвърден имейл адрес: google.com - Начална страница

Cooperation in AI & Neuroscience Multi-Agent Reinforcement Learning Machine Learning


Заглавие Сортиране по цитати Сортиране по година Сортиране по заглавие	Позовавания Позовавания	Година
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017	1703	2017
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1389*	2018
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1388	2016
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1065	2016
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019	945	2019
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017	883	2017
Prefrontal cortex as a meta-reinforcement learning system JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ... Nature neuroscience 21 (6), 860-868, 2018	634	2018
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016	601	2016
Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019	525	2019
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016	296	2016
The dynamics of invariant object recognition in the human visual system L Isik, EM Meyers, JZ Leibo, T Poggio Journal of neurophysiology 111 (1), 91-102, 2014	279	2014
Using fast weights to attend to the recent past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances in neural information processing systems 29, 2016	268	2016
Inequity aversion improves cooperation in intertemporal social dilemmas E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ... Advances in neural information processing systems 31, 2018	246	2018
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in neural information processing systems 30, 2017	217	2017
Open problems in cooperative ai A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ... arXiv preprint arXiv:2012.08630, 2020	197	2020
Unsupervised predictive memory in a goal-directed agent G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ... arXiv preprint arXiv:1803.10760, 2018	196	2018
Emergent communication through negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	184	2018
How important is weight symmetry in backpropagation? Q Liao, J Leibo, T Poggio Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	177	2016
Unsupervised learning of invariant representations F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio Theoretical Computer Science 633, 112-121, 2016	144	2016
Kickstarting deep reinforcement learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... arXiv preprint arXiv:1803.03835, 2018	143	2018

Системата не може да изпълни операцията сега. Опитайте отново по-късно.

Статии 1–20

Позовавания годишно

Дублирани описания

Обединени библиографски описания

Добавяне на съавториСъавтори

Следене

Позовавания

Съавтори