Diana Borsa


Citations

	All	Since 2019
Citations	1038	933
h-index	14	14
i10-index	18	18

Citations per year

Year	Citations
2014	4
2015	5
2016	17
2017	28
2018	48
2019	78
2020	134
2021	201
2022	218
2023	237
2024	65

Public access


2 articles available
0 articles not available

Based on funding mandates

Co-authors

Andre Barreto	Research Scientist, Google DeepMind	Verified email at google.com
Tom Schaul	Senior Staff Scientist, DeepMind	Verified email at nyu.edu
Rémi Munos	DeepMind	Verified email at inria.fr
David Silver	DeepMind, UCL	Verified email at google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com
Doina Precup	DeepMind and McGill University	Verified email at cs.mcgill.ca
Will Dabney	DeepMind	Verified email at google.com
Matteo Hessel	Research Engineer, Google DeepMind	Verified email at google.com
Daniel J. Mankowitz	Google Deepmind	Verified email at google.com
Ingemar J. Cox	Department of Computer Science, University College London / University of Copenhagen	Verified email at ucl.ac.uk
Elad Yom-Tov	Bar Ilan University	Verified email at yom-tov.info
Augustin Zidek	Research Engineer, DeepMind	Verified email at google.com
Nicolas Heess	DeepMind	Verified email at google.com
Anna Harutyunyan	DeepMind	Verified email at google.com
Thore Graepel	Global Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCL	Verified email at ucl.ac.uk
GHEORGHE COMANICI	Research Scientist, DeepMind	Verified email at deepmind.com
Bilal Piot	Google Deepmind	Verified email at google.com
Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
John Shawe-Taylor	UCL	Verified email at cs.ucl.ac.uk
Mark Rowland	Research Scientist, Google DeepMind	Verified email at google.com


Diana Borsa

DeepMind

Verified email at google.com

Reinforcement Learning, Machine Learning, Artificial Intelligence, Exploration


Title	Authors	Venue	Citations	Year
Transfer in deep reinforcement learning using successor features and generalised policy improvement	A Barreto, D Borsa, J Quan, T Schaul, D Silver, M Hessel, D Mankowitz, ...	International Conference on Machine Learning, 501-510, 2018	180	2018
Fast reinforcement learning with generalized policy updates	A Barreto, S Hou, D Borsa, D Silver, D Precup	Proceedings of the National Academy of Sciences 117 (48), 30079-30087, 2020	120	2020
Universal successor features approximators	D Borsa, A Barreto, J Quan, D Mankowitz, R Munos, H Van Hasselt, ...	arXiv preprint arXiv:1812.07626, 2018	118	2018
The option keyboard: Combining skills in reinforcement learning	A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ...	Advances in Neural Information Processing Systems 32, 2019	91	2019
Detecting disease outbreaks in mass gatherings using Internet data	E Yom-Tov, D Borsa, IJ Cox, RA McKendry	Journal of medical Internet research 16 (6), e154, 2014	73	2014
Observational learning by reinforcement learning	D Borsa, B Piot, R Munos, O Pietquin	arXiv preprint arXiv:1706.06617, 2017	68	2017
Ray interference: a source of plateaus in deep reinforcement learning	T Schaul, D Borsa, J Modayil, R Pascanu	arXiv preprint arXiv:1904.11455, 2019	66	2019
The termination critic	A Harutyunyan, W Dabney, D Borsa, N Heess, R Munos, D Precup	arXiv preprint arXiv:1902.09996, 2019	53	2019
Learning shared representations in multi-task reinforcement learning	D Borsa, T Graepel, J Shawe-Taylor	arXiv preprint arXiv:1603.02041, 2016	44	2016
Expected eligibility traces	H van Hasselt, S Madjiheurem, M Hessel, D Silver, A Barreto, D Borsa	Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9997 …, 2021	42	2021
Automatic identification of web-based risk markers for health events	E Yom-Tov, D Borsa, AC Hayward, RA McKendry, IJ Cox	Journal of medical Internet research 17 (1), e29, 2015	33	2015
Training deep neural nets to aggregate crowdsourced responses	A Gaunt, D Borsa, Y Bachrach	Proceedings of the Thirty-Second Conference on Uncertainty in Artificial …, 2016	32	2016
When should agents explore?	M Pislar, D Szepesvari, G Ostrovski, D Borsa, T Schaul	arXiv preprint arXiv:2108.11811, 2021	26	2021
Adapting behaviour for learning progress	T Schaul, D Borsa, D Ding, D Szepesvari, G Ostrovski, W Dabney, ...	arXiv preprint arXiv:1912.06910, 2019	15	2019
Temporal difference uncertainties as a signal for exploration	S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ...	arXiv preprint arXiv:2010.02255, 2020	14	2020
Return-based scaling: Yet another normalisation trick for deep RL	T Schaul, G Ostrovski, I Kemaev, D Borsa	arXiv preprint arXiv:2105.05347, 2021	13	2021
Conditional importance sampling for off-policy learning	M Rowland, A Harutyunyan, H Hasselt, D Borsa, T Schaul, R Munos, ...	International Conference on Artificial Intelligence and Statistics, 45-55, 2020	12	2020
General non-linear Bellman equations	H van Hasselt, J Quan, M Hessel, Z Xu, D Borsa, A Barreto	arXiv preprint arXiv:1907.03687, 2019	10	2019
Model-value inconsistency as a signal for epistemic uncertainty	A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ...	arXiv preprint arXiv:2112.04153, 2021	9	2021
Generalised policy improvement with geometric policy composition	S Thakoor, M Rowland, D Borsa, W Dabney, R Munos, A Barreto	International Conference on Machine Learning, 21272-21307, 2022	6	2022


Articles 1–20
