Finbarr Timbers

Получаване на мой собствен потребителски профил

Позовавания

	Всички	От 2019
Позовавания	789	758
h-индекс	12	12
i10-индекс	13	13

280

140

210

2016201720182019202020212022202320249 5 14 40 67 103 163 274 107

Съавтори

Marc LanctotResearch Scientist, Google DeepMindПотвърден имейл адрес: google.com
Martin SchmidGoogle DeepMindПотвърден имейл адрес: kam.mff.cuni.cz
Michael BowlingUniversity of AlbertaПотвърден имейл адрес: ualberta.ca
Edward LockhartDeepMindПотвърден имейл адрес: deepmind.com
Neil BurchSony AI & Alberta Machine Intelligence Institute, University of AlbertaПотвърден имейл адрес: ualberta.ca
Karl TuylsResearch Scientist, ex-Google DeepMind and Professor of computer science, University of LiverpoolПотвърден имейл адрес: google.com
julien perolatDeepMindПотвърден имейл адрес: google.com
Nolan BardSenior Research Scientist, Sony AIПотвърден имейл адрес: ualberta.ca
G. Zacharias HollandUniversity of AlbertaПотвърден имейл адрес: ualberta.ca
Dustin MorrillSony AIПотвърден имейл адрес: sony.com

Следене

Finbarr Timbers

Midjourney

Потвърден имейл адрес: midjourney.com - Начална страница

Reinforcement learning


Заглавие Сортиране по цитати Сортиране по година Сортиране по заглавие	Позовавания Позовавания	Година
OpenSpiel: A framework for reinforcement learning in games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019	242	2019
Mastering the game of stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	146	2022
Detecting duplicate bug reports with software engineering domain knowledge K Aggarwal, F Timbers, T Rutgers, A Hindle, E Stroulia, R Greiner Journal of Software: Evolution and Process 29 (3), e1821, 2017	114	2017
Computing approximate equilibria in sequential adversarial games by exploitability descent E Lockhart, M Lanctot, J Pérolat, JB Lespiau, D Morrill, F Timbers, K Tuyls arXiv preprint arXiv:1903.05614, 2019	74	2019
Player of games M Schmid, M Moravcik, N Burch, R Kadlec, J Davidson, K Waugh, N Bard, ... arXiv preprint arXiv:2112.03178, 2021	49	2021
Approximate exploitability: Learning a best response in large games F Timbers, N Bard, E Lockhart, M Lanctot, M Schmid, N Burch, ... arXiv preprint arXiv:2004.09677, 2020	36	2020
The advantage regret-matching actor-critic A Gruslys, M Lanctot, R Munos, F Timbers, M Schmid, J Perolat, D Morrill, ... arXiv preprint arXiv:2008.12234, 2020	24	2020
OpenSpiel: a framework for reinforcement learning in games. CoRR abs/1908.09453 (2019) M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019	23	2019
Emergent bartering behaviour in multi-agent reinforcement learning MB Johanson, E Hughes, F Timbers, JZ Leibo arXiv preprint arXiv:2205.06760, 2022	21	2022
Reward-respecting subtasks for model-based reinforcement learning RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Artificial Intelligence 324, 104001, 2023	18	2023
Solving common-payoff games with approximate policy iteration S Sokota, E Lockhart, F Timbers, E Davoodi, R D'Orazio, N Burch, ... Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9695-9703, 2021	14	2021
Fast computation of Nash equilibria in imperfect information games R Munos, J Perolat, JB Lespiau, M Rowland, B De Vylder, M Lanctot, ... International Conference on Machine Learning, 7119-7129, 2020	12	2020
Bug reports dataset A Alipour, A Hindle, T Rutgers, R Dawson, F Timbers, K Aggarwal Dedup, 2013	11	2013
Student of Games: A unified learning algorithm for both perfect and imperfect information games M Schmid, M Moravčík, N Burch, R Kadlec, J Davidson, K Waugh, N Bard, ... Science Advances 9 (46), eadg3256, 2023	5	2023
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint) RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22713 …, 2024		2024
Detecting Duplicate Bug Reports using a Hierarchy of Domain Knowledge Contexts RG K. Aggarwal, A. Hindle, F. Timbers, E. Stroulia, T. Rutgers Journal of Software: Evolution and Process 29 (3), 2017		2017

Системата не може да изпълни операцията сега. Опитайте отново по-късно.

Статии 1–16

Позовавания годишно

Дублирани описания

Обединени библиографски описания

Добавяне на съавториСъавтори

Следене

Позовавания

Съавтори