Vivek Veeriah


Citations

	All	Since 2019
Citations	1011	745
h-index	11	10
i10-index	14	11

Citations per year

Year	Citations
2013	6
2014	9
2015	6
2016	30
2017	96
2018	104
2019	121
2020	129
2021	136
2022	154
2023	149
2024	55

Public access

5 articles available, 0 articles not available (based on funding mandates)

Co-authors

Name	Affiliation	Verified email
Satinder Singh	Google DeepMind / U. of Michigan	umich.edu
Junhyuk Oh	Research Scientist, DeepMind	google.com
Richard S. Sutton	Keen, Amii, and University of Alberta	richsutton.com
Guo-Jun Qi (齐国君), Fellow of IEEE &...	Computer Science, University of Central Florida	ucf.edu
Zhongwen Xu	Tencent	tencent.com
David Silver	DeepMind, UCL	google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	google.com
Matteo Hessel	Research Engineer, Google DeepMind	google.com
Tom Zahavy	Staff Research Scientist, Google DeepMind	deepmind.com
Richard L. Lewis	Professor of Psychology, Linguistics and Cognitive Science, University of Michigan	umich.edu
Naifan Zhuang	PhD student of Department of Computer Science, University of Central Florida	knights.ucf.edu
Janarthanan Rajendran	Assistant Professor, Faculty of Computer Science, Dalhousie University	umich.edu
Patrick M. Pilarski	University of Alberta, Amii (Alberta Machine Intelligence Institute)	ualberta.ca
Iurii Kemaev	DeepMind	deepmind.com
Alex Kearney	PhD Candidate, University of Alberta	ualberta.ca
Jaden Travnik	University of Alberta, Sony AI	ualberta.ca
Shangtong Zhang	University of Virginia	virginia.edu
Zeyu Zheng	DeepMind	deepmind.com
Ted Moskovitz	Gatsby Unit, UCL	gatsby.ucl.ac.uk
Sebastian Flennerhag	Research Scientist at DeepMind	google.com


Vivek Veeriah

Google DeepMind

Verified email: google.com

Reinforcement learning, MCTS, Artificial Intelligence, Language Models, Planning


Title	Authors	Venue	Citations	Year
Differential recurrent neural networks for action recognition	V Veeriah, N Zhuang, GJ Qi	Proceedings of the IEEE international conference on computer vision, 4041-4049, 2015	586	2015
Discovery of useful questions as auxiliary tasks	V Veeriah, M Hessel, Z Xu, J Rajendran, RL Lewis, J Oh, HP van Hasselt, ...	Advances in Neural Information Processing Systems 32, 2019	94	2019
A self-tuning actor-critic algorithm	T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ...	Advances in neural information processing systems 33, 20913-20924, 2020	78	2020
Many-goals reinforcement learning	V Veeriah, J Oh, S Singh	arXiv preprint arXiv:1806.09605, 2018	55	2018
Discovery of options via meta-learned subgoals	V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ...	Advances in Neural Information Processing Systems 34, 29861-29873, 2021	34	2021
Face valuing: Training user interfaces with facial expressions and reinforcement learning	V Veeriah, PM Pilarski, RS Sutton	arXiv preprint arXiv:1606.02807, 2016	26	2016
Robust hand gesture recognition algorithm for simple mouse control	V Veeriah, PL Swaminathan	International Journal of Computer and Communication Engineering 2 (2), 219, 2013	26	2013
Deep Learning Architecture with Dynamically Programmed Layers for Brain Connectome Prediction	V Veeriah J, R Durvasula, GJ Qi	ACM KDD 2015, 2015	22	2015
Tidbd: Adapting temporal-difference step-sizes through stochastic meta-descent	A Kearney, V Veeriah, JB Travnik, RS Sutton, PM Pilarski	arXiv preprint arXiv:1804.03334, 2018	16	2018
Reload: Reinforcement learning with optimistic ascent-descent for last-iterate convergence in constrained mdps	T Moskovitz, B O’Donoghue, V Veeriah, S Flennerhag, S Singh, T Zahavy	International Conference on Machine Learning, 25303-25336, 2023	11	2023
Learning feature relevance through step size adaptation in temporal-difference learning	A Kearney, V Veeriah, J Travnik, PM Pilarski, RS Sutton	arXiv preprint arXiv:1903.03252, 2019	11	2019
Diversifying ai: Towards creative chess with alphazero	T Zahavy, V Veeriah, S Hou, K Waugh, M Lai, E Leurent, N Tomasev, ...	arXiv preprint arXiv:2308.09175, 2023	10	2023
How Should an Agent Practice?	J Rajendran, R Lewis, V Veeriah, H Lee, S Singh	Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5454-5461, 2020	10	2020
Forward actor-critic for nonlinear function approximation in reinforcement learning	V Veeriah, H van Seijen, RS Sutton	Proceedings of the 16th Conference on Autonomous Agents and MultiAgent …, 2017	10	2017
Crossprop: Learning representations by stochastic meta-gradient descent in neural networks	V Veeriah, S Zhang, RS Sutton	Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017	9	2017
Learning state representations from random deep action-conditional predictions	Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh	Advances in Neural Information Processing Systems 34, 23679-23691, 2021	7	2021
Grasp: Gradient-based affordance selection for planning	V Veeriah, Z Zheng, R Lewis, S Singh	arXiv preprint arXiv:2202.04772, 2022	3	2022
Learning options for action selection with meta-gradients in multi-task reinforcement learning	VVJ Veeraiah, TBZ Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, ...	US Patent App. 17/918,365, 2023	1	2023
Discovery in Reinforcement Learning	V Veeriah		1	2022
Learning representations by stochastic meta-gradient descent in neural networks	V Veeriah, S Zhang, RS Sutton	arXiv preprint arXiv:1612.02879, 2016	1	2016
