Следене
Tom Erez
Tom Erez
Researcher, DeepMind
Потвърден имейл адрес: google.com - Начална страница
Заглавие
Позовавания
Позовавания
Година
Continuous control with deep reinforcement learning
TP Lillicrap, JJ Hunt, A Pritzel, N Heess, T Erez, Y Tassa, D Silver, ...
arXiv preprint arXiv:1509.02971, 2015
99382015
Mujoco: A physics engine for model-based control
E Todorov, T Erez, Y Tassa
2012 IEEE/RSJ international conference on intelligent robots and systems …, 2012
33902012
Emergence of locomotion behaviours in rich environments
N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
8062017
Synthesis and stabilization of complex behaviors through online trajectory optimization
Y Tassa, T Erez, E Todorov
2012 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2012
7252012
Learning continuous control policies by stochastic value gradients
N Heess, G Wayne, D Silver, T Lillicrap, T Erez, Y Tassa
Advances in neural information processing systems 28, 2015
5222015
Deepmind control suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ...
arXiv preprint arXiv:1801.00690, 2018
2912018
Simulation tools for model-based robotics: Comparison of bullet, havok, mujoco, ode and physx
T Erez, Y Tassa, E Todorov
2015 IEEE international conference on robotics and automation (ICRA), 4397-4404, 2015
2722015
Diego de Las Casas, David Budden, Abbas Abdolmaleki, Josh Merel, Andrew Lefrancq, et al. Deepmind control suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li
arXiv preprint arXiv:1801.00690 1, 2018
265*2018
Reinforcement and imitation learning for diverse visuomotor skills
Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ...
arXiv preprint arXiv:1802.09564, 2018
2532018
Data-efficient deep reinforcement learning for dexterous manipulation
I Popov, N Heess, T Lillicrap, R Hafner, G Barth-Maron, M Vecerik, ...
arXiv preprint arXiv:1704.03073, 2017
2422017
Receding horizon differential dynamic programming
Y Tassa, T Erez, W Smart
Advances in neural information processing systems 20, 2007
1542007
An integrated system for real-time model predictive control of humanoid robots
T Erez, K Lowrey, Y Tassa, V Kumar, S Kolev, E Todorov
2013 13th IEEE-RAS International conference on humanoid robots (Humanoids …, 2013
1512013
dm_control: Software and tasks for continuous control
S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez, J Merel, T Erez, ...
Software Impacts 6, 100022, 2020
972020
Infinite-Horizon Model Predictive Control for Periodic Tasks with Contacts
T Erez, Y Tassa, E Todorov
Robotics: Science and Systems, 2011
822011
Infinite-Horizon Model Predictive Control for Periodic Tasks with Contacts
T Erez, Y Tassa, E Todorov
Robotics: Science and Systems, 2011
822011
Trajectory optimization for domains with contacts using inverse dynamics
T Erez, E Todorov
2012 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2012
812012
Learning to perform physics experiments via deep reinforcement learning
M Denil, P Agrawal, TD Kulkarni, T Erez, P Battaglia, N De Freitas
arXiv preprint arXiv:1611.01843, 2016
782016
Least squares solutions of the HJB equation with neural network value-function approximators
Y Tassa, T Erez
IEEE transactions on neural networks 18 (4), 1031-1041, 2007
742007
A scalable method for solving high-dimensional continuous POMDPs using local approximation
T Erez, WD Smart
arXiv preprint arXiv:1203.3477, 2012
732012
Value function approximation and model predictive control
M Zhong, M Johnson, Y Tassa, T Erez, E Todorov
2013 IEEE symposium on adaptive dynamic programming and reinforcement …, 2013
612013
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20