Ziyu Wang
DeepMind
Verified email at google.com
Title
Cited by
Year
Taking the human out of the loop: A review of Bayesian optimization
B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas
Proceedings of the IEEE 104 (1), 148-175, 2015
5802 citations, 2015
Dueling network architectures for deep reinforcement learning
Z Wang, T Schaul, M Hessel, H Hasselt, M Lanctot, N Freitas
International conference on machine learning, 1995-2003, 2016
5355 citations, 2016
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
4901 citations, 2019
Emergence of locomotion behaviours in rich environments
N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
1164 citations, 2017
Sample efficient actor-critic with experience replay
Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ...
arXiv preprint arXiv:1611.01224, 2016
1044 citations, 2016
Bayesian optimization in a billion dimensions via random embeddings
Z Wang, F Hutter, M Zoghi, D Matheson, N De Freitas
Journal of Artificial Intelligence Research 55, 361-387, 2016
871 citations, 2016
AlphaStar: Mastering the real-time strategy game StarCraft II
O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ...
DeepMind blog 2, 20, 2019
570 citations, 2019
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
417 citations, 2020
Reinforcement and imitation learning for diverse visuomotor skills
Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ...
arXiv preprint arXiv:1802.09564, 2018
380 citations, 2018
Deep fried convnets
Z Yang, M Moczulski, M Denil, N De Freitas, A Smola, L Song, Z Wang
Proceedings of the IEEE international conference on computer vision, 1476-1483, 2015
357 citations, 2015
Learning an embedding space for transferable robot skills
K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller
International Conference on Learning Representations, 2018
353 citations, 2018
Critic regularized regression
Z Wang, A Novikov, K Zolna, JS Merel, JT Springenberg, SE Reed, ...
Advances in Neural Information Processing Systems 33, 7768-7778, 2020
339 citations, 2020
Playing hard exploration games by watching YouTube
Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas
Advances in neural information processing systems 31, 2018
320 citations, 2018
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
266 citations, 2020
Robust imitation of diverse behaviors
Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess
Advances in Neural Information Processing Systems 30, 2017
250 citations, 2017
Learning human behaviors from motion capture by adversarial imitation
J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ...
arXiv preprint arXiv:1707.02201, 2017
246 citations, 2017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International conference on machine learning, 2912-2921, 2017
242 citations, 2017
RL Unplugged: A suite of benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 7248-7259, 2020
193 citations, 2020
Hyperparameter selection for offline reinforcement learning
TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ...
arXiv preprint arXiv:2007.09055, 2020
167 citations, 2020
Bayesian optimization in AlphaGo
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
161 citations, 2018