Yannick Schroecker
DeepMind
Verified email at google.com
Title
Cited by
Year
Imitating Latent Policies from Observation
AD Edwards, H Sahni, Y Schroecker, CL Isbell
International Conference on Machine Learning, 2018
Cited by: 139; Year: 2018
Bootstrapped meta-learning
S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh
arXiv preprint arXiv:2109.04504, 2021
Cited by: 66; Year: 2021
Human-timescale adaptation in an open-ended task space
AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ...
arXiv preprint arXiv:2301.07608, 2023
Cited by: 63*; Year: 2023
Generative predecessor models for sample-efficient imitation learning
Y Schroecker, M Vecerik, J Scholz
International Conference on Learning Representations, 2019
Cited by: 38; Year: 2019
Structured state space models for in-context reinforcement learning
C Lu, Y Schroecker, A Gu, E Parisotto, J Foerster, S Singh, F Behbahani
Advances in Neural Information Processing Systems 36, 2024
Cited by: 32; Year: 2024
State aware imitation learning
Y Schroecker, CL Isbell
Advances in Neural Information Processing Systems 30, 2017
Cited by: 31; Year: 2017
Discovering policies with domino: Diversity optimization maintaining near optimality
T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ...
arXiv preprint arXiv:2205.13521, 2022
Cited by: 27; Year: 2022
Active learning within constrained environments through imitation of an expert questioner
K Bullard, Y Schroecker, S Chernova
International Joint Conference on Artificial Intelligence, 2019
Cited by: 19; Year: 2019
Universal value density estimation for imitation learning and goal-conditioned reinforcement learning
Y Schroecker, C Isbell
arXiv preprint arXiv:2002.06473, 2020
Cited by: 13; Year: 2020
Directing policy search with interactively taught via-points
Y Schroecker, H Ben Amor, A Thomaz
International Conference on Autonomous Agents & Multiagent Systems, 1052-1059, 2016
Cited by: 11; Year: 2016
Meta-gradients in non-stationary environments
J Luketina, S Flennerhag, Y Schroecker, D Abel, T Zahavy, S Singh
Conference on Lifelong Learning Agents, 886-901, 2022
Cited by: 8; Year: 2022
Imitation learning using a generative predecessor neural network
M Vecerik, Y Schroecker, JK Scholz
US Patent 10,872,294, 2020
Cited by: 8; Year: 2020
Vision-language models as a source of rewards
K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ...
arXiv preprint arXiv:2312.09187, 2023
Cited by: 1; Year: 2023
Manipulating State Space Distributions for Sample-Efficient Imitation-Learning.
Y Schroecker
Georgia Institute of Technology, Atlanta, GA, USA, 2020
Cited by: 1; Year: 2020
Articles 1–14