Yannick Schroecker
DeepMind
Verified email at google.com
Title, Cited by, Year
Imitating Latent Policies from Observation
AD Edwards, H Sahni, Y Schroecker, CL Isbell
International Conference on Machine Learning, 2018
Cited by: 163, Year: 2018
Human-timescale adaptation in an open-ended task space
AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ...
arXiv preprint arXiv:2301.07608, 2023
Cited by: 104*, Year: 2023
Bootstrapped meta-learning
S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh
arXiv preprint arXiv:2109.04504, 2021
Cited by: 77, Year: 2021
Structured state space models for in-context reinforcement learning
C Lu, Y Schroecker, A Gu, E Parisotto, J Foerster, S Singh, F Behbahani
Advances in Neural Information Processing Systems 36, 2024
Cited by: 67, Year: 2024
Discovering policies with DOMiNO: Diversity optimization maintaining near optimality
T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ...
arXiv preprint arXiv:2205.13521, 2022
Cited by: 40, Year: 2022
Generative predecessor models for sample-efficient imitation learning
Y Schroecker, M Vecerik, J Scholz
International Conference on Learning Representations, 2019
Cited by: 39, Year: 2019
State aware imitation learning
Y Schroecker, CL Isbell
Advances in Neural Information Processing Systems 30, 2017
Cited by: 33, Year: 2017
Active learning within constrained environments through imitation of an expert questioner
K Bullard, Y Schroecker, S Chernova
International Joint Conference on Artificial Intelligence, 2019
Cited by: 23, Year: 2019
Vision-language models as a source of rewards
K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ...
arXiv preprint arXiv:2312.09187, 2023
Cited by: 16, Year: 2023
Universal value density estimation for imitation learning and goal-conditioned reinforcement learning
Y Schroecker, C Isbell
arXiv preprint arXiv:2002.06473, 2020
Cited by: 14, Year: 2020
Directing policy search with interactively taught via-points
Y Schroecker, H Ben Amor, A Thomaz
International Conference on Autonomous Agents & Multiagent Systems, 1052-1059, 2016
Cited by: 12, Year: 2016
Meta-gradients in non-stationary environments
J Luketina, S Flennerhag, Y Schroecker, D Abel, T Zahavy, S Singh
Conference on Lifelong Learning Agents, 886-901, 2022
Cited by: 10, Year: 2022
Imitation learning using a generative predecessor neural network
M Vecerik, Y Schroecker, JK Scholz
US Patent 10,872,294, 2020
Cited by: 8, Year: 2020
Human-timescale adaptation in an open-ended task space, 2023
AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ...
URL https://arxiv.org/abs/2301.07608, 0
Cited by: 5
Manipulating state space distributions for sample-efficient imitation-learning
YKD Schroecker
Georgia Institute of Technology, 2020
Cited by: 1, Year: 2020
Articles 1–15