Imitating Latent Policies from Observation AD Edwards, H Sahni, Y Schroecker, CL Isbell International Conference on Machine Learning, 2018 | 163 | 2018 |
Human-timescale adaptation in an open-ended task space AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ... arXiv preprint arXiv:2301.07608, 2023 | 104* | 2023 |
Bootstrapped meta-learning S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh arXiv preprint arXiv:2109.04504, 2021 | 77 | 2021 |
Structured state space models for in-context reinforcement learning C Lu, Y Schroecker, A Gu, E Parisotto, J Foerster, S Singh, F Behbahani Advances in Neural Information Processing Systems 36, 2024 | 67 | 2024 |
Discovering policies with domino: Diversity optimization maintaining near optimality T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ... arXiv preprint arXiv:2205.13521, 2022 | 40 | 2022 |
Generative predecessor models for sample-efficient imitation learning Y Schroecker, M Vecerik, J Scholz International Conference on Learning Representations, 2019 | 39 | 2019 |
State aware imitation learning Y Schroecker, CL Isbell Advances in Neural Information Processing Systems 30, 2017 | 33 | 2017 |
Active learning within constrained environments through imitation of an expert questioner K Bullard, Y Schroecker, S Chernova International Joint Conference on Artificial Intelligence, 2019 | 23 | 2019 |
Vision-language models as a source of rewards K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ... arXiv preprint arXiv:2312.09187, 2023 | 16 | 2023 |
Universal value density estimation for imitation learning and goal-conditioned reinforcement learning Y Schroecker, C Isbell arXiv preprint arXiv:2002.06473, 2020 | 14 | 2020 |
Directing policy search with interactively taught via-points Y Schroecker, H Ben Amor, A Thomaz International Conference on Autonomous Agents & Multiagent Systems, 1052-1059, 2016 | 12 | 2016 |
Meta-gradients in non-stationary environments J Luketina, S Flennerhag, Y Schroecker, D Abel, T Zahavy, S Singh Conference on Lifelong Learning Agents, 886-901, 2022 | 10 | 2022 |
Imitation learning using a generative predecessor neural network M Vecerik, Y Schroecker, JK Scholz US Patent 10,872,294, 2020 | 8 | 2020 |
Human-timescale adaptation in an open-ended task space, 2023 AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ... URL https://arxiv. org/abs/2301.07608, 0 | 5 | |
Manipulating state space distributions for sample-efficient imitation-learning YKD Schroecker Georgia Institute of Technology, 2020 | 1 | 2020 |