Следене
Bradly Stadie
Bradly Stadie
Assistant Professor, Northwestern
Потвърден имейл адрес: northwestern.edu - Начална страница
Заглавие
Позовавания
Позовавания
Година
One-shot imitation learning
Y Duan, M Andrychowicz, B Stadie, OAI Jonathan Ho, J Schneider, ...
Advances in neural information processing systems 30, 2017
7432017
Incentivizing exploration in reinforcement learning with deep predictive models
BC Stadie, S Levine, P Abbeel
arXiv preprint arXiv:1507.00814, 2015
5142015
Evolved policy gradients
R Houthooft, Y Chen, P Isola, B Stadie, F Wolski, OAI Jonathan Ho, ...
Advances in Neural Information Processing Systems 31, 2018
2712018
Third-person imitation learning
BC Stadie, P Abbeel, I Sutskever
arXiv preprint arXiv:1703.01703, 2017
2522017
Some considerations on learning to explore via meta-reinforcement learning
BC Stadie, G Yang, R Houthooft, X Chen, Y Duan, Y Wu, P Abbeel, ...
arXiv preprint arXiv:1803.01118, 2018
1262018
Maximum entropy gain exploration for long horizon multi-goal reinforcement learning
S Pitis, H Chan, S Zhao, B Stadie, J Ba
International Conference on Machine Learning, 7750-7761, 2020
1112020
World model as a graph: Learning latent landmarks for planning
L Zhang, G Yang, BC Stadie
International conference on machine learning, 12611-12620, 2021
692021
One-shot pruning of recurrent neural networks by jacobian spectrum evaluation
MS Zhang, B Stadie
arXiv preprint arXiv:1912.00120, 2019
372019
The importance of sampling inmeta-reinforcement learning
B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ...
Advances in Neural Information Processing Systems 31, 9280-9290, 2018
352018
Transfer learning for estimating causal effects using neural networks
SR Künzel, BC Stadie, N Vemuri, V Ramakrishnan, JS Sekhon, P Abbeel
arXiv preprint arXiv:1808.07804, 2018
342018
Learning intrinsic rewards as a bi-level optimization problem
B Stadie, L Zhang, J Ba
Conference on Uncertainty in Artificial Intelligence, 111-120, 2020
142020
To the noise and back: Diffusion for shared autonomy
T Yoneda, L Sun, B Stadie, M Walter
arXiv preprint arXiv:2302.12244, 2023
52023
Invariance through latent alignment
T Yoneda, G Yang, MR Walter, B Stadie
arXiv preprint arXiv:2112.08526, 2021
32021
Invariance through inference
T Yoneda, G Yang, M Walter, BC Stadie
32021
Estimating heterogeneous treatment effects using neural networks with the Y-Learner
BC Stadie, SR Künzel, N Vemuri, JS Sekhon
32018
Cold diffusion on the replay buffer: Learning to plan from known good states
Z Wang, T Oba, T Yoneda, R Shen, M Walter, BC Stadie
Conference on Robot Learning, 3277-3291, 2023
22023
One demonstration imitation learning
BC Stadie, S Zhao, Q Xu, B Li, L Zhang
Advances in neural information processing systems 30, 2019
12019
Simulating the stochastic dynamics and cascade failure of power networks
C Matthews, B Stadie, J Weare, M Anitescu, C Demarco
arXiv preprint arXiv:1806.02420, 2018
12018
Learning as a Sampling Problem
BC Stadie
UC Berkeley, 2018
12018
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization
L Zhang, BC Stadie
2022
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20