Ted Moskovitz
Ted Moskovitz
Gatsby Unit, UCL
Потвърден имейл адрес: gatsby.ucl.ac.uk - Начална страница
Feedback alignment in deep convolutional networks
TH Moskovitz, A Litwin-Kumar, LF Abbott
arXiv preprint arXiv:1812.06488, 2018
Tactical Optimism and Pessimism for Deep Reinforcement Learning
T Moskovitz, J Parker-Holder, A Pacchiano, M Arbel, MI Jordan
Neural Information Processing Systems (NeurIPS) 2021, 2021
Efficient Wasserstein Natural Gradients for Reinforcement Learning
T Moskovitz*, M Arbel*, F Huszar, A Gretton
International Conference on Learning Representations (ICLR) 2021, 2020
Action prediction error: a value-free dopaminergic teaching signal that drives stable learning
F Greenstreet, HM Vergara, S Pati, L Schwarz, M Wisdom, F Marbach, ...
bioRxiv, 2022
A First-Occupancy Representation for Reinforcement Learning
T Moskovitz, SR Wilson, M Sahani
International Conference on Learning Representations (ICLR) 2022, 2021
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
T Moskovitz, B O’Donoghue, V Veeriah, S Flennerhag, S Singh, T Zahavy
International Conference on Machine Learning, 25303-25336, 2023
A comparison of deep learning and linear-nonlinear cascade approaches to neural encoding
TH Moskovitz, NA Roy, JW Pillow
BioRxiv, 463422, 2018
Towards an Understanding of Default Policies in Multitask Policy Optimization
T Moskovitz, M Arbel, J Parker-Holder, A Pacchiano
Conference on Artificial Intelligence and Statistics (AISTATS) 2022, 2021
The Transient Nature of Emergent In-context Learning in Transformers
AK Singh*, SCY Chan*, T Moskovitz, A Saxe, F Hill
NeurIPS 2023, 2023
First-Order Preconditioning via Hypergradient Descent
T Moskovitz, R Wang, J Lan, S Kapoor, T Miconi, J Yosinski, A Rawal
NeurIPS 2019, Beyond First Order Methods in ML Workshop, 2019
Confronting Reward Model Overoptimization with Constrained RLHF
T Moskovitz, AK Singh, DJ Strouse, T Sandholm, R Salakhutdinov, ...
International Conference on Learning Representations (ICLR) 2024, 2023
A unified theory of dual-process control
T Moskovitz, K Miller, M Sahani, MM Botvinick
arXiv preprint arXiv:2211.07036, 2022
Amortised Learning by Wake-Sleep
LK Wenliang, T Moskovitz, H Kanagawa, M Sahani
International Conference on Machine Learning (ICML) 2020, 2020
Understanding the functional and structural differences across excitatory and inhibitory neurons
S Minni, L Ji-An, T Moskovitz, G Lindsay, K Miller, M Dipoppa, GR Yang
bioRxiv, 680439, 2019
Minimum Description Length Control
T Moskovitz, TC Kao, M Sahani, MM Botvinick
International Conference on Learning Representations (ICLR) 2023, 2022
Do Biologically-Realistic Recurrent Architectures Produce Biologically-Realistic Models?
G Lindsay, T Moskovitz, GR Yang, K Miller
Conference on Cognitive Computational Neuroscience (CCN) 2019, 2019
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
AK Singh, T Moskovitz, F Hill, SCY Chan, AM Saxe
arXiv preprint arXiv:2404.07129, 2024
A State Representation for Diminishing Rewards
T Moskovitz, S Hromadka, A Touati, D Borsa, M Sahani
Thirty-seventh Conference on Neural Information Processing Systems, 2023
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–18