A new convergent variant of Q-learning with linear function approximation DS Carvalho, FS Melo, PA Santos Advances in Neural Information Processing Systems 33, 2020 | 30 | 2020 |
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-agent Environment DS Carvalho, B Sengupta EPIA Conference on Artificial Intelligence, 15-26, 2022 | 1 | 2022 |
Understanding the impact of data distribution on Q-learning with function approximation PP Santos, DS Carvalho, A Sardinha, FS Melo arXiv e-prints, arXiv:2111.11758, 2021 | 1 | 2021 |
CHARET: Character-centered Approach to Emotion Tracking in Stories DS Carvalho, J Campos, M Guimarães, A Antunes, J Dias, PA Santos arXiv preprint arXiv:2102.07537, 2021 | 1 | 2021 |
Theoretical remarks on feudal hierarchies and reinforcement learning DS Carvalho, FS Melo, PA Santos 26th European Conference on Artificial Intelligence, 2023 | | 2023 |
Multi-Bellman operator for convergence of Q-learning with linear function approximation DS Carvalho, PA Santos, FS Melo arXiv preprint arXiv:2309.16819, 2023 | | 2023 |
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning PP Santos, DS Carvalho, M Vasco, A Sardinha, PA Santos, A Paiva, ... arXiv preprint arXiv:2210.06274, 2022 | | 2022 |
Q-learning with regularization converges with non-linear non-stationary features DS Carvalho, FS Melo, PA Santos | | 2022 |