Diogo S. Carvalho

20212022202320245 11 16 1

Public access

3 articles

0 articles

available

not available

Based on funding mandates

Francisco S. MeloINESC-ID / Instituto Superior TecnicoVerified email at inesc-id.pt
Pedro A. SantosInstituto Superior Técnico, INESC-ID, Universidade de LisboaVerified email at math.ist.utl.pt
Alberto SardinhaDepartamento de Informática, PUC-Rio and GAIPS, INESC-IDVerified email at inf.puc-rio.br
Manuel GuimarãesInstituto Superior Técnico, INESC-ID, GAIPS: Group for AI for People and SocietyVerified email at tecnico.ulisboa.pt
João DiasFaculty of Science and Technology, University of Algarve, CISCA, INESC-IDVerified email at ualg.pt
Pedro P. SantosINESC-ID, Instituto Superior TécnicoVerified email at tecnico.ulisboa.pt
Miguel VascoPostdoctoral Researcher KTH Royal Institute of TechnologyVerified email at kth.se

Diogo S. Carvalho

Instituto Superior Técnico, University of Lisbon, and INESC-ID

Verified email at tecnico.ulisboa.pt


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A new convergent variant of -learning with linear function approximation DS Carvalho, FS Melo, PA Santos Advances in Neural Information Processing Systems 33, 2020	30	2020
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-agent Environment DS Carvalho, B Sengupta EPIA Conference on Artificial Intelligence, 15-26, 2022	1	2022
Understanding the impact of data distribution on -learning with function approximation PP Santos, DS Carvalho, A Sardinha, FS Melo arXiv e-prints, arXiv: 2111.11758, 2021	1	2021
CHARET: Character-centered Approach to Emotion Tracking in Stories DS Carvalho, J Campos, M Guimarães, A Antunes, J Dias, PA Santos arXiv preprint arXiv:2102.07537, 2021	1	2021
Theoretical remarks on feudal hierarchies and reinforcement learning DS Carvalho, FS Melo, PA Santos 26th European Conference on Artificial Intelligence, 2023		2023
Multi-Bellman operator for convergence of -learning with linear function approximation DS Carvalho, PA Santos, FS Melo arXiv preprint arXiv:2309.16819, 2023		2023
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning PP Santos, DS Carvalho, M Vasco, A Sardinha, PA Santos, A Paiva, ... arXiv preprint arXiv:2210.06274, 2022		2022
-learning with regularization converges with non-linear non-stationary features DS Carvalho, FS Melo, PA Santos		2022

The system can't perform the operation now. Try again later.

Articles 1–8

Citations per year