Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning R Fruit, M Pirotta, A Lazaric, R Ortner Proceedings of the 35th International Conference on Machine Learning 80 …, 2018 | 123 | 2018 |
Near optimal exploration-exploitation in non-communicating markov decision processes R Fruit, M Pirotta, A Lazaric Advances in Neural Information Processing Systems, 2994-3004, 2018 | 51 | 2018 |
Exploration--Exploitation in MDPs with Options R Fruit, A Lazaric Proceedings of the 20th International Conference on Artificial Intelligence …, 2017 | 51 | 2017 |
Regret Minimization in MDPs with Options without Prior Knowledge R Fruit, M Pirotta, A Lazaric, E Brunskill Advances in Neural Information Processing Systems, 3166-3176, 2017 | 29 | 2017 |
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes J Qian, R Fruit, M Pirotta, A Lazaric arXiv preprint arXiv:1812.04363, 2018 | 11 | 2018 |
Regret Minimization in MDPs with Options R Fruit, M Pirotta, A Lazaric, E Brunskill | | |
Analysis of Learning and Planning with Options R Fruit, A Lazaric | | |