Следене
Olivier Pietquin
Olivier Pietquin
Cohere | ex Google DeepMind (On leave - Professor at University of Lille)
Потвърден имейл адрес: univ-lille.fr - Начална страница
Заглавие
Позовавания
Позовавания
Година
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
11632018
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
7412017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville
Advances in neural information processing systems 30, 2017
5282017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
4412017
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
3132016
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
2872019
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
2842023
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2292020
What matters for on-policy deep actor-critic methods? A large-scale empirical study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
arXiv preprint arXiv:2006.05990, 2020
215*2020
End-to-end automatic speech translation of audiobooks
A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2142018
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
2042006
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
arXiv preprint arXiv:1704.03732, 2017, 2018
1842018
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
European Conference on Speech Communication and Technologies (Interspeech'07 …, 2007
1502007
What matters for on-policy deep actor-critic methods? a large-scale study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
International conference on learning representations, 2020
1472020
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1462005
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
The knowledge engineering review 28 (1), 59-73, 2013
1372013
Observe and look further: Achieving consistent performance on atari
T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...
arXiv preprint arXiv:1805.11593, 2018
1282018
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
1232010
Algorithmic Survey of Parametric Value Function Approximation
M Geist, O Pietquin
Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
121*2013
Primal wasserstein imitation learning
R Dadashi, L Hussenot, M Geist, O Pietquin
arXiv preprint arXiv:2006.04678, 2020
1202020
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20