Следене
Mastane Achab
Mastane Achab
DeepGambit
Потвърден имейл адрес: deepgambit.com - Начална страница
Заглавие
Позовавания
Позовавания
Година
Weighted empirical risk minimization: Sample selection bias correction based on importance sampling
M Achab, S Cl{\'e}men{\c{c}}on, C Tillier, R Vogel
Proceedings of the International Conference on Machine Learning, Artificial …, 2020
20*2020
Max K-Armed Bandit: On the ExtremeHunter Algorithm and Beyond
M Achab, S Clémençon, A Garivier, A Sabourin, C Vernade
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017
172017
Profitable bandits
M Achab, S Clémençon, A Garivier
Asian Conference on Machine Learning, 694-709, 2018
122018
Ranking data with continuous labels through oriented recursive partitions
S Clémençon, M Achab
Advances in Neural Information Processing Systems, 4600-4608, 2017
112017
Ranking and risk-aware reinforcement learning| Theses. fr
M Achab
Institut polytechnique de Paris, 2020
7*2020
Dimensionality Reduction and (Bucket) Ranking: a Mass Transportation Approach
M Achab, A Korba, S Clémençon
Algorithmic Learning Theory, 64-93, 2019
42019
One-Step Distributional Reinforcement Learning
M Achab, R Alami, YA Dahou Djilali, K Fedyanin, E Moulines
Transactions on Machine Learning Research, 2023
32023
Distributional deep Q-learning with CVaR regression
M Achab, R Alami, YAD Djilali, K Fedyanin, E Moulines, M Panov
Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
32022
Robustness and risk management via distributional dynamic programming
M Achab, G Neu
arXiv preprint arXiv:2112.15430, 2021
32021
Investigating Regularization of Self-Play Language Models
R Alami, A Abubaker, M Achab, MEA Seddik, S Lahlou
arXiv preprint arXiv:2404.04291, 2024
12024
A Nested Matrix-Tensor Model for Noisy Multi-view Clustering
MEA Seddik, M Achab, H Goulart, M Debbah
arXiv preprint arXiv:2305.19992, 2023
12023
A Bregman firmly nonexpansive proximal operator for baryconvex optimization
M Achab
arXiv preprint arXiv:2411.00928, 2024
2024
A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits
R Alami, M Mahfoud, M Achab
MAB-KD Workshop ICDM 2023, 2023
2023
Deep Reinforcement Learning Algorithms for Hybrid V2X Communication: A Benchmarking Study
F Boukhalfa, R Alami, M Achab, E Moulines, M Bennis
https://arxiv.org/abs/2310.03767, 2023
2023
Beyond Log-Concavity: Theory and Algorithm for Sum-Log-Concave Optimization
M Achab
https://arxiv.org/abs/2309.15298, 2023
2023
Checkered Regression
M Achab
TechRxiv preprint, 2022
2022
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–16