One-Step Distributional Reinforcement Learning
Published in Transactions on Machine Learning Research, 2023
co-authors: R. Alami, Y.A. Dahou Djilali, K. Fedyanin, E. Moulines
Download here
Published in Transactions on Machine Learning Research, 2023
co-authors: R. Alami, Y.A. Dahou Djilali, K. Fedyanin, E. Moulines
Download here
Published in preprint, 2023
co-authors: M.E.A. Seddik, H. Goulart, M. Debbah
Download here
Published in Deep Reinforcement Learning Workshop at NeurIPS 2022, 2022
co-authors: R. Alami, Y.A. Dahou Djilali, K. Fedyanin, E. Moulines, M. Panov
Download here
Published in preprint, 2022
Abstract. This paper introduces the checkered regression model, a nonlinear generalization of logistic regression. More precisely, this new binary classifier relies on the multivariate function $\frac{1}{2}\left( 1 + \tanh(\frac{z_1}{2})\times\dots\times\tanh(\frac{z_m}{2}) \right)$, which coincides with the usual sigmoid function in the univariate case $m=1$. While the decision boundary of logistic regression consists of a single hyperplane, our method is shown to tessellate the feature space by any given number $m\ge 1$ of hyperplanes. In order to fit the model’s parameters to some labeled data, we describe a classic empirical risk minimization framework based on the cross entropy loss. A multiclass version of our approach is also proposed.
Download here
Published in preprint, 2021
co-author: Gergely Neu
Download here
Published in Institut polytechnique de Paris, 2020
supervisors: Stephan Clémençon, Aurélien Garivier and Anne Sabourin
Download here
Published in ICMA 2020, 2020
co-authors: R. Vogel, S. Clémençon, C. Tillier
Download here
Published in ALT 2019, Chicago, USA, 2019
co-authors: A. Korba, S. Clémençon
Download here
Published in ACML 2018, Beijing, China, 2018
co-authors: S. Clémençon, A. Garivier
Download here
Published in NeurIPS 2017, Long Beach, USA, 2017
co-author: S. Clémençon
Download here
Published in ECML PKDD 2017, Skopje, Macedonia, 2017
co-authors: S. Clémençon, A. Garivier, A. Sabourin, C. Vernade
Download here