Marc Jourdan

Post-Doctoral Researcher

TML lab

EPFL

Biography

I am a postdoctoral researcher in the Theory of Machine Learning lab at EPFL, working with Nicolas Flammarion on the theoretical foundations of post-training for Large Language Models. My research interests include multi-armed bandits, sequential hypothesis testing, differential privacy, reinforcement learning, imitation learning, online learning, statistics, and optimization. I focus on developing theoretically well-founded and practically applicable algorithms.

I earned my PhD in Computer Science from the University of Lille, under the supervision of Emilie Kaufmann and Rémy Degenne within the Inria Scool team. My research focused on pure exploration problems in stochastic multi-armed bandits. In my thesis, I aimed to establish the Top-Two approach as a principled methodology that offers both near-optimal theoretical guarantees and state-of-the-art empirical performance. I addressed various facets of bandit theory, including parametric and non-parametric distribution classes, structural assumptions on mean rewards, and a range of identification problems. I was deeply honored to receive the runner-up PhD Award in Artificial Intelligence from AFIA (French Association for Artificial Intelligence) and the runner-up PhD Award in Machine Learning from SSFAM (Francophone Learned Society for Machine Learning). During my PhD, I also had the opportunity to spend three months at the University of Milan, visiting Nicolò Cesa-Bianchi in the Laboratory for Artificial Intelligence and Learning Algorithms.

Before my PhD, I graduated from Ecole Polytechnique and ETH Zurich. I conducted my Master’s thesis in the Learning & Adaptive Systems group of Andreas Krause, where I studied pure exploration for combinatorial bandits with semi-bandit feedback.

Interests

Theory of Large Language Models
Multi-Armed Bandits
Reinforcement Learning
Online Learning
Statistics
Optimization

Education

PhD in Computer Science, 2021-2024
Scool (Inria) / CRIStAL (CNRS) / Univ. Lille
MSc ETH in Data Science, 2018-2020
ETH Zurich
Diplôme d'Ingénieur (MSc), 2015-2018
École Polytechnique

Publications

Marc Jourdan, Gizem Yüce, Nicolas Flammarion (2025). Learning Parametric Distributions from Samples and Preferences. ICML 2025.

Cyrille Kone, Marc Jourdan, Emilie Kaufmann (2024). Pareto Set Identification with Posterior Sampling. AISTATS 2025.

PDF Cite Code Poster

Riccardo Poiani, Marc Jourdan, Emilie Kaufmann, Rémy Degenne (2024). Best-Arm Identification in Unimodal Bandits. AISTATS 2025.

PDF Cite Code Poster

Marc Jourdan (2024). Solving Pure Exploration Problems with the Top Two Approach. Université de Lille.

PDF Cite Slides Video

Achraf Azize, Marc Jourdan, Aymen Al Marjani, Debabrota Basu (2024). Differentially Private Best-Arm Identification.

PDF Cite Code Slides

Marc Jourdan, Clémence Réda (2023). An Anytime Algorithm for Good Arm Identification.

Marc Jourdan, Rémy Degenne, Emilie Kaufmann (2023). An ε-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond. NeurIPS 2023.

PDF Cite Code Poster Slides Video

Achraf Azize, Marc Jourdan, Aymen Al Marjani, Debabrota Basu (2023). On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence. NeurIPS 2023.

PDF Cite Code Poster Slides Video

Marc Jourdan, Rémy Degenne (2023). Non-Asymptotic Analysis of a UCB-based Top Two Algorithm. NeurIPS 2023.

PDF Cite Code Poster Slides Video

Marc Jourdan, Rémy Degenne, Emilie Kaufmann (2023). Dealing with Unknown Variances in Best-Arm Identification. ALT 2023.

PDF Cite Code Slides Video

Marc Jourdan, Rémy Degenne, Dorian Baudry, Rianne de Heide, Emilie Kaufmann (2022). Top Two Algorithms Revisited. NeurIPS 2022.

PDF Cite Code Poster Slides Video

Marc Jourdan, Rémy Degenne (2022). Choosing Answers in ε-Best-Answer Identification for Linear Bandits. ICML 2022.

PDF Cite Code Poster Slides Video

Marc Jourdan, Karolis Martinkus, David Roschewitz, Martin Strohmeier (2021). I Know Where You Are Going: Predicting Flight Destinations of Corporate and State Aircraft. Eng. Proc. 2021.

PDF Cite Video DOI

Marc Jourdan, Mojmír Mutný, Johannes Kirschner, Andreas Krause (2021). Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback. ALT 2021.

PDF Cite Slides Video

Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande (2018). A Probabilistic Model of the Bitcoin Blockchain. CVPRW 2019.

Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande (2018). Characterizing Entities in the Bitcoin Blockchain. ICDMW 2018.

Experience

Post-doctoral Researcher

Theory of Machine Learning lab (EPFL)

Oct 2024 – Present Lausanne, Switzerland

Post-doctoral researcher at EPFL, working with Dr. Nicolas Flammarion.

Research Visitor

Laboratory for Artificial Intelligence and Learning Algorithms (Università degli Studi di Milano)

Apr 2024 – Jun 2024 Milan, Italy

3-months research visit to collaborate with Prof. Dr. Nicolò Cesa-Bianchi.

Research Intern

Scool (Inria Lille)

Mar 2021 – Jul 2021 Villeneuve d'Ascq, France

Bandit identification with continuous answers under the supervision of Dr. Rémy Degenne.

Master’s Thesis

Learning and Adaptive Systems (ETH Zurich)

Apr 2020 – Sep 2020 Zurich, Switzerland

Pure exploration for combinatorial semi-bandits in the group of Prof. Dr. Andreas Krause.

Part time Data Scientist

Feb 2019 – Jul 2019 Zurich, Switzerland

Created a recommender system for customers and developed models to predict churn and customer recovery.

Research Intern

IBM Singapore Lab

Apr 2018 – Aug 2018 Singapore

Characterized entities in the Bitcoin blockchain and developed a probabilistic model of its evolution.

Research Intern

STMicroelectronics

Jun 2017 – Aug 2017 Crolles, France

Implemented a quantized convolutional neural network in order to synthesize it on a electronic chip.

Talks

LAILA Seminar: Solving pure exploration problems with the Top Two approach

In pure exploration problems for stochastic multi-armed bandits, the goal is to answer a question about a set of unknown distributions …

FLAIR Seminar: Solving pure exploration problems with the Top Two approach

In pure exploration problems for stochastic multi-armed bandits, the goal is to answer a question about a set of unknown distributions …

Data Science Seminar: Solving pure exploration problems with the Top Two approach

In pure exploration problems for stochastic multi-armed bandits, the goal is to answer a question about a set of unknown distributions …

LAS Seminar: Best-Arm Identification with Top Two Algorithms

Top Two algorithms arose as an adaptation of Thompson sampling to best arm identification in multi-armed bandit models, for parametric …

StatMathAppli: Top Two Algorithms Revisited

Teaching

Statistique computationnelle

Instructor: Céline Duval and Amir Aboubacar

Machine Perception

Instructor: Otmar Hilliges

Contact

marc.jourdan@epfl.ch
EPFL IC IINFCOM TML
Bâtiment INR
1015 Lausanne
INR 111