Marc Jourdan
Marc Jourdan
Home
Publications
Experience
Talks
Teaching
Contact
CV
Gaussian Bandits
Non-Asymptotic Analysis of a UCB-based Top Two Algorithm
A Top Two sampling rule for bandit identification is a method which selects the next arm to sample from among two candidate arms, a …
Marc Jourdan
,
Rémy Degenne
PDF
Cite
Code
Poster
Slides
Video
Dealing with Unknown Variances in Best-Arm Identification
The problem of identifying the best arm among a collection of items having Gaussian rewards distribution is well understood when the …
Marc Jourdan
,
Rémy Degenne
,
Emilie Kaufmann
PDF
Cite
Slides
Video
Cite
×