Marc Jourdan
Finite-time
Non-Asymptotic Analysis of a UCB-based Top Two Algorithm
A Top Two sampling rule for bandit identification is a method which selects the next arm to sample from among two candidate arms, a …
Marc Jourdan, Rémy Degenne