Search Results for author: Joe Mellor

Found 1 paper, 0 papers with code

The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates

No code implementations · 1 Mar 2018 · Henry WJ Reeve, Joe Mellor, Gavin Brown

In addition, focusing on the case of bounded rewards, we give corresponding regret bounds for the k-Nearest Neighbour KL-UCB algorithm, which is an analogue of the KL-UCB algorithm adapted to the setting of multi-armed bandits with covariates.

Multi-Armed Bandits
