no code implementations • 3 Jul 2023 • Bo Jiang, Tianchi Zhao, Ming Li
This paper investigates the problem of regret minimization for multi-armed bandit (MAB) problems with local differential privacy (LDP) guarantee.
Thompson Sampling