no code implementations • 26 Feb 2024 • Kellen Kanarios, Qining Zhang, Lei Ying
In this paper, we study a best arm identification problem with dual objectives.
no code implementations • NeurIPS 2023 • Qining Zhang, Lei Ying
This paper considers a stochastic Multi-Armed Bandit (MAB) problem with dual objectives: (i) quick identification and commitment to the optimal arm, and (ii) reward maximization throughout a sequence of $T$ consecutive rounds.
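The tension between the two objectives can be illustrated with a classic explore-then-commit baseline: a longer exploration phase makes the committed arm more reliable but delays commitment and costs reward. The following is a minimal sketch only, not the algorithm proposed in the paper; the Bernoulli reward model, the function name `explore_then_commit`, and its parameters are all assumptions made for illustration.

```python
import random


def explore_then_commit(means, horizon, pulls_per_arm, seed=0):
    """Pull every arm `pulls_per_arm` times, then commit to the empirical best.

    `means` are hypothetical Bernoulli success probabilities (an assumption
    for this sketch, not taken from the paper).
    Returns (committed_arm, commit_round, total_reward).
    """
    rng = random.Random(seed)
    num_arms = len(means)
    if num_arms * pulls_per_arm > horizon:
        raise ValueError("exploration budget exceeds the horizon")

    reward_sums = [0.0] * num_arms
    total_reward = 0.0

    # Exploration phase: sample the arms in round-robin order.
    for _ in range(pulls_per_arm):
        for arm in range(num_arms):
            reward = 1.0 if rng.random() < means[arm] else 0.0
            reward_sums[arm] += reward
            total_reward += reward

    # Commit to the arm with the highest empirical reward sum
    # (equivalent to the highest empirical mean, since pull counts are equal).
    commit_round = num_arms * pulls_per_arm
    committed_arm = max(range(num_arms), key=lambda a: reward_sums[a])

    # Commitment phase: play the chosen arm for all remaining rounds.
    for _ in range(horizon - commit_round):
        total_reward += 1.0 if rng.random() < means[committed_arm] else 0.0

    return committed_arm, commit_round, total_reward
```

Calling `explore_then_commit([0.4, 0.6], horizon=1000, pulls_per_arm=50)` returns the committed arm, the round at which commitment happened, and the cumulative reward, which makes the trade-off between objectives (i) and (ii) visible as `pulls_per_arm` varies.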