no code implementations • 10 Mar 2024 • Yun-Ang Wu, Yun-Da Tsai, Shou-De Lin
In this study, we delve into the Thresholding Linear Bandit (TLB) problem, a nuanced domain within stochastic Multi-Armed Bandit (MAB) problems, focusing on maximizing decision accuracy against a linearly defined threshold under resource constraints.