no code implementations • 8 Jul 2016 • Pranav Sakulkar, Bhaskar Krishnamachari
In this problem, the transmitter has to choose the transmit power based on the amount of stored energy in its battery with the goal of maximizing the average rate obtained over time.
no code implementations • 30 Apr 2016 • Pranav Sakulkar, Bhaskar Krishnamachari
Motivated by networking applications, we analyze a setting where the reward is a known non-linear function of the context and the chosen arm's current state.