Search Results for author: Sudeep Raja Putta

Found 3 papers, 0 papers with code

Scale Free Adversarial Multi Armed Bandits

no code implementations8 Jun 2021 Sudeep Raja Putta, Shipra Agrawal

This technique plays a crucial role in our analysis for controlling the regret when using importance weighted estimators of unbounded losses.

Multi-Armed Bandits

Exponential Weights on the Hypercube in Polynomial Time

no code implementations12 Jun 2018 Sudeep Raja Putta, Abhishek Shetty

This problem is equivalent to OLO on the $\{0, 1\}^n$ hypercube.

Efficient Reinforcement Learning via Initial Pure Exploration

no code implementations7 Jun 2017 Sudeep Raja Putta, Theja Tulabandhula

Based of the scores she obtains in these practice tests, she would formulate a strategy for maximizing her scores in the actual tests.

Multi-Armed Bandits reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.