Search Results for author: Chinmaya Kausik

Found 5 papers, 1 papers with code

Leveraging Offline Data in Linear Latent Bandits

no code implementations27 May 2024 Chinmaya Kausik, Kevin Tan, Ambuj Tewari

One can leverage offline latent bandit data to learn a complex model for each latent state, so that an agent can simply learn the latent state online to act optimally.

Offline Policy Evaluation and Optimization under Confounding

no code implementations29 Nov 2022 Chinmaya Kausik, Yangyi Lu, Kevin Tan, Maggie Makar, Yixin Wang, Ambuj Tewari

Evaluating and optimizing policies in the presence of unobserved confounders is a problem of growing interest in offline reinforcement learning.

Offline RL Off-policy evaluation

Learning Mixtures of Markov Chains and MDPs

1 code implementation17 Nov 2022 Chinmaya Kausik, Kevin Tan, Ambuj Tewari

We present an algorithm for learning mixtures of Markov chains and Markov decision processes (MDPs) from short unlabeled trajectories.

Cannot find the paper you are looking for? You can Submit a new open access paper.