Search Results for author: Chinmaya Kausik

Found 5 papers, 1 papers with code

Leveraging Offline Data in Linear Latent Bandits

no code implementations • 27 May 2024 • Chinmaya Kausik, Kevin Tan, Ambuj Tewari

One can leverage offline latent bandit data to learn a complex model for each latent state, so that an agent can simply learn the latent state online to act optimally.

Paper
Add Code

A Theoretical Framework for Partially Observed Reward-States in RLHF

no code implementations • 5 Feb 2024 • Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

Both of these can be instrumental in speeding up learning and improving alignment.

reinforcement-learning

Paper
Add Code

Double Descent and Overfitting under Noisy Inputs and Distribution Shift for Linear Denoisers

no code implementations • 26 May 2023 • Chinmaya Kausik, Kashvi Srivastava, Rishi Sonthalia

Motivated by this, we study supervised denoising and noisy-input regression under distribution shift.

Data Augmentation Denoising +3

Paper
Add Code

Offline Policy Evaluation and Optimization under Confounding

no code implementations • 29 Nov 2022 • Chinmaya Kausik, Yangyi Lu, Kevin Tan, Maggie Makar, Yixin Wang, Ambuj Tewari

Evaluating and optimizing policies in the presence of unobserved confounders is a problem of growing interest in offline reinforcement learning.

Offline RL Off-policy evaluation

Paper
Add Code

Learning Mixtures of Markov Chains and MDPs

1 code implementation • 17 Nov 2022 • Chinmaya Kausik, Kevin Tan, Ambuj Tewari

We present an algorithm for learning mixtures of Markov chains and Markov decision processes (MDPs) from short unlabeled trajectories.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.