Search Results for author: Perry Dong

Found 3 papers, 0 papers with code

Adaptively Learning to Select-Rank in Online Platforms

no code implementations7 Jun 2024 Jingyuan Wang, Perry Dong, Ying Jin, Ruohan Zhan, Zhengyuan Zhou

We develop a user response model that considers diverse user preferences and the varying effects of item positions, aiming to optimize overall user satisfaction with the ranked list.

RLIF: Interactive Imitation Learning as Reinforcement Learning

no code implementations21 Nov 2023 Jianlan Luo, Perry Dong, Yuexiang Zhai, Yi Ma, Sergey Levine

We also provide a unified framework to analyze our RL method and DAgger; for which we present the asymptotic analysis of the suboptimal gap for both methods as well as the non-asymptotic sample complexity bound of our method.

Continuous Control Imitation Learning +1

Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning

no code implementations18 Oct 2023 Jianlan Luo, Perry Dong, Jeffrey Wu, Aviral Kumar, Xinyang Geng, Sergey Levine

We use a VQ-VAE to learn state-conditioned action quantization, avoiding the exponential blowup that comes with na\"ive discretization of the action space.

Offline RL Quantization +2

Cannot find the paper you are looking for? You can Submit a new open access paper.