Search Results for author: Perry Dong

Found 3 papers, 0 papers with code

Adaptively Learning to Select-Rank in Online Platforms

no code implementations • 7 Jun 2024 • Jingyuan Wang, Perry Dong, Ying Jin, Ruohan Zhan, Zhengyuan Zhou

We develop a user response model that considers diverse user preferences and the varying effects of item positions, aiming to optimize overall user satisfaction with the ranked list.

RLIF: Interactive Imitation Learning as Reinforcement Learning

no code implementations • 21 Nov 2023 • Jianlan Luo, Perry Dong, Yuexiang Zhai, Yi Ma, Sergey Levine

We also provide a unified framework to analyze our RL method and DAgger, for which we present the asymptotic analysis of the suboptimal gap for both methods as well as the non-asymptotic sample complexity bound of our method.

Continuous Control, Imitation Learning, +1

Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning

no code implementations • 18 Oct 2023 • Jianlan Luo, Perry Dong, Jeffrey Wu, Aviral Kumar, Xinyang Geng, Sergey Levine

We use a VQ-VAE to learn state-conditioned action quantization, avoiding the exponential blowup that comes with naïve discretization of the action space.

Offline RL, Quantization, +2
