Search Results for author: Xudong Yu

Found 6 papers, 1 paper with code

Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning

no code implementations • 12 May 2024 • Changhong Wang, Xudong Yu, Chenjia Bai, Qiaosheng Zhang, Zhen Wang

To address this problem, our work builds upon the investigation of successor representations for task generalization in online RL and extends the framework to incorporate offline-to-online learning.

Offline RL, Reinforcement Learning (RL) +1

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

1 code implementation • 10 May 2024 • Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang

In this paper, we propose a novel representation-based approach to measure the domain gap, where the representation is learned through a contrastive objective by sampling transitions from different domains.

reinforcement-learning

Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning

no code implementations • 9 Apr 2024 • Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang

Offline Reinforcement Learning (RL) faces distributional shift and unreliable value estimation, especially for out-of-distribution (OOD) actions.

Reinforcement Learning (RL), Uncertainty Quantification

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

no code implementations • 7 Apr 2024 • Xudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li

Sequential decision-making is desired to align with human intents and exhibit versatility across various tasks.

D4RL, Decision Making

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness

no code implementations • 29 Sep 2023 • Xiaoyu Wen, Xudong Yu, Rui Yang, Chenjia Bai, Zhen Wang

Experimental results illustrate the superiority of RO2O in facilitating stable offline-to-online learning and achieving significant improvement with limited online interactions.

Offline RL, reinforcement-learning +1

6 GHz hyperfast rotation of an optically levitated nanoparticle in vacuum

no code implementations • 17 Dec 2020 • Yuanbin Jin, Jiangwei Yan, Shah Jee Rahman, Jie Li, Xudong Yu, Jing Zhang

We measure a highest rotation frequency about 4.3 GHz of the trapped nanoparticle without feedback cooling and a 6 GHz rotation with feedback cooling, which is the fastest mechanical rotation ever reported to date.

Optics, Mesoscale and Nanoscale Physics, Quantum Physics
