no code implementations • 1 Mar 2024 • Shengjie Wang, Shaohuai Liu, Weirui Ye, Jiacheng You, Yang Gao
We extend EfficientZero's performance to multiple domains, covering both continuous and discrete actions, as well as visual and low-dimensional inputs.
no code implementations • 4 Oct 2023 • Weirui Ye, Yunsheng Zhang, Mengchen Wang, Shengjie Wang, Xianfan Gu, Pieter Abbeel, Yang Gao
Our method tolerates the unavoidable noise in embodied foundation models.
no code implementations • 27 Mar 2023 • Xianfan Gu, Chuan Wen, Weirui Ye, Jiaming Song, Yang Gao
Imagining future trajectories is key for robots to plan soundly and reach their goals.
no code implementations • 9 Mar 2023 • Shaohuai Liu, Jinbo Liu, Weirui Ye, Nan Yang, Guanglun Zhang, Haiwang Zhong, Chongqing Kang, Qirong Jiang, Xuri Song, Fangchun Di, Yang Gao
The well-trained scheduling agent significantly reduces renewable curtailment and load shedding, which are issues arising from traditional scheduling's reliance on inaccurate day-ahead forecasts.
no code implementations • 23 Oct 2022 • Weirui Ye, Pieter Abbeel, Yang Gao
This paper proposes Virtual MCTS (V-MCTS), a variant of MCTS that adaptively spends more search time on harder states and less on simpler ones.
1 code implementation • 18 Oct 2022 • Zhao-Heng Yin, Weirui Ye, Qifeng Chen, Yang Gao
Inspired by the recent success of EfficientZero in RL, we propose EfficientImitate (EI), a planning-based imitation learning method that can achieve high in-environment sample efficiency and performance simultaneously.
3 code implementations • NeurIPS 2021 • Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao
Recently, there has been significant progress in sample-efficient image-based RL algorithms; however, consistent human-level performance on the Atari game benchmark remains an elusive goal.
Ranked #2 on Atari Games 100k (dataset: Atari 100k)