1 code implementation • 20 Sep 2023 • Tianbao Xie, Siheng Zhao, Chen Henry Wu, Yitao Liu, Qian Luo, Victor Zhong, Yanchao Yang, Tao Yu
Unlike inverse RL and recent work that uses LLMs to write sparse reward codes, Text2Reward produces interpretable, free-form dense reward codes that cover a wide range of tasks, utilize existing packages, and allow iterative refinement with human feedback.
no code implementations • 22 Sep 2021 • Naoki Yokoyama, Qian Luo, Dhruv Batra, Sehoon Ha
Recent advances in deep reinforcement learning and scalable photorealistic simulation have led to increasingly mature embodied AI for various visual tasks, including navigation.
no code implementations • 28 Apr 2021 • Zhiwei Xing, Lin Zhang, Huan Xia, Qian Luo, Zhao-xin Chen
In the existing ground support research, there has not yet been a process model that directly obtains support from the ground support log to study the causal relationship between service nodes and flight delays.
1 code implementation • 7 Dec 2020 • Qian Luo, Jing Wu, Matthew Gombolay
Learning from demonstration (LfD) is a powerful learning method to enable a robot to infer how to perform a task given one or more human demonstrations of the desired task.
Robotics
no code implementations • 6 Nov 2020 • Qian Luo, Maks Sorokin, Sehoon Ha
Therefore, learning a navigation policy for a new robot with a new sensor configuration or a new target still remains a challenging problem.