no code implementations • 27 Sep 2022 • Chen Zhao, Kai Xing Huang, Chun Yuan
Previous conservative estimation methods are usually difficult to avoid the impact of OOD actions on Q-value estimates.
Computational Efficiency D4RL +2