no code implementations • NeurIPS 2020 • Qi Zhou, Yufei Kuang, Zherui Qiu, Houqiang Li, Jie Wang
However, in continuous action spaces, integrating entropy regularization with expressive policies is challenging and usually requires complex inference procedures.