1 code implementation • 13 Oct 2023 • Yash Shukla, Bharat Kesari, Shivam Goel, Robert Wright, Jivko Sinapov
We use Generative Adversarial Networks (GANs) along with a cycle-consistency loss to map the observations between the source and target domains and later use this learned mapping to clone the successful source task behavior policy to the target domain.