no code implementations • 22 Aug 2023 • Linjian Meng, Zhenxing Ge, Wenbin Li, Bo An, Yang Gao
Recent works propose a Reward Transformation (RT) framework for MWU, which removes the uniqueness condition and achieves competitive performance with OMWU.
no code implementations • 11 Mar 2022 • Linjian Meng, Yang Gao
In this paper, we propose a generalized framework for this learning setting.