no code implementations • 5 Feb 2024 • Johan Peralez, Aurélien Delage, Olivier Buffet, Jilles S. Dibangoye
A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of \citeauthor{bellman}'s principle of optimality to solve the single-player game by breaking it down into single-stage subgames.