no code implementations • 3 Nov 2023 • Jonathan Colaço Carr, Prakash Panangaden, Doina Precup
Current results guaranteeing the existence of optimal policies in LfPF problems assume that both the preferences and transition dynamics are determined by a Markov Decision Process.