no code implementations • 9 Aug 2021 • David Tolpin, Tomer Dobkin
Reinforcement learning via inference with stochastic preferences naturally describes agent behaviors, does not require introducing rewards and exponential weighing of trajectories, and allows to reason about agents using the solid foundation of Bayesian statistics.
1 code implementation • 8 Jan 2020 • David Tolpin, Tomer Dobkin
We introduce the notion of a stochastic probabilistic program and present a reference implementation of a probabilistic programming facility supporting specification of stochastic probabilistic programs and inference in them.