11 Jun 2013 • Richard Combes, Ilham El Bouloumi, Stephane Senecal, Zwi Altman
The purpose of this paper is to develop a self-optimized association algorithm based on policy gradient reinforcement learning (PGRL) that is scalable, stable, and robust.