no code implementations • TRANSACTION 2020 • Yazhou Hu, Wenxue Wang, Hao Liu, and Lianqing Liu, Member, IEEE
In this algorithm, a reward function is first defined according to the features of tracking control in order to speed up the learning process, and an RL tracking controller with a kernel-based transition dynamics model is then proposed.
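The two ingredients named above, a tracking-oriented reward and a kernel-based transition model, can be illustrated with a minimal sketch. The excerpt does not give the paper's actual reward or kernel model, so everything here is an assumption chosen for illustration: the reward is a negative weighted squared tracking error, and the transition model is a Gaussian-kernel (Nadaraya-Watson) regressor over stored transitions. The names `tracking_reward`, `gaussian_kernel`, and `KernelTransitionModel` are hypothetical.

```python
import math


def tracking_reward(state, reference, weight=1.0):
    """Assumed reward: negative weighted squared tracking error.

    The reward is largest (zero) when the state matches the reference.
    """
    return -weight * sum((s - r) ** 2 for s, r in zip(state, reference))


def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) similarity between two equal-length vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


class KernelTransitionModel:
    """Assumed model: Nadaraya-Watson kernel regression from
    (state, action) pairs to next states. Not the paper's model."""

    def __init__(self, bandwidth=1.0):
        self.bandwidth = bandwidth
        self.inputs = []       # stored (state + action) vectors
        self.next_states = []  # observed next states

    def add(self, state, action, next_state):
        """Store one observed transition."""
        self.inputs.append(tuple(state) + tuple(action))
        self.next_states.append(tuple(next_state))

    def predict(self, state, action):
        """Predict the next state as a kernel-weighted average."""
        if not self.inputs:
            raise ValueError("no transitions stored")
        query = tuple(state) + tuple(action)
        weights = [gaussian_kernel(query, x, self.bandwidth)
                   for x in self.inputs]
        total = sum(weights)
        if total == 0.0:
            # All weights underflowed: fall back to the nearest sample.
            nearest = min(
                range(len(self.inputs)),
                key=lambda i: sum((a - b) ** 2
                                  for a, b in zip(query, self.inputs[i])),
            )
            return self.next_states[nearest]
        dim = len(self.next_states[0])
        return tuple(
            sum(w * ns[d] for w, ns in zip(weights, self.next_states)) / total
            for d in range(dim)
        )
```

In a model-based RL loop, a controller could query `predict` to roll out candidate actions and score them with `tracking_reward`. The bandwidth controls how locally the model generalizes from stored transitions and would need tuning for any real system.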