no code implementations • 29 Aug 2022 • Shanelle G. Clarke, Omanshu Thapliyal, Inseok Hwang
In this paper, we present a provably convergent Model-Free ${Q}$-Learning algorithm that learns a stabilizing control policy for an unknown Bilinear System from a single online run.