no code implementations • 1 Mar 2024 • Yifan Lin, Yuhao Wang, Enlu Zhou
The efficient utilization of historical trajectories obtained from previous policies is essential for expediting policy optimization.
no code implementations • 26 Jan 2023 • Yifan Lin, Enlu Zhou
We consider infinite-horizon Markov Decision Processes where parameters, such as transition probabilities, are unknown and estimated from data.
no code implementations • 24 Jun 2022 • Yifan Lin, Yuhao Wang, Enlu Zhou
In particular, we consider mean-variance as the risk criterion, and the best arm is the one with the largest mean-variance reward.
no code implementations • 14 Jun 2022 • Jun Li, Yifan Lin, Nan Xie
This technology could shift the measured signal frequency band from near 50 Hz to several kilohertz, so that the output signal avoids interference from low-frequency temperature drift, stress birefringence, and vibration, leading to higher stability and reliability.
no code implementations • 4 Jun 2021 • Yifan Lin, Yuxuan Ren, Enlu Zhou
We consider finite-horizon Markov Decision Processes where parameters, such as transition probabilities, are unknown and estimated from data.