Search Results for author: Paul Masset

Found 1 papers, 1 papers with code

Loss Dynamics of Temporal Difference Reinforcement Learning

1 code implementation NeurIPS 2023 Blake Bordelon, Paul Masset, Henry Kuo, Cengiz Pehlevan

We study how learning dynamics and plateaus depend on feature structure, learning rate, discount factor, and reward function.

reinforcement-learning

Cannot find the paper you are looking for? You can Submit a new open access paper.