Search Results for author: Rosie Zhao

Found 7 papers, 2 papers with code

Feature emergence via margin maximization: case studies in algebraic tasks

no code implementations • 13 Nov 2023 • Depen Morwani, Benjamin L. Edelman, Costin-Andrei Oncescu, Rosie Zhao, Sham Kakade

Understanding the internal representations learned by neural networks is a cornerstone challenge in the science of machine learning.

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

no code implementations • 14 Jun 2023 • Nikhil Vyas, Depen Morwani, Rosie Zhao, Gal Kaplun, Sham Kakade, Boaz Barak

The success of SGD in deep learning has been ascribed by prior works to the implicit bias induced by high learning rate or small batch size ("SGD noise").

Policy Gradient Methods in the Presence of Symmetries and State Abstractions

2 code implementations • 9 May 2023 • Prakash Panangaden, Sahand Rezaei-Shoshtari, Rosie Zhao, David Meger, Doina Precup

Our policy gradient results allow for leveraging approximate symmetries of the environment for policy optimization.

Continuous Control, Policy Gradient Methods, +2

Loss of Plasticity in Continual Deep Reinforcement Learning

no code implementations • 13 Mar 2023 • Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado

The ability to learn continually is essential in a complex and changing world.

Atari Games, Continual Learning, +2

Continuous MDP Homomorphisms and Homomorphic Policy Gradient

1 code implementation • 15 Sep 2022 • Sahand Rezaei-Shoshtari, Rosie Zhao, Prakash Panangaden, David Meger, Doina Precup

Abstraction has been widely studied as a way to improve the efficiency and generalization of reinforcement learning algorithms.

Continuous Control, Policy Gradient Methods, +2

Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management

no code implementations • EACL (AdaptNLP) 2021 • Mikael Brunila, Rosie Zhao, Andrei Mircea, Sam Lumley, Renee Sieber

Social media such as Twitter provide valuable information to crisis managers and affected people during natural disasters.

Domain Adaptation, Management, +1

A Study of Policy Gradient on a Class of Exactly Solvable Models

no code implementations • 3 Nov 2020 • Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden, Doina Precup

Policy gradient methods are extensively used in reinforcement learning as a way to optimize expected return.

Policy Gradient Methods
