Search Results for author: Peter Chatain

Found 3 papers, 1 papers with code

Markovian Agents for Informative Language Modeling

no code implementations • 29 Apr 2024 • Scott Viteri, Max Lamparth, Peter Chatain, Clark Barrett

We derive a "Markovian training" procedure by applying our definition of informativeness to a Markovian LM and optimizing via policy gradient and Proximal Policy Optimization (PPO).

Informativeness Language Modelling

Paper
Add Code

SuperHF: Supervised Iterative Learning from Human Feedback

1 code implementation • 25 Oct 2023 • Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti

Here, we focus on two prevalent methods used to align these models, Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).

Language Modelling

Paper
Code

Do Neural Networks Generalize from Self-Averaging Sub-classifiers in the Same Way As Adaptive Boosting?

no code implementations • 14 Feb 2023 • Michael Sun, Peter Chatain

In recent years, neural networks (NNs) have made giant leaps in a wide variety of domains.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.