Search Results for author: Aliaksei Korshuk

Found 1 papers, 0 papers with code

Rewarding Chatbots for Real-World Engagement with Millions of Users

no code implementations • 10 Mar 2023 • Robert Irvine, Douglas Boubert, Vyas Raina, Adian Liusie, Ziyi Zhu, Vineet Mudupalli, Aliaksei Korshuk, Zongyi Liu, Fritz Cremer, Valentin Assassi, Christie-Carol Beauchamp, Xiaoding Lu, Thomas Rialan, William Beauchamp

The proposed approach uses automatic pseudo-labels collected from user interactions to train a reward model that can be used to reject low-scoring sample responses generated by the chatbot model at inference time.

Chatbot Language Modelling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.