Search Results for author: Kevin A. Wang

Found 3 papers, 0 papers with code

Lessons from the Trenches on Reproducible Evaluation of Language Models

no code implementations • 23 May 2024 • Stella Biderman, Hailey Schoelkopf, Lintang Sutawika, Leo Gao, Jonathan Tow, Baber Abbasi, Alham Fikri Aji, Pawan Sasanka Ammanamanchi, Sidney Black, Jordan Clive, Anthony DiPofi, Julen Etxaniz, Benjamin Fattori, Jessica Zosa Forde, Charles Foster, Mimansa Jaiswal, Wilson Y. Lee, Haonan Li, Charles Lovering, Niklas Muennighoff, Ellie Pavlick, Jason Phang, Aviya Skowron, Samson Tan, Xiangru Tang, Kevin A. Wang, Genta Indra Winata, François Yvon, Andy Zou

Third, we present the Language Model Evaluation Harness (lm-eval): an open source library for independent, reproducible, and extensible evaluation of language models that seeks to address these issues.

Language Modelling

The Update-Equivalence Framework for Decision-Time Planning

no code implementations • 25 Apr 2023 • Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown

Using this framework, we derive a provably sound search algorithm for fully cooperative games based on mirror descent and a search algorithm for adversarial games based on magnetic mirror descent.

Bayesian Opponent Modeling in Multiplayer Imperfect-Information Games

no code implementations • 12 Dec 2022 • Sam Ganzfried, Kevin A. Wang, Max Chiswick

In many real-world settings agents engage in strategic interactions with multiple opposing agents who can employ a wide variety of strategies.
