1 code implementation • 9 Mar 2024 • Anson Ho, Tamay Besiroglu, Ege Erdil, David Owen, Robi Rahman, Zifan Carl Guo, David Atkinson, Neil Thompson, Jaime Sevilla
We investigate the rate at which algorithms for pre-training language models have improved since the advent of deep learning.
no code implementations • 26 Oct 2022 • Pablo Villalobos, Jaime Sevilla, Lennart Heim, Tamay Besiroglu, Marius Hobbhahn, Anson Ho
We analyze the growth of dataset sizes used in machine learning for natural language processing and computer vision, and extrapolate these using two methods: projecting the historical growth rate forward, and estimating the compute-optimal dataset size for predicted future compute budgets.
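The two extrapolation methods can be sketched roughly as follows. This is an illustrative toy, not the paper's fitted model: the compute-optimal rule uses the common Chinchilla-style heuristics C ≈ 6·N·D and D ≈ 20·N (training FLOP, parameters, tokens), which are assumptions here, not values taken from this work.

```python
import math

def extrapolate_historical(current_tokens: float,
                           annual_growth_factor: float,
                           years: float) -> float:
    """Method 1: project dataset size forward at a fixed historical growth rate."""
    return current_tokens * annual_growth_factor ** years

def compute_optimal_tokens(compute_flop: float) -> float:
    """Method 2: compute-optimal dataset size for a given compute budget.

    Assumed heuristics (illustrative only): C = 6 * N * D and D = 20 * N,
    which combine to C = 120 * N**2, so N = sqrt(C / 120) and D = 20 * N.
    """
    n_params = math.sqrt(compute_flop / 120)
    return 20 * n_params
```

With these assumed constants, a budget of 120 FLOP maps to 1 parameter and 20 tokens, and the optimal token count scales as the square root of compute.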
no code implementations • 27 Jul 2022 • Tilman Räuker, Anson Ho, Stephen Casper, Dylan Hadfield-Menell
The last decade of machine learning has seen drastic increases in scale and capabilities.
no code implementations • 5 Jul 2022 • Pablo Villalobos, Jaime Sevilla, Tamay Besiroglu, Lennart Heim, Anson Ho, Marius Hobbhahn
From 1950 to 2018, the size of language models increased steadily by seven orders of magnitude.
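As a quick back-of-the-envelope check of what that growth rate implies (an illustration, not a figure from the paper): seven orders of magnitude over 68 years corresponds to a doubling time of roughly three years.

```python
import math

# Seven orders of magnitude over 1950-2018 (68 years):
# 10**7 = 2**k  =>  k = 7 / log10(2) ~ 23.3 doublings.
doublings = 7 / math.log10(2)
doubling_time_years = 68 / doublings  # ~2.9 years per doubling
```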
1 code implementation • 11 Feb 2022 • Jaime Sevilla, Lennart Heim, Anson Ho, Tamay Besiroglu, Marius Hobbhahn, Pablo Villalobos
Since the advent of deep learning in the early 2010s, the scaling of training compute has accelerated, doubling approximately every 6 months.
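The stated doubling time translates into multi-year growth as follows; this is a minimal sketch of the arithmetic, assuming the 6-month doubling rate holds exactly.

```python
def growth_factor(doubling_time_months: float, elapsed_months: float) -> float:
    """Total multiplicative growth given a fixed doubling time."""
    return 2 ** (elapsed_months / doubling_time_months)

# A 6-month doubling time implies a 4x increase in training compute per year,
# and about a thousand-fold (2**10) increase over five years.
per_year = growth_factor(6, 12)       # -> 4.0
per_five_years = growth_factor(6, 60) # -> 1024.0
```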