Search Results for author: Alex Havrilla

Found 6 papers, 1 papers with code

Teaching Large Language Models to Reason with Reinforcement Learning

no code implementations7 Mar 2024 Alex Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Maksym Zhuravinskyi, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu

Surprisingly, we find the sample complexity of Expert Iteration is similar to that of PPO, requiring at most on the order of $10^6$ samples to converge from a pretrained checkpoint.

reinforcement-learning

Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought

no code implementations6 Feb 2024 Alex Havrilla, Maia Iyer

We then evaluate the test performance of pretrained models both prompted and fine-tuned on noised datasets with varying levels of dataset contamination and intensity.

Deep Nonparametric Estimation of Intrinsic Data Structures by Chart Autoencoders: Generalization Error and Robustness

no code implementations17 Mar 2023 Hao liu, Alex Havrilla, Rongjie Lai, Wenjing Liao

Our paper establishes statistical guarantees on the generalization error of chart autoencoders, and we demonstrate their denoising capabilities by considering $n$ noisy training samples, along with their noise-free counterparts, on a $d$-dimensional manifold.

Denoising

On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds

no code implementations25 Feb 2023 Biraj Dahal, Alex Havrilla, Minshuo Chen, Tuo Zhao, Wenjing Liao

Many existing experiments have demonstrated that generative networks can generate high-dimensional complex data from a low-dimensional easy-to-sample distribution.

Khinchin-type inequalities via Hadamard's factorisation

no code implementations18 Feb 2021 Alex Havrilla, Piotr Nayar, Tomasz Tkocz

We prove Khinchin-type inequalities with sharp constants for type L random variables and all even moments.

Probability

Cannot find the paper you are looking for? You can Submit a new open access paper.