no code implementations • 18 Feb 2024 • Oluwole Fagbohun, Rachel M. Harrison, Anton Dereventsov
Due to rapid advancements in the development of Large Language Models (LLMs), programming these models with prompts has recently gained significant attention.
1 code implementation • 9 Oct 2023 • Andrew Starnes, Anton Dereventsov, Clayton Webster
In this effort, we consider the impact of regularization on the diversity of actions taken by policies generated from reinforcement learning agents trained using a policy gradient.
2 code implementations • 2 Sep 2023 • Rachel M. Harrison, Anton Dereventsov, Anton Bibin
We present a method for zero-shot recommendation of multimodal non-stationary content that leverages recent advancements in the field of generative AI.
2 code implementations • 23 Jan 2023 • Andrew Starnes, Anton Dereventsov, E. Susanne Blazek, Folasade Phillips
We establish a non-deterministic model that predicts a user's food preferences from their demographic information.
1 code implementation • 21 Nov 2022 • Anton Dereventsov, Andrew Starnes, Clayton G. Webster
This effort is focused on examining the behavior of reinforcement learning systems in personalization environments and detailing the differences in policy entropy associated with the type of learning algorithm utilized.
1 code implementation • 12 Oct 2022 • Anton Dereventsov, Anton Bibin
We propose a method for generating simulated contextual bandit environments for personalization tasks from recommendation datasets like MovieLens, Netflix, Last. fm, Million Song, etc.
2 code implementations • 24 Dec 2021 • Anton Dereventsov, Ranga Raju Vatsavai, Clayton Webster
In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals.
1 code implementation • 7 Jun 2021 • Anton Dereventsov, Joseph D. Daws Jr., Clayton Webster
We address the challenge of policy evaluation in real-world applications of reinforcement learning systems where the available historical data is limited due to ethical, practical, or security considerations.
1 code implementation • 18 Jun 2020 • Anton Dereventsov, Clayton G. Webster, Joseph D. Daws Jr
In this work, we propose a novel adaptive stochastic gradient-free (ASGF) approach for solving high-dimensional nonconvex optimization problems based on function evaluations.
2 code implementations • 15 Jan 2020 • Anton Dereventsov, Vladimir Temlyakov
We show that the following well-known algorithms for convex optimization -- the Weak Chebyshev Greedy Algorithm (co) and the Weak Greedy Algorithm with Free Relaxation (co) -- belong to this class, and introduce a new algorithm -- the Rescaled Weak Relaxed Greedy Algorithm (co).
Numerical Analysis Numerical Analysis Optimization and Control
no code implementations • 7 Oct 2019 • Armenak Petrosyan, Anton Dereventsov, Clayton Webster
In this effort, we derive a formula for the integral representation of a shallow neural network with the ReLU activation function.
1 code implementation • 24 May 2019 • Anton Dereventsov, Armenak Petrosyan, Clayton Webster
We present a greedy-based approach to construct an efficient single hidden layer neural network with the ReLU activation that approximates a target function.