Search Results for author: Anton Dereventsov

Found 12 papers, 10 papers with code

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide

no code implementations • 18 Feb 2024 • Oluwole Fagbohun, Rachel M. Harrison, Anton Dereventsov

Due to rapid advancements in the development of Large Language Models (LLMs), programming these models with prompts has recently gained significant attention.

Prompt Engineering

Paper
Add Code

Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks

1 code implementation • 9 Oct 2023 • Andrew Starnes, Anton Dereventsov, Clayton Webster

In this effort, we consider the impact of regularization on the diversity of actions taken by policies generated from reinforcement learning agents trained using a policy gradient.

Paper
Code

Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging

2 code implementations • 2 Sep 2023 • Rachel M. Harrison, Anton Dereventsov, Anton Bibin

We present a method for zero-shot recommendation of multimodal non-stationary content that leverages recent advancements in the field of generative AI.

Paper
Code

Modeling Non-deterministic Human Behaviors in Discrete Food Choices

2 code implementations • 23 Jan 2023 • Andrew Starnes, Anton Dereventsov, E. Susanne Blazek, Folasade Phillips

We establish a non-deterministic model that predicts a user's food preferences from their demographic information.

Paper
Code

Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks

1 code implementation • 21 Nov 2022 • Anton Dereventsov, Andrew Starnes, Clayton G. Webster

This effort is focused on examining the behavior of reinforcement learning systems in personalization environments and detailing the differences in policy entropy associated with the type of learning algorithm utilized.

Q-Learning reinforcement-learning +1

Paper
Code

Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets

1 code implementation • 12 Oct 2022 • Anton Dereventsov, Anton Bibin

We propose a method for generating simulated contextual bandit environments for personalization tasks from recommendation datasets like MovieLens, Netflix, Last. fm, Million Song, etc.

Benchmarking Multi-Armed Bandits

Paper
Code

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

2 code implementations • 24 Dec 2021 • Anton Dereventsov, Ranga Raju Vatsavai, Clayton Webster

In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals.

Clustering reinforcement-learning +1

Paper
Code

Offline Policy Comparison under Limited Historical Agent-Environment Interactions

1 code implementation • 7 Jun 2021 • Anton Dereventsov, Joseph D. Daws Jr., Clayton Webster

We address the challenge of policy evaluation in real-world applications of reinforcement learning systems where the available historical data is limited due to ethical, practical, or security considerations.

Paper
Code

An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization

1 code implementation • 18 Jun 2020 • Anton Dereventsov, Clayton G. Webster, Joseph D. Daws Jr

In this work, we propose a novel adaptive stochastic gradient-free (ASGF) approach for solving high-dimensional nonconvex optimization problems based on function evaluations.

Vocal Bursts Intensity Prediction

Paper
Code

Biorthogonal Greedy Algorithms in Convex Optimization

2 code implementations • 15 Jan 2020 • Anton Dereventsov, Vladimir Temlyakov

We show that the following well-known algorithms for convex optimization -- the Weak Chebyshev Greedy Algorithm (co) and the Weak Greedy Algorithm with Free Relaxation (co) -- belong to this class, and introduce a new algorithm -- the Rescaled Weak Relaxed Greedy Algorithm (co).

Numerical Analysis Numerical Analysis Optimization and Control

Paper
Code

Neural network integral representations with the ReLU activation function

no code implementations • 7 Oct 2019 • Armenak Petrosyan, Anton Dereventsov, Clayton Webster

In this effort, we derive a formula for the integral representation of a shallow neural network with the ReLU activation function.

Paper
Add Code

Greedy Shallow Networks: An Approach for Constructing and Training Neural Networks

1 code implementation • 24 May 2019 • Anton Dereventsov, Armenak Petrosyan, Clayton Webster

We present a greedy-based approach to construct an efficient single hidden layer neural network with the ReLU activation that approximates a target function.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.