Search Results for author: Anton Dereventsov

Found 12 papers, 10 papers with code

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide

no code implementations18 Feb 2024 Oluwole Fagbohun, Rachel M. Harrison, Anton Dereventsov

Due to rapid advancements in the development of Large Language Models (LLMs), programming these models with prompts has recently gained significant attention.

Prompt Engineering

Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks

1 code implementation9 Oct 2023 Andrew Starnes, Anton Dereventsov, Clayton Webster

In this effort, we consider the impact of regularization on the diversity of actions taken by policies generated from reinforcement learning agents trained using a policy gradient.

Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging

2 code implementations2 Sep 2023 Rachel M. Harrison, Anton Dereventsov, Anton Bibin

We present a method for zero-shot recommendation of multimodal non-stationary content that leverages recent advancements in the field of generative AI.

Modeling Non-deterministic Human Behaviors in Discrete Food Choices

2 code implementations23 Jan 2023 Andrew Starnes, Anton Dereventsov, E. Susanne Blazek, Folasade Phillips

We establish a non-deterministic model that predicts a user's food preferences from their demographic information.

Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks

1 code implementation21 Nov 2022 Anton Dereventsov, Andrew Starnes, Clayton G. Webster

This effort is focused on examining the behavior of reinforcement learning systems in personalization environments and detailing the differences in policy entropy associated with the type of learning algorithm utilized.

Q-Learning reinforcement-learning +1

Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets

1 code implementation12 Oct 2022 Anton Dereventsov, Anton Bibin

We propose a method for generating simulated contextual bandit environments for personalization tasks from recommendation datasets like MovieLens, Netflix, Last. fm, Million Song, etc.

Benchmarking Multi-Armed Bandits

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

2 code implementations24 Dec 2021 Anton Dereventsov, Ranga Raju Vatsavai, Clayton Webster

In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals.

Clustering reinforcement-learning +1

Offline Policy Comparison under Limited Historical Agent-Environment Interactions

1 code implementation7 Jun 2021 Anton Dereventsov, Joseph D. Daws Jr., Clayton Webster

We address the challenge of policy evaluation in real-world applications of reinforcement learning systems where the available historical data is limited due to ethical, practical, or security considerations.

An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization

1 code implementation18 Jun 2020 Anton Dereventsov, Clayton G. Webster, Joseph D. Daws Jr

In this work, we propose a novel adaptive stochastic gradient-free (ASGF) approach for solving high-dimensional nonconvex optimization problems based on function evaluations.

Vocal Bursts Intensity Prediction

Biorthogonal Greedy Algorithms in Convex Optimization

2 code implementations15 Jan 2020 Anton Dereventsov, Vladimir Temlyakov

We show that the following well-known algorithms for convex optimization -- the Weak Chebyshev Greedy Algorithm (co) and the Weak Greedy Algorithm with Free Relaxation (co) -- belong to this class, and introduce a new algorithm -- the Rescaled Weak Relaxed Greedy Algorithm (co).

Numerical Analysis Numerical Analysis Optimization and Control

Neural network integral representations with the ReLU activation function

no code implementations7 Oct 2019 Armenak Petrosyan, Anton Dereventsov, Clayton Webster

In this effort, we derive a formula for the integral representation of a shallow neural network with the ReLU activation function.

Greedy Shallow Networks: An Approach for Constructing and Training Neural Networks

1 code implementation24 May 2019 Anton Dereventsov, Armenak Petrosyan, Clayton Webster

We present a greedy-based approach to construct an efficient single hidden layer neural network with the ReLU activation that approximates a target function.

Cannot find the paper you are looking for? You can Submit a new open access paper.