Search Results for author: Achraf Azize

Found 5 papers, 1 papers with code

How Much Does Each Datapoint Leak Your Privacy? Quantifying the Per-datum Membership Leakage

no code implementations15 Feb 2024 Achraf Azize, Debabrota Basu

We study the per-datum Membership Inference Attacks (MIAs), where an attacker aims to infer whether a fixed target datum has been included in the input dataset of an algorithm and thus, violates privacy.

Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation

no code implementations24 Dec 2023 Paul Daoudi, Mathias Formoso, Othman Gaizi, Achraf Azize, Evrard Garcelon

A precondition for the deployment of a Reinforcement Learning agent to a real-world system is to provide guarantees on the learning process.

Concentrated Differential Privacy for Bandits

no code implementations1 Sep 2023 Achraf Azize, Debabrota Basu

Next, we complement our regret upper bounds with the first minimax lower bounds on the regret of bandits with zCDP.

Multi-Armed Bandits Recommendation Systems

When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

no code implementations6 Sep 2022 Achraf Azize, Debabrota Basu

First, we prove the minimax and problem-dependent regret lower bounds for stochastic and linear bandits that quantify the hardness of bandits with $\epsilon$-global DP.

Multi-Armed Bandits

Conservative Optimistic Policy Optimization via Multiple Importance Sampling

1 code implementation4 Mar 2021 Achraf Azize, Othman Gaizi

Reinforcement Learning (RL) has been able to solve hard problems such as playing Atari games or solving the game of Go, with a unified approach.

Atari Games Game of Go +2

Cannot find the paper you are looking for? You can Submit a new open access paper.