Search Results for author: Milad Kazemi

Found 7 papers, 0 papers with code

Conformal Off-Policy Prediction for Multi-Agent Systems

no code implementations25 Mar 2024 Tom Kuipers, Renukanandan Tumu, Shuo Yang, Milad Kazemi, Rahul Mangharam, Nicola Paoletti

In this work, we introduce MA-COPP, the first conformal prediction method to solve OPP problems involving multi-agent systems, deriving joint prediction regions for all agents' trajectories when one or more "ego" agents change their policies.

Conformal Prediction

Counterfactual Influence in Markov Decision Processes

no code implementations13 Feb 2024 Milad Kazemi, Jessica Lally, Ekaterina Tishchenko, Hana Chockler, Nicola Paoletti

Our work addresses a fundamental problem in the context of counterfactual inference for Markov Decision Processes (MDPs).

counterfactual Counterfactual Inference

Assume-Guarantee Reinforcement Learning

no code implementations15 Dec 2023 Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

We present a modular approach to \emph{reinforcement learning} (RL) in environments consisting of simpler components evolving in parallel.

reinforcement-learning Reinforcement Learning (RL)

Causal Temporal Reasoning for Markov Decision Processes

no code implementations16 Dec 2022 Milad Kazemi, Nicola Paoletti

We introduce $\textit{PCFTL (Probabilistic CounterFactual Temporal Logic)}$, a new probabilistic temporal logic for the verification of Markov Decision Processes (MDP).

counterfactual Counterfactual Reasoning +1

Compositional Reinforcement Learning for Discrete-Time Stochastic Control Systems

no code implementations6 Aug 2022 Abolfazl Lavaei, Mateo Perez, Milad Kazemi, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Majid Zamani

A key contribution is to leverage the convergence results for adversarial RL (minimax Q-learning) on finite stochastic arenas to provide control strategies maximizing the probability of satisfaction over the network of continuous-space systems.

Q-Learning reinforcement-learning +1

Data-Driven Abstraction-Based Control Synthesis

no code implementations16 Jun 2022 Milad Kazemi, Rupak Majumdar, Mahmoud Salamati, Sadegh Soudjani, Ben Wooding

The growth bound together with the sampled trajectories are then used to construct the abstraction and synthesise a controller.

Formal Policy Synthesis for Continuous-Space Systems via Reinforcement Learning

no code implementations4 May 2020 Milad Kazemi, Sadegh Soudjani

We use this procedure to guide the RL algorithm towards a policy that converges to an optimal policy under suitable assumptions on the process.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.