Search Results for author: Wenzhuo Zhou

Found 7 papers, 0 papers with code

Stage-Aware Learning for Dynamic Treatments

no code implementations • 30 Oct 2023 • Hanwen Ye, Wenzhuo Zhou, Ruoqing Zhu, Annie Qu

In particular, the proposed learning scheme builds a more general framework which includes the popular outcome weighted learning framework as a special case of ours.

Decision Making

Paper
Add Code

Stackelberg Batch Policy Learning

no code implementations • 28 Sep 2023 • Wenzhuo Zhou, Annie Qu

From a theoretical standpoint, we provide instance-dependent regret bounds with general function approximation, which shows that our algorithm can learn a best-effort policy that is able to compete against any comparator policy that is covered by batch data.

Decision Making Reinforcement Learning (RL)

Paper
Add Code

A Model-Agnostic Graph Neural Network for Integrating Local and Global Information

no code implementations • 23 Sep 2023 • Wenzhuo Zhou, Annie Qu, Keiland W. Cooper, Norbert Fortin, Babak Shahbaba

Graph Neural Networks (GNNs) have achieved promising performance in a variety of graph-focused tasks.

Paper
Add Code

Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework

no code implementations • 23 Sep 2023 • Wenzhuo Zhou, Yuhan Li, Ruoqing Zhu, Annie Qu

This task faces two primary challenges: providing a comprehensive and rigorous error quantification in CI estimation, and addressing the distributional shift that results from discrepancies between the distribution induced by the target policy and the offline data-generating process.

Off-policy evaluation

Paper
Add Code

Quasi-optimal Reinforcement Learning with Continuous Actions

no code implementations • 21 Jan 2023 • Yuhan Li, Wenzhuo Zhou, Ruoqing Zhu

Many real-world applications of reinforcement learning (RL) require making decisions in continuous action environments.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning

no code implementations • 20 Oct 2021 • Wenzhuo Zhou, Ruoqing Zhu, Annie Qu

To address these challenges, we propose a Proximal Temporal consistency Learning (pT-Learning) framework to estimate an optimal regime that is adaptively adjusted between deterministic and stochastic sparse policy models.

Decision Making

Paper
Add Code

Multicategory Angle-based Learning for Estimating Optimal Dynamic Treatment Regimes with Censored Data

no code implementations • 14 Jan 2020 • Fei Xue, Yanqing Zhang, Wenzhuo Zhou, Haoda Fu, Annie Qu

An optimal dynamic treatment regime (DTR) consists of a sequence of decision rules in maximizing long-term benefits, which is applicable for chronic diseases such as HIV infection or cancer.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.