Search Results for author: Amirreza Kazemi

Found 3 papers, 2 papers with code

MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time

no code implementations25 May 2024 Jikun Kang, Xin Zhe Li, Xi Chen, Amirreza Kazemi, Boxing Chen

Inspired by findings that LLMs know how to produce right answer but struggle to select the correct reasoning path, we propose a purely inference-based searching method called MindStar (M*), which treats reasoning tasks as search problems.

Adversarially Balanced Representation for Continuous Treatment Effect Estimation

1 code implementation17 Dec 2023 Amirreza Kazemi, Martin Ester

Individual treatment effect (ITE) estimation requires adjusting for the covariate shift between populations with different treatments, and deep representation learning has shown great promise in learning a balanced representation of covariates.

counterfactual Representation Learning

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees

1 code implementation NeurIPS 2023 Sharan Vaswani, Amirreza Kazemi, Reza Babanezhad, Nicolas Le Roux

Instantiating the generic algorithm results in an actor that involves maximizing a sequence of surrogate functions (similar to TRPO, PPO) and a critic that involves minimizing a closely connected objective.

Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.