Search Results for author: Mingzhi Wang

Found 3 papers, 0 papers with code

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

no code implementations • 31 May 2024 • Jiesong Lian, Yucong Huang, Mingzhi Wang, Chengdong Ma, Yixue Hao, Ying Wen, Yaodong Yang

A popular approach for solving zero-sum games is to maintain populations of policies to approximate the Nash Equilibrium (NE).

Paper
Add Code

Efficient Model-agnostic Alignment via Bayesian Persuasion

no code implementations • 29 May 2024 • Fengshuo Bai, Mingzhi Wang, Zhaowei Zhang, Boyuan Chen, Yinda Xu, Ying Wen, Yaodong Yang

This paper explores an efficient method for aligning black-box large models using smaller models, introducing a model-agnostic and lightweight Bayesian Persuasion Alignment framework.

Code Generation Mathematical Reasoning

Paper
Add Code

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

no code implementations • 20 Feb 2024 • Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

The burgeoning integration of artificial intelligence (AI) into human society brings forth significant implications for societal governance and safety.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.