Search Results for author: Mingzhi Wang

Found 3 papers, 0 papers with code

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

no code implementations31 May 2024 Jiesong Lian, Yucong Huang, Mingzhi Wang, Chengdong Ma, Yixue Hao, Ying Wen, Yaodong Yang

A popular approach for solving zero-sum games is to maintain populations of policies to approximate the Nash Equilibrium (NE).

Efficient Model-agnostic Alignment via Bayesian Persuasion

no code implementations29 May 2024 Fengshuo Bai, Mingzhi Wang, Zhaowei Zhang, Boyuan Chen, Yinda Xu, Ying Wen, Yaodong Yang

This paper explores an efficient method for aligning black-box large models using smaller models, introducing a model-agnostic and lightweight Bayesian Persuasion Alignment framework.

Code Generation Mathematical Reasoning

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

no code implementations20 Feb 2024 Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

The burgeoning integration of artificial intelligence (AI) into human society brings forth significant implications for societal governance and safety.

Cannot find the paper you are looking for? You can Submit a new open access paper.