Search Results for author: Chengdong Ma

Found 4 papers, 1 paper with code

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

no code implementations • 20 Feb 2024 • Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang

The burgeoning integration of artificial intelligence (AI) into human society brings forth significant implications for societal governance and safety.

Panacea: Pareto Alignment via Preference Adaptation for LLMs

no code implementations • 3 Feb 2024 • Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Qingfu Zhang, Siyuan Qi, Yaodong Yang

Our work marks a step forward in effectively and efficiently aligning models to diverse and intricate human preferences in a controllable and Pareto-optimal manner.

Language Modelling, Large Language Model

Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models

no code implementations • 30 Sep 2023 • Chengdong Ma, Ziran Yang, Minquan Gao, Hai Ci, Jun Gao, Xuehai Pan, Yaodong Yang

In this paper, we present Red-teaming Game (RTG), a general game-theoretic framework without manual annotation.

Language Modelling, Vulnerability Detection

Scalable Model-based Policy Optimization for Decentralized Networked Systems

2 code implementations • 13 Jul 2022 • Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple tasks.
