no code implementations • 7 Feb 2023 • Rundong Wang, Longtao Zheng, Wei Qiu, Bowei He, Bo An, Zinovi Rabinovich, Yujing Hu, Yingfeng Chen, Tangjie Lv, Changjie Fan
Despite its success, ACL's applicability is limited by (1) the lack of a general student framework for dealing with the varying number of agents across tasks and the sparse reward problem, and (2) the non-stationarity of the teacher's task due to ever-changing student strategies.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 2 Jan 2023 • Ian Covert, Wei Qiu, Mingyu Lu, Nayoon Kim, Nathan White, Su-In Lee
Feature selection helps reduce data acquisition costs in ML, but the standard approach is to train models with static feature subsets.
1 code implementation • 7 Nov 2022 • Yue Guo, Wei Qiu, Gondy Leroy, Sheng Wang, Trevor Cohen
Recent lay language generation systems have used Transformer models trained on a parallel corpus to increase health information accessibility.
no code implementations • 18 Oct 2022 • Wei Qiu, Xiao Ma, Bo An, Svetlana Obraztsova, Shuicheng Yan, Zhongwen Xu
Despite the recent advancement in multi-agent reinforcement learning (MARL), the MARL agents easily overfit the training environment and perform poorly in the evaluation scenarios where other agents behave differently.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 27 May 2022 • Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan
During execution durations, the environment changes are influenced by, but not synchronised with, action execution.
Multi-agent Reinforcement Learning reinforcement-learning +3
no code implementations • NeurIPS 2021 • Wei Qiu, Xinrun Wang, Runsheng Yu, Rundong Wang, Xu He, Bo An, Svetlana Obraztsova, Zinovi Rabinovich
Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE).
Multi-agent Reinforcement Learning reinforcement-learning +3
no code implementations • 9 Aug 2021 • Wanqi Xue, Wei Qiu, Bo An, Zinovi Rabinovich, Svetlana Obraztsova, Chai Kiat Yeo
Empirical results demonstrate that many state-of-the-art MACRL methods are vulnerable to message attacks, and our method can significantly improve their robustness.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 13 Jun 2021 • Haipeng Chen, Wei Qiu, Han-Ching Ou, Bo An, Milind Tambe
Empirical results show that our method achieves influence as high as the state-of-the-art methods for contingency-aware IM, while having negligible runtime at test phase.
no code implementations • 16 Feb 2021 • Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich
Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE).
Multi-agent Reinforcement Learning reinforcement-learning +3
no code implementations • 1 Jan 2021 • Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich
Centralized training with decentralized execution (CTDE) has become an important paradigm in multi-agent reinforcement learning (MARL).
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 23 Dec 2020 • Yue Guo, Wei Qiu, Yizhong Wang, Trevor Cohen
Health literacy has emerged as a crucial factor in making appropriate health decisions and ensuring treatment outcomes.
no code implementations • 23 Dec 2020 • Wei Qiu, Yangsibo Huang, Quanzheng Li
Missing value imputation is a challenging and well-researched topic in data mining.
no code implementations • 21 Oct 2020 • Shutang You, Hongyu Li, Shengyuan Liu, Kaiqi Sun, Weikang Wang, Wei Qiu, Yilu Liu
The power system frequency is important for the system overall stability.
no code implementations • ICML 2020 • Rundong Wang, Xu He, Runsheng Yu, Wei Qiu, Bo An, Zinovi Rabinovich
Under the limited bandwidth constraint, a communication protocol is required to generate informative messages.
no code implementations • 7 Oct 2019 • Wei Qiu, Jiaming Guo, Xiang Li, Mengjia Xu, Mo Zhang, Ning Guo, Quanzheng Li
As the six networks are trained with image patches consisting of both individual cells and touching/overlapping cells, they can effectively recognize cell types that are presented in multi-instance image samples.
no code implementations • 1 Oct 2019 • Jiaming Guo, Wei Qiu, Xiang Li, Xuandong Zhao, Ning Guo, Quanzheng Li
Imaging-based early diagnosis of Alzheimer Disease (AD) has become an effective approach, especially by using nuclear medicine imaging techniques such as Positron Emission Topography (PET).
no code implementations • SEMEVAL 2018 • Wei Qiu, Mosha Chen, Linlin Li, Luo Si
Hypernym discovery aims to discover the hypernym word sets given a hyponym word and proper corpus.
Ranked #3 on Hypernym Discovery on General
no code implementations • 24 Mar 2014 • Anna Senina, Marcus Rohrbach, Wei Qiu, Annemarie Friedrich, Sikandar Amin, Mykhaylo Andriluka, Manfred Pinkal, Bernt Schiele
Humans can easily describe what they see in a coherent way and at varying level of detail.