1 code implementation • ACL 2022 • Hao Chen, Zepeng Zhai, Fangxiang Feng, Ruifan Li, Xiaojie Wang
Specifically, we first define ten types of relations for ASTE task, and then adopt a biaffine attention module to embed these relations as an adjacent tensor between words in a sentence.
no code implementations • ECNLP (ACL) 2022 • Lvxing Zhu, Hao Chen, Chao Wei, Weiru Zhang
To solve the above problem, we propose a novel method that leverages an auxiliary module to enhance the representations of long-tail queries by taking advantage of reliable supervised information of variant frequent queries.
1 code implementation • EMNLP 2021 • Hao Chen, Rui Xia, Jianfei Yu
Data augmentation and adversarial perturbation approaches have recently achieved promising results in solving the over-fitting problem in many natural language processing (NLP) tasks including sentiment classification.
no code implementations • 24 Apr 2024 • Jiaxin Zhuang, Linshan Wu, Qiong Wang, Varut Vardhanabhuti, Lin Luo, Hao Chen
We further scale up the MiM to large pre-training datasets with more than 10k volumes, showing that large-scale pre-training can further enhance the performance of downstream tasks.
no code implementations • 23 Apr 2024 • Sunan He, Yuxiang Nie, Zhixuan Chen, Zhiyuan Cai, Hongmei Wang, Shu Yang, Hao Chen
The rapid advancement of large-scale vision-language models has showcased remarkable capabilities across various tasks.
no code implementations • 20 Apr 2024 • Jiongliang Lin, Yiwen Guo, Hao Chen
Intrusion detection is a long standing and crucial problem in security.
1 code implementation • Under review for Transaction 2024 • Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen
Our method benefits various applications including in-the-wild metrology monocular-SLAM, and 3D reconstruction, which highlight the versatility of Metric3D v2 models as geometric foundation models.
Ranked #1 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)
no code implementations • 15 Apr 2024 • Fangwei Zhong, Kui Wu, Hai Ci, Churan Wang, Hao Chen
We evaluate our tracker on several high-fidelity environments with challenging situations, such as distraction and occlusion.
2 code implementations • 10 Apr 2024 • Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen
Finally, for MC3D-Det joint training, the elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.
no code implementations • 9 Apr 2024 • Junlin Hou, Jilan Xu, Hao Chen
In the former branch, we train the CNN with a CAW layer inserted to perform skin lesion diagnosis.
no code implementations • 8 Apr 2024 • Junlin Hou, Jilan Xu, Rui Feng, Hao Chen
Previous noise learning methods mainly considered noise arising from images being mislabeled, i. e. label noise, assuming that all mislabeled images are of high image quality.
1 code implementation • 6 Apr 2024 • Yu Cai, Weiwen Zhang, Hao Chen, Kwang-Ting Cheng
Anomaly detection (AD) aims at detecting abnormal samples that deviate from the expected normal patterns.
1 code implementation • 6 Apr 2024 • Haibo Jin, Haoxuan Che, Hao Chen
Self-training is a simple yet effective method for semi-supervised learning, during which pseudo-label selection plays an important role for handling confirmation bias.
1 code implementation • 4 Apr 2024 • Yuting He, Fuxiang Huang, Xinrui Jiang, Yuxiang Nie, Minghao Wang, Jiguang Wang, Hao Chen
To answer these questions, a comprehensive and deep survey of the challenges, opportunities, and future directions of HFMs is presented in this survey.
1 code implementation • 3 Apr 2024 • Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, Wanli Ouyang
RSM is specifically designed to capture the global context of remote sensing images with linear complexity, facilitating the effective processing of large VHR images.
Ranked #1 on Road Segmentation on Massachusetts Roads Dataset (F1 metric)
Building change detection for remote sensing images Change Detection +1
no code implementations • 3 Apr 2024 • Huajun Zhou, Fengtao Zhou, Hao Chen
In this paper, we propose a Cohort-individual Cooperative Learning (CCL) framework to advance cancer survival analysis by collaborating knowledge decomposition and cohort guidance.
1 code implementation • 1 Apr 2024 • Fengtao Zhou, Yingxue Xu, Yanfen Cui, Shenyan Zhang, Yun Zhu, Weiyang He, Jiguang Wang, Xin Wang, Ronald Chan, Louis Ho Shing Lau, Chu Han, Dafu Zhang, Zhenhui Li, Hao Chen
The limited availability of modalities for each patient would cause information loss, adversely affecting predictive accuracy.
no code implementations • 1 Apr 2024 • Hao Chen, Yuqi Hou, Chenyuan Qu, Irene Testini, Xiaohan Hong, Jianbo Jiao
While many existing datasets focus on scene understanding from a certain perspective (e. g. egocentric or third-person views), our dataset offers a panoptic perspective (i. e. multiple viewpoints with multiple data modalities).
no code implementations • 30 Mar 2024 • Tongkun Su, Jun Li, Xi Zhang, Haibo Jin, Hao Chen, Qiong Wang, Faqin Lv, Baoliang Zhao, Yin Hu
In this work, we leverage descriptions in medical reports to design multi-granular question-answer pairs associated with different diseases, which assist the framework in pre-training without requiring extra annotations from experts.
no code implementations • 25 Mar 2024 • Zhixuan Chen, Luyang Luo, Yequan Bie, Hao Chen
Medical report generation has achieved remarkable advancements yet has still been faced with several challenges.
no code implementations • 23 Mar 2024 • Hao Chen, Minyu Chen, Ruibang Liu, Guoqiang Li
ZKP systems have surged attention and held a fundamental role in contemporary cryptography.
no code implementations • 22 Mar 2024 • Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen
For metric depth estimation, we show that the key to a zero-shot single-view model lies in resolving the metric ambiguity from various camera models and large-scale data training.
1 code implementation • 20 Mar 2024 • Linshan Wu, Zhun Zhong, Jiayi Ma, Yunchao Wei, Hao Chen, Leyuan Fang, Shutao Li
Based on the label distributions, we leverage the GMM to generate high-quality pseudo labels for more reliable supervision.
Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation
no code implementations • 19 Mar 2024 • Yi Lin, Zhengjie ZHU, Kwang-Ting Cheng, Hao Chen
To address this issue, we propose PAMT, a novel Prompt-guided Adaptive Model Transformation framework that enhances MIL classification performance by seamlessly adapting pre-trained models to the specific characteristics of histopathology data.
no code implementations • 18 Mar 2024 • Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen
To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model.
no code implementations • 18 Mar 2024 • Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen
In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans.
no code implementations • 17 Mar 2024 • Kangyang Xie, BinBin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen
Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks.
no code implementations • 17 Mar 2024 • Yongtao Ge, Wenjia Wang, Yongfan Chen, Hao Chen, Chunhua Shen
In this work, we show that synthetic data created by generative models is complementary to computer graphics (CG) rendered data for achieving remarkable generalization performance on diverse real-world scenes for 3D human pose and shape estimation (HPS).
1 code implementation • 17 Mar 2024 • Renlong Wu, Zhilu Zhang, Shuohao Zhang, Longfei Gou, Haobin Chen, Lei Zhang, Hao Chen, WangMeng Zuo
On the other hand, in order to enhance the desmoking performance, we further feed the valuable information from PS frame into models, where a masking strategy and a regularization term are presented to avoid trivial solutions.
1 code implementation • 15 Mar 2024 • Zhikang Wang, Yumeng Zhang, Yingxue Xu, Seiya Imoto, Hao Chen, Jiangning Song
G-HANet is expected to be explored as a useful tool by the research community to address the current bottleneck of insufficient histo-genomic data pairing in the context of cancer prognosis and precision oncology.
1 code implementation • 15 Mar 2024 • Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang
Large pre-trained VLMs like CLIP have demonstrated superior zero-shot recognition ability, and a number of recent studies leverage this ability to mitigate catastrophic forgetting in CL, but they focus on closed-set CL in a single domain dataset.
no code implementations • 14 Mar 2024 • Yequan Bie, Luyang Luo, Zhixuan Chen, Hao Chen
Utilizing potent representations of the large vision-language models (VLMs) to accomplish various downstream tasks has attracted increasing attention.
Explainable artificial intelligence Explainable Artificial Intelligence (XAI) +1
no code implementations • 14 Mar 2024 • Yu Cai, Hao Chen, Kwang-Ting Cheng
To the best of our knowledge, this is the first effort to theoretically clarify the principles and design philosophy of AE for anomaly detection.
no code implementations • 13 Mar 2024 • Shuhan LI, Yi Lin, Hao Chen, Kwang-Ting Cheng
In this paper, we introduce an Iterative Online Image Synthesis (IOIS) framework to address the class imbalance problem in medical image classification.
1 code implementation • 11 Mar 2024 • Shu Yang, Yihui Wang, Hao Chen
Multiple Instance Learning (MIL) has emerged as a dominant paradigm to extract discriminative feature representations within Whole Slide Images (WSIs) in computational pathology.
no code implementations • 11 Mar 2024 • Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning.
no code implementations • 10 Mar 2024 • Guangkai Xu, Yongtao Ge, MingYu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shen
We show that, simply initializing image understanding models using a pre-trained UNet (or transformer) of diffusion models, it is possible to achieve remarkable transferable performance on fundamental vision perception tasks using a moderate amount of target data (even synthetic data only), including monocular depth, surface normal, image segmentation, matting, human pose estimation, among virtually many others.
2 code implementations • 9 Mar 2024 • Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan
In conclusion, this paper provides valuable insights into the potential applications and challenges of MLLMs in human-centric computing.
1 code implementation • 8 Mar 2024 • Zhengrui Guo, Jiabo Ma, Yingxue Xu, Yihui Wang, Liansheng Wang, Hao Chen
Histopathology serves as the gold standard in cancer diagnosis, with clinical reports being vital in interpreting and understanding this process, guiding cancer treatment and patient care.
2 code implementations • 7 Mar 2024 • Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazak, Hao Chen, Xiaonan Huang, Bhiksha Raj
Referring perception, which aims at grounding visual objects with multimodal referring guidance, is essential for bridging the gap between humans, who provide instructions, and the environment where intelligent systems perceive.
1 code implementation • 6 Mar 2024 • Yufan Chen, Ching Ting Leung, Yong Huang, Jianwei Sun, Hao Chen, Hanyu Gao
In addition, it employs a series of novel augmentation algorithms to significantly enhance the robustness and performance of the model.
no code implementations • 4 Mar 2024 • Shulei Ni, Yisheng Qiu, YunChun Chen, Zihao Song, Hao Chen, Xuejian Jiang, Huaxi Chen
In the imaging process of an astronomical telescope, the deconvolution of its beam or Point Spread Function (PSF) is a crucial task.
no code implementations • 2 Mar 2024 • Xinyi Yu, Ling Yan, PengTao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ou
This innovative approach empowers the network to simultaneously predict masks and depth, enhancing its ability to capture nuanced depth-related information during the instance segmentation process.
no code implementations • 1 Mar 2024 • Zhenpeng Huang, Chao Li, Hao Chen, Yongjian Deng, Yifeng Geng, LiMin Wang
Our pre-training overcomes the limitations of previous methods, which either sacrifice temporal information by converting event sequences into 2D images for utilizing pre-trained image models or directly employ paired image data for knowledge distillation to enhance the learning of event streams.
no code implementations • 29 Feb 2024 • Lei Xie, Qingrun Zeng, Huajun Zhou, Guoqiang Xie, Mingchu Li, Jiahao Huang, Jianan Cui, Hao Chen, Yuanjing Feng
Diffusion MRI tractography is an important tool for identifying and analyzing the intracranial course of cranial nerves (CNs).
1 code implementation • 29 Feb 2024 • Hanxi Li, Zhengxun Zhang, Hao Chen, Lin Wu, Bo Li, Deyin Liu, Mingwen Wang
Effectively addressing the challenge of industrial Anomaly Detection (AD) necessitates an ample supply of defective samples, a constraint often hindered by their scarcity in industrial contexts.
1 code implementation • 27 Feb 2024 • Linshan Wu, Jiaxin Zhuang, Hao Chen
Through this pretext task, VoCo implicitly encodes the contextual position priors into model representations without the guidance of annotations, enabling us to effectively improve the performance of downstream tasks that require high-level semantics.
no code implementations • 19 Feb 2024 • Qinggang Zhang, Junnan Dong, Hao Chen, Wentao Li, Feiran Huang, Xiao Huang
Existing models typically input queries and database schemas into the LLM and rely on the LLM to perform semantic-structure matching and generate structured SQL.
no code implementations • 18 Feb 2024 • Zijin Hong, Zheng Yuan, Hao Chen, Qinggang Zhang, Feiran Huang, Xiao Huang
Generating accurate SQL for user queries (text-to-SQL) is a long-standing problem since the generation of the SQL requires comprehending the query and database and retrieving the accurate data from the database accordingly.
no code implementations • 14 Feb 2024 • Feiran Huang, Zhenghang Yang, Junyi Jiang, Yuanchen Bei, Yijie Zhang, Hao Chen
To address this challenge, we propose an LLM Interaction Simulator (LLM-InS) to model users' behavior patterns based on the content aspect.
no code implementations • 14 Feb 2024 • Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, PengFei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni
The resulting FracNet+ demonstrates competitive performance in rib fracture detection, which lays a foundation for further research and development in AI-assisted rib fracture detection and diagnosis.
no code implementations • 12 Feb 2024 • Yijie Zhang, Yuanchen Bei, Hao Chen, Qijie Shen, Zheng Yuan, Huan Gong, Senzhang Wang, Feiran Huang, Xiao Huang
POG defines the partial order relation of multiple behaviors and models behavior combinations as weighted edges to merge separate behavior graphs into a joint POG.
1 code implementation • 11 Feb 2024 • Hao Chen, Gonzalo E. Constante Flores, Can Li
The incorporation of physics into neural networks can improve generalization and data efficiency.
no code implementations • 4 Feb 2024 • Yifeng He, Jiabo Huang, Yuyang Rong, Yiwen Guo, Ethan Wang, Hao Chen
The remarkable capability of large language models (LLMs) in generating high-quality code has drawn increasing attention in the software testing community.
1 code implementation • 2 Feb 2024 • Wanghan Xu, Kang Chen, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai
Data-driven weather forecast based on machine learning (ML) has experienced rapid development and demonstrated superior performance in the global medium-range forecast compared to traditional physics-based dynamical models.
no code implementations • 2 Feb 2024 • Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang
Large foundation models (LFMs) are claiming incredible performances.
1 code implementation • 2 Feb 2024 • Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj
Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment.
no code implementations • 31 Jan 2024 • Yicui Peng, Hao Chen, ChingSheng Lin, Guo Huang, Jinrong Hu, Hui Guo, Bin Kong, Shu Hu, Xi Wu, Xin Wang
Providing explanations within the recommendation system would boost user satisfaction and foster trust, especially by elaborating on the reasons for selecting recommended items tailored to the user.
1 code implementation • 30 Jan 2024 • Jiayuan Luo, Wentao Zhang, Yuchen Fang, Xiaowei Gao, Dingyi Zhuang, Hao Chen, Xinke Jiang
Time Series Supplier Allocation (TSSA) poses a complex NP-hard challenge, aimed at refining future order dispatching strategies to satisfy order demands with maximum supply efficiency fully.
no code implementations • 26 Jan 2024 • Shumin Yao, Xiaodong Xu, Hao Chen, Yaping Sun, Qinglin Zhao
Cross-technology communication (CTC) is a promising technique that enables direct communications among incompatible wireless technologies without needing hardware modification.
1 code implementation • 26 Jan 2024 • Hao Chen, Yuanchen Bei, Qijie Shen, Yue Xu, Sheng Zhou, Wenbing Huang, Feiran Huang, Senzhang Wang, Xiao Huang
Predicting Click-Through Rate (CTR) in billion-scale recommender systems poses a long-standing challenge for Graph Neural Networks (GNNs) due to the overwhelming computational complexity involved in aggregating billions of neighbors.
no code implementations • 22 Jan 2024 • Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi
In this paper, we extend meteorological downscaling to arbitrary scattered station scales, establish a brand new benchmark and dataset, and retrieve meteorological states at any given station location from a coarse-resolution meteorological field.
no code implementations • 22 Jan 2024 • Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng
Sign language recognition (SLR) plays a vital role in facilitating communication for the hearing-impaired community.
no code implementations • 22 Jan 2024 • PeiGen Ye, Yaping Sun, Shumin Yao, Hao Chen, Xiaodong Xu, Shuguang Cui
Codebook-based generative semantic communication attracts increasing attention, since only indices are required to be transmitted when the codebook is shared between transmitter and receiver.
no code implementations • 22 Jan 2024 • Luyang Luo, Xin Huang, Minghao Wang, Zhuoyue Wan, Hao Chen
Specifically, the debiasing model is required to learn adaptive agreement with the biased council by agreeing on the correctly predicted samples and disagreeing on the wrongly predicted samples by the biased council.
no code implementations • 19 Jan 2024 • Hao Qian, Hongting Zhou, Qian Zhao, Hao Chen, Hongxiang Yao, Jingwei Wang, Ziqi Liu, Fei Yu, Zhiqiang Zhang, Jun Zhou
The stock market is a crucial component of the financial system, but predicting the movement of stock prices is challenging due to the dynamic and intricate relations arising from various aspects such as economic indicators, financial reports, global news, and investor sentiment.
no code implementations • 18 Jan 2024 • Jie Guo, Hao Chen, Bin Song, Yuhao Chi, Chau Yuen, Fei Richard Yu, Geoffrey Ye Li, Dusit Niyato
In this article, we present a novel framework, named distributed task-oriented communication networks (DTCN), based on recent advances in multimodal semantic transmission and edge intelligence.
no code implementations • 17 Jan 2024 • Zili Liu, Hao Chen, Wenyuan Li, Keyan Chen, Zipeng Qi, Chenyang Liu, Zhengxia Zou, Zhenwei Shi
This paper is the first to consider the impact of label noise on the detection of clouds and snow in remote sensing images.
1 code implementation • 16 Jan 2024 • Yequan Bie, Luyang Luo, Hao Chen
Black-box deep learning approaches have showcased significant potential in the realm of medical image analysis.
no code implementations • 15 Jan 2024 • Yihan Cao, Xu Chen, Lun Du, Hao Chen, Qiang Fu, Shi Han, Yushu Du, Yanbin Kang, Guangming Lu, Zi Li
Person-job fit is an essential part of online recruitment platforms in serving various downstream applications like Job Search and Candidate Recommendation.
1 code implementation • 15 Jan 2024 • Yi Lin, Zeyu Wang, Dong Zhang, Kwang-Ting Cheng, Hao Chen
To alleviate this problem, in this paper, we propose a weakly-supervised nuclei segmentation method that only requires partial point labels of nuclei.
no code implementations • 7 Jan 2024 • Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll
Modeling the interaction between humans and objects has been an emerging research direction in recent years.
1 code implementation • 4 Jan 2024 • Wenyuan Li, Zili Liu, Keyan Chen, Hao Chen, Shunlin Liang, Zhengxia Zou, Zhenwei Shi
Next, we construct hyper-networks based on deep learning methods to directly learn weather patterns from a large amount of meteorological data.
1 code implementation • 3 Jan 2024 • Yilan Zhang, Yingxue Xu, Jianqi Chen, Fengying Xie, Hao Chen
Despite advantages of multimodal learning for cancer survival prediction, massive redundancy in multimodal data prevents it from extracting discriminative and compact information: (1) An extensive amount of intra-modal task-unrelated information blurs discriminability, especially for gigapixel whole slide images (WSIs) with many patches in pathology and thousands of pathways in genomic data, leading to an ``intra-modal redundancy" issue.
no code implementations • 2 Jan 2024 • Yingbin Zhou, Yaping Sun, GuanYing Chen, Xiaodong Xu, Hao Chen, Binhong Huang, Shuguang Cui, Ping Zhang
Vector quantization-based image semantic communication systems have successfully boosted transmission efficiency, but face a challenge with conflicting requirements between codebook design and digital constellation modulation.
1 code implementation • 29 Dec 2023 • Xiangyu Xiong, Yue Sun, Xiaohong Liu, Wei Ke, Chan-Tong Lam, Jiangang Chen, Mingfeng Jiang, Mingwei Wang, Hui Xie, Tong Tong, Qinquan Gao, Hao Chen, Tao Tan
Experimental results show that DisGAN consistently outperforms the GAN-based augmentation methods with explainable binary classification.
no code implementations • 28 Dec 2023 • Weide Liu, Huijing Zhan, Hao Chen, Fengmao Lv
Multimodal sentiment analysis aims to identify the emotions expressed by individuals through visual, language, and acoustic cues.
no code implementations • 27 Dec 2023 • Xin Yuan, Ning li, Kang Wei, Wenchao Xu, Quan Chen, Hao Chen, Song Guo
The model segmentation without user mobility has been investigated deeply by previous works.
2 code implementations • 26 Dec 2023 • Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin
Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL.
no code implementations • 23 Dec 2023 • Xianjie Zhang, Jiahao Sun, Chen Gong, Kai Wang, Yifei Cao, Hao Chen, Yu Liu
The emergence of on-demand ride pooling services allows each vehicle to serve multiple passengers at a time, thus increasing drivers' income and enabling passengers to travel at lower prices than taxi/car on-demand services (only one passenger can be assigned to a car at a time like UberX and Lyft).
2 code implementations • 23 Dec 2023 • Keyan Chen, Chengyang Liu, Wenyuan Li, Zili Liu, Hao Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
Change detection, a prominent research area in remote sensing, is pivotal in observing and analyzing surface transformations.
Ranked #6 on Change Detection on LEVIR-CD
no code implementations • 19 Dec 2023 • Hao Chen, Lun Du, Yuxuan Lu, Qiang Fu, Xu Chen, Shi Han, Yanbin Kang, Guangming Lu, Zi Li
Online recruitment platforms typically employ Person-Job Fit models in the core service that automatically match suitable job seekers with appropriate job positions.
no code implementations • 18 Dec 2023 • Kun Chen, Lei Bai, Fenghua Ling, Peng Ye, Tao Chen, Jing-Jia Luo, Hao Chen, Yi Xiao, Kang Chen, Tao Han, Wanli Ouyang
Initial states are typically generated by traditional data assimilation components, which are computational expensive and time-consuming.
no code implementations • 14 Dec 2023 • Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff
In particular, we provide performance guarantees for the MMD-CUSUM test under $\alpha$, $\beta$, and $\phi$-mixing processes, which significantly expands its utility beyond the i. i. d.
1 code implementation • 13 Dec 2023 • Kaijie Zhu, Qinlin Zhao, Hao Chen, Jindong Wang, Xing Xie
The evaluation of large language models (LLMs) is crucial to assess their performance and mitigate potential security risks.
no code implementations • 11 Dec 2023 • Qinggang Zhang, Junnan Dong, Hao Chen, Daochen Zha, Zailiang Yu, Xiao Huang
Generative Large Language Models (LLMs), such as ChatGPT, offer interactive APIs that can answer common questions at a human-expert level.
no code implementations • 9 Dec 2023 • Yuanchen Bei, Sheng Zhou, Qiaoyu Tan, Hao Xu, Hao Chen, Zhao Li, Jiajun Bu
To address these issues, we utilize the advantages of reinforcement learning in adaptively learning in complex environments and propose a novel method that incorporates Reinforcement neighborhood selection for unsupervised graph ANomaly Detection (RAND).
no code implementations • 9 Dec 2023 • Renao Yan, Qiehe Sun, Cheng Jin, Yiqing Liu, Yonghong He, Tian Guan, Hao Chen
While most of the conventional MIL methods use attention scores to estimate instance importance scores (IIS) which contribute to the prediction of the slide labels, these often lead to skewed attention distributions and inaccuracies in identifying crucial instances.
no code implementations • 7 Dec 2023 • Wen Wang, Kecheng Zheng, Qiuyu Wang, Hao Chen, Zifan Shi, Ceyuan Yang, Yujun Shen, Chunhua Shen
We offer a new perspective on approaching the task of video generation.
1 code implementation • 2 Dec 2023 • Hongyi Wang, Luyang Luo, Fang Wang, Ruofeng Tong, Yen-Wei Chen, Hongjie Hu, Lanfen Lin, Hao Chen
Based on this idea, we design Iteratively Coupled Multiple Instance Learning (ICMIL) to couple the embedder and the bag classifier at a low cost.
no code implementations • 30 Nov 2023 • Xiangyu Gao, Yaping Sun, Dongyu Wei, Xiaodong Xu, Hao Chen, Hao Yin, Shuguang Cui
In this context, we address the problem of efficient remote object recognition by optimizing feature transmission between mobile devices and edge servers.
1 code implementation • 26 Nov 2023 • Junhui Yin, Wei Yin, Hao Chen, Xuqian Ren, Zhanyu Ma, Jun Guo, Yifan Liu
These priors ensure the color rendered along rays to be robust to view direction and reduce the inherent ambiguities of density estimated along rays.
no code implementations • 25 Nov 2023 • Murat Bayraktar, Nuria González-Prelcic, Hao Chen
Specifically, we introduce a generalized eigenvalue-based precoder design that considers the downlink user rate, the radar gain, and the SI suppression.
no code implementations • 24 Nov 2023 • Xiangyu Xiong, Yue Sun, Xiaohong Liu, Chan-Tong Lam, Tong Tong, Hao Chen, Qinquan Gao, Wei Ke, Tao Tan
Although current data augmentation methods are successful to alleviate the data insufficiency, conventional augmentation are primarily intra-domain while advanced generative adversarial networks (GANs) generate images remaining uncertain, particularly in small-scale datasets.
1 code implementation • 21 Nov 2023 • Yunpeng Huang, Jingwei Xu, Junyu Lai, Zixu Jiang, Taolue Chen, Zenan Li, Yuan YAO, Xiaoxing Ma, Lijuan Yang, Hao Chen, Shupeng Li, Penghao Zhao
Transformer-based Large Language Models (LLMs) have been applied in diverse areas such as knowledge bases, human interfaces, and dynamic agents, and marking a stride towards achieving Artificial General Intelligence (AGI).
no code implementations • 19 Nov 2023 • Wen Wang, Canyu Zhao, Hao Chen, Zhekai Chen, Kecheng Zheng, Chunhua Shen
We empirically find that sparse control conditions, such as bounding boxes, are suitable for layout planning, while dense control conditions, e. g., sketches and keypoints, are suitable for generating high-quality image content.
no code implementations • 15 Nov 2023 • Xiang Li, Che Wang, Bing Li, Hao Chen, Sizhe Li
In this paper, we propose a method for knowledge graph construction in power distribution networks.
no code implementations • 12 Nov 2023 • Yijie Zhang, Yuanchen Bei, Shiqi Yang, Hao Chen, Zhiqing Li, Lijia Chen, Feiran Huang
To this end, we propose IMGCF, a simple but effective model to alleviate behavior data imbalance for multi-behavior graph collaborative filtering.
1 code implementation • 7 Nov 2023 • Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu
Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning.
no code implementations • 6 Nov 2023 • Hao Chen, Nanxi Li, Ruizhe Long, Ying-Chang Liang
To address this issue, we further investigate this ARIS-specific channel estimation problem and propose a least-square (LS) based channel estimator, whose performance can be further improved with the design on ARIS reflection patterns at the channel training phase.
1 code implementation • NeurIPS 2023 • Shenzhi Wang, Qisen Yang, Jiawei Gao, Matthieu Gaetan Lin, Hao Chen, Liwei Wu, Ning Jia, Shiji Song, Gao Huang
Existing solutions tackle this problem by imposing a policy constraint on the policy improvement objective in both offline and online learning.
no code implementations • 26 Oct 2023 • Qinlin Zhao, Jindong Wang, Yixuan Zhang, Yiqiao Jin, Kaijie Zhu, Hao Chen, Xing Xie
Large language models (LLMs) have been widely used as agents to complete different tasks, such as personal assistance or event planning.
no code implementations • 22 Oct 2023 • Liyizhe Peng, Zixing Zhang, Tao Pang, Jing Han, Huan Zhao, Hao Chen, Björn W. Schuller
This indicates the strong transferability and feasibility of LLMs in the field of emotion recognition.
no code implementations • 18 Oct 2023 • Weian Mao, Muzhi Zhu, Zheng Sun, Shuaike Shen, Lin Yuanbo Wu, Hao Chen, Chunhua Shen
Most prior encoders rely on atom-wise features, such as angles and distances between atoms, which are not available in this context.
no code implementations • 18 Oct 2023 • Zhen Yang, Ganggui Ding, Wen Wang, Hao Chen, Bohan Zhuang, Chunhua Shen
Subsequently, we propose an additional reassembly step to seamlessly integrate the respective editing results and the non-editing region to obtain the final edited image.
1 code implementation • 18 Oct 2023 • Songyan Zhang, Xinyu Sun, Hao Chen, Bo Li, Chunhua Shen
Finding corresponding pixels within a pair of images is a fundamental computer vision task with various applications.
no code implementations • 14 Oct 2023 • Khuong N. Nguyen, Abhishek Sehgal, Yuming Zhu, Junsu Choi, Guanbo Chen, Hao Chen, Boon Loong Ng, Charlie Zhang
As the complexity and scale of modern computer networks continue to increase, there has emerged an urgent need for precise traffic analysis, which plays a pivotal role in cutting-edge wireless connectivity technologies.
no code implementations • 9 Oct 2023 • Weimin Xiong, Yiwen Guo, Hao Chen
In this paper, we explore the ability of LLMs for testing programs/code.
no code implementations • 8 Oct 2023 • Tianyang Zhong, Wei Zhao, Yutong Zhang, Yi Pan, Peixin Dong, Zuowei Jiang, Xiaoyan Kui, Youlan Shang, Li Yang, Yaonai Wei, Longtao Yang, Hao Chen, Huan Zhao, Yuxiao Liu, Ning Zhu, Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu, Haixing Dai, Zihao Wu, Lu Zhang, Shu Zhang, Xiaoyan Cai, Xintao Hu, Shijie Zhao, Xi Jiang, Xin Zhang, Xiang Li, Dajiang Zhu, Lei Guo, Dinggang Shen, Junwei Han, Tianming Liu, Jun Liu, Tuo Zhang
Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels.
no code implementations • 7 Oct 2023 • Lei Zhang, Hao Chen, Shu Hu, Bin Zhu, Ching Sheng Lin, Xi Wu, Jinrong Hu, Xin Wang
Generative adversarial networks (GANs) have remarkably advanced in diverse domains, especially image generation and editing.
no code implementations • 4 Oct 2023 • Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff
This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM).
no code implementations • 4 Oct 2023 • Hao Chen, Qi Zhang, Zenan Huang, Haobo Wang, Junbo Zhao
Distributional shift between domains poses great challenges to modern machine learning algorithms.
no code implementations • 1 Oct 2023 • Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu
This paper presents a novel approach to object completion, with the primary goal of reconstructing a complete object from its partially visible components.
no code implementations • 29 Sep 2023 • Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen
For optical coherence tomography angiography (OCTA) images, a limited scanning rate leads to a trade-off between field-of-view (FOV) and imaging resolution.
no code implementations • 29 Sep 2023 • Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
This paper aims to understand the nature of noise in pre-training datasets and to mitigate its impact on downstream tasks.
1 code implementation • ICCV 2023 • Fengtao Zhou, Hao Chen
With the rapid advances in high-throughput sequencing technologies, the focus of survival analysis has shifted from examining clinical indicators to incorporating genomic profiles with pathological images.
no code implementations • 21 Sep 2023 • Yusen Wu, Jamie Deng, Hao Chen, Phuong Nguyen, Yelena Yesha
Federated Learning (FL) has revolutionized how we train deep neural networks by enabling decentralized collaboration while safeguarding sensitive data and improving model performance.
no code implementations • 21 Sep 2023 • Hao Chen, Yusen Wu, Phuong Nguyen, Chao Liu, Yelena Yesha
This merging process not only enhances the model performance by converging to a better local optimum, but also minimizes computational costs, offering an efficient and explicit learning process integrated with stochastic gradient descent.
1 code implementation • ICCV 2023 • Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao
It is understandable as the model is designed to learn paired mapping (e. g. from a noisy image to its clean version).
Ranked #1 on Denoising on CBSD68 sigm75
no code implementations • 9 Sep 2023 • Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix
As an effective way to alleviate the burden of data annotation, semi-supervised learning (SSL) provides an attractive solution due to its ability to leverage both labeled and unlabeled data to build a predictive model.
no code implementations • 4 Sep 2023 • Jiabo Huang, Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen
The test cases are obtained with the assistance of a customized fuzzer and are only required during pre-training.
1 code implementation • 1 Sep 2023 • Shengcong Chen, Changxing Ding, DaCheng Tao, Hao Chen
Second, we propose a new instance normalization method that is robust to the variation in foreground-background ratios.
no code implementations • 31 Aug 2023 • Hina Raja, Asim Munawar, Mohammad Delsoz, Mohammad Elahi, Yeganeh Madadi, Amr Hassan, Hashem Abu Serhan, Onur Inam, Luis Hermandez, Sang Tran, Wuqas Munir, Alaa Abd-Alrazaq, Hao Chen, SiamakYousefi
Moreover, the extendibility of the model to other scientific fields broadens its impact in facilitating research and trend analysis across diverse disciplines.
1 code implementation • 25 Aug 2023 • Haibo Jin, Haoxuan Che, Hao Chen
The framework leverages self-training and domain adversarial learning to address the domain gap during adaptation.
1 code implementation • 24 Aug 2023 • Haibo Jin, Haoxuan Che, Yi Lin, Hao Chen
To address these challenges, we propose diagnosis-driven prompts for medical report generation (PromptMRG), a novel framework that aims to improve the diagnostic accuracy of MRG with the guidance of diagnosis-aware prompts.
no code implementations • 23 Aug 2023 • Hao Chen, Gonzalo E. Constante-Flores, Can Li
Decision-making problems can be represented as mathematical optimization models, finding wide applications in fields such as economics, engineering and manufacturing, transportation, and health care.
no code implementations • 21 Aug 2023 • Hao Chen, Weiwei Wan, Masaki Matsushita, Takeyuki Kotaka, Kensuke Harada
Accurate robotic manipulation of test tubes in biology and medical industries is becoming increasingly important to address workforce shortages and improve worker safety.
no code implementations • 20 Aug 2023 • Bowei Xu, Hao Chen, Zhan Ma
Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.
no code implementations • 19 Aug 2023 • Hao Chen, Haoran Zhou, Yongjian Deng
In this paper, we present an analytical framework and a novel metric to shed light on the interpretation of the multimodal vision community.
2 code implementations • 15 Aug 2023 • Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Enzhi Wang, Xiaohang Dong
This highlights its potential to augment the reasoning capabilities of LLMs.
1 code implementation • 13 Aug 2023 • Hanxi Li, Jianfei Hu, Bo Li, Hao Chen, Yongbin Zheng, Chunhua Shen
In this framework, the anomaly detection problem is solved via a cascade patch retrieval procedure that retrieves the nearest neighbors for each test image patch in a coarse-to-fine fashion.
Ranked #1 on Supervised Anomaly Detection on BTAD
1 code implementation • 13 Aug 2023 • Hongxiang Fan, Hao Chen, Liam Castelli, Zhiqiang Que, He Li, Kenneth Long, Wayne Luk
Bayesian Neural Networks (BayesNNs) have demonstrated their capability of providing calibrated prediction for safety-critical applications such as medical imaging and autonomous driving.
1 code implementation • ICCV 2023 • Muzhi Zhu, Hengtao Li, Hao Chen, Chengxiang Fan, Weian Mao, Chenchen Jing, Yifan Liu, Chunhua Shen
In this work, we propose a novel training mechanism termed SegPrompt that uses category information to improve the model's class-agnostic segmentation ability for both known and unknown categories.
1 code implementation • NeurIPS 2023 • Weijia Wu, Yuzhong Zhao, Hao Chen, YuChao Gu, Rui Zhao, Yefei He, Hong Zhou, Mike Zheng Shou, Chunhua Shen
To showcase the power of the proposed approach, we generate datasets with rich dense pixel-wise labels for a wide range of downstream tasks, including semantic segmentation, instance segmentation, and depth estimation.
no code implementations • ICCV 2023 • Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, Feng Zhao
3D scene reconstruction is a long-standing vision task.
no code implementations • 24 Jul 2023 • Chengming Hu, Yeqian Du, Rui Wang, Hao Chen
In this paper, we aim to clarify the relationships between Domain Generalization (DG) and the frequency components, and explore the spatial relationships of the phase spectrum.
1 code implementation • ICCV 2023 • Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen
The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS).
Ranked #2 on Video Instance Segmentation on Youtube-VIS 2022 Validation (using extra training data)
1 code implementation • 22 Jul 2023 • Qiaoyu Tan, Xin Zhang, Xiao Huang, Hao Chen, Jundong Li, Xia Hu
Graph neural networks (GNNs) have shown prominent performance on attributed network embedding.
no code implementations • 21 Jul 2023 • Qizhang Li, Yiwen Guo, Xiaochen Yang, WangMeng Zuo, Hao Chen
Our ICLR work advocated for enhancing transferability in adversarial examples by incorporating a Bayesian formulation into model parameters, which effectively emulates the ensemble of infinitely many deep neural networks, while, in this paper, we introduce a novel extension by incorporating the Bayesian formulation into the model input as well, enabling the joint diversification of both the model input and model parameters.
1 code implementation • ICCV 2023 • Wei Yin, Chi Zhang, Hao Chen, Zhipeng Cai, Gang Yu, Kaixuan Wang, Xiaozhi Chen, Chunhua Shen
State-of-the-art (SOTA) monocular metric depth estimation methods can only handle a single camera model and are unable to perform mixed-data training due to the metric ambiguity.
Ranked #19 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)
1 code implementation • 17 Jul 2023 • Shiye Lei, Hao Chen, Sen Zhang, Bo Zhao, DaCheng Tao
With the rapid development of Artificial Intelligence Generated Content (AIGC), it has become common practice in many learning tasks to train or fine-tune large models on synthetic data due to the data-scarcity and privacy leakage problems.
no code implementations • 17 Jul 2023 • Hao Chen, Yonghan Dong, Zheming Lu, Yunlong Yu, Yingming Li, Jungong Han, Zhongfei Zhang
Few-Shot Segmentation (FSS) aims to segment the novel class images with a few annotated samples.
1 code implementation • 10 Jul 2023 • Haoxuan Che, YuHan Cheng, Haibo Jin, Hao Chen
Diabetic Retinopathy (DR) is a common complication of diabetes and a leading cause of blindness worldwide.
1 code implementation • 6 Jul 2023 • Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang Wang, Yidong Wang, Wei Ye, Yue Zhang, Yi Chang, Philip S. Yu, Qiang Yang, Xing Xie
Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications.
no code implementations • 29 Jun 2023 • Hao Chen, Zhi Jin
Hence, in this work, we propose a novel residual recurrent multi-wavelet convolutional neural network R2-MWCNN learned in the frequency domain that can simultaneously increase the image contrast and reduce noise signals well.
1 code implementation • 28 Jun 2023 • Keyan Chen, Chenyang Liu, Hao Chen, Haotian Zhang, Wenyuan Li, Zhengxia Zou, Zhenwei Shi
We also propose several ongoing derivatives for instance segmentation tasks, drawing on recent advancements within the SAM community, and compare their performance with RSPrompter.
1 code implementation • 23 Jun 2023 • Zhizhong Chai, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen
To tackle this challenge, the literature on object detection has witnessed an increase of weakly-supervised and semi-supervised approaches, yet still lacks a unified framework that leverages various forms of fully-labeled, weakly-labeled, and unlabeled data.
no code implementations • 19 Jun 2023 • Huiming Li, Hao Chen, Xiangke Wang, Mengge Zhang, Lincheng Shen
This paper studies the bearing-based time-varying formation control problem for unicycle-type agents without bearing rigidity conditions.
1 code implementation • 16 Jun 2023 • Xinke Jiang, Dingyi Zhuang, Xianghui Zhang, Hao Chen, Jiayuan Luo, Xiaowei Gao
Understanding Origin-Destination (O-D) travel demand is crucial for transportation management.
no code implementations • 15 Jun 2023 • Jia-Xin Zhuang, Luyang Luo, Hao Chen
Masked autoencoder (MAE) is a promising self-supervised pre-training technique that can improve the representation learning of a neural network without human intervention.
1 code implementation • ICCV 2023 • Yingxue Xu, Hao Chen
Survival prediction is a complicated ordinal regression task that aims to predict the ranking risk of death, which generally benefits from the integration of histology and genomic data.
no code implementations • 8 Jun 2023 • Yuling Xi, Hao Chen, Ning Wang, Peng Wang, Yanning Zhang, Chunhua Shen, Yifan Liu
In particular, one feature merge branch is designed for instance-level recognition the other for dense predictions.
2 code implementations • 8 Jun 2023 • Yidong Wang, Zhuohao Yu, Zhengran Zeng, Linyi Yang, Cunxiang Wang, Hao Chen, Chaoya Jiang, Rui Xie, Jindong Wang, Xing Xie, Wei Ye, Shikun Zhang, Yue Zhang
To ensure the reliability of PandaLM, we collect a diverse human-annotated test dataset, where all contexts are generated by humans and labels are aligned with human preferences.
1 code implementation • 7 Jun 2023 • Kaijie Zhu, Jindong Wang, Jiaheng Zhou, Zichen Wang, Hao Chen, Yidong Wang, Linyi Yang, Wei Ye, Yue Zhang, Neil Zhenqiang Gong, Xing Xie
The increasing reliance on Large Language Models (LLMs) across academia and industry necessitates a comprehensive understanding of their robustness to prompts.
Cross-Lingual Paraphrase Identification Machine Translation +5
no code implementations • 6 Jun 2023 • Hanxi Li, Jingqi Wu, Hao Chen, Mingwen Wang, Chunhua Shen
Thus the sliding transformer can attain even higher accuracy with much less annotation labor.
Ranked #1 on Anomaly Detection on MVTec AD (Segmentation AUROC metric)
no code implementations • 4 Jun 2023 • Jintao Rong, Hao Chen, Tianxiao Chen, Linlin Ou, Xinyi Yu, Yifan Liu
Prompt learning has become a popular approach for adapting large vision-language models, such as CLIP, to downstream tasks.
no code implementations • 31 May 2023 • Sicen Liu, Xiaolong Wang, Xianbing Zhao, Hao Chen
However, most of them neglect incorporating domain knowledge according to the clinical manifestations in the EHR of the patient.
no code implementations • 31 May 2023 • Yanxiong Li, Hao Chen, Wenchang Cao, Qisheng Huang, Qianhua He
In the proposed embedding module, audio feature of each speech sample is split into several low-dimensional feature subsets that are transformed by a recurrent convolutional block in parallel.
no code implementations • 30 May 2023 • Hao Chen, Thomas Barthel
As suggested in [arXiv:2205. 15296] in the context of quantum many-body physics, computation costs can be further substantially reduced by imposing constraints on the canonical polyadic (CP) rank of the tensors in such networks.
no code implementations • 30 May 2023 • Yanwen Li, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen
To guide the segmentation branch to learn from richer high-resolution features, we propose a feature affinity module and a scale affinity module to enhance the multi-task learning of the dual branches.
1 code implementation • CVPR 2023 • Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, Peng Wang, Chunhua Shen
Compositional Zero-Shot Learning (CZSL) aims to train models to recognize novel compositional concepts based on learned concepts such as attribute-object combinations.
Ranked #1 on Compositional Zero-Shot Learning on MIT-States
no code implementations • 28 May 2023 • Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang
This is due to their utilization of unstructured pruning on LPMs, impeding the merging of LoRA weights, or their dependence on the gradients of pre-trained weights to guide pruning, which can impose significant memory overhead.
1 code implementation • 24 May 2023 • Hao Chen, Haotian Zhang, Keyan Chen, Chenyao Zhou, Song Chen, Zhengxia Zou, Zhenwei Shi
Toward continuous cross-resolution CD, we propose scale-invariant learning to enforce the model consistently predicting HR results given synthesized samples of varying resolution differences.
1 code implementation • 23 May 2023 • Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen
The effectiveness of the proposed method is verified on two program understanding tasks including code clone detection and code classification, and it outperforms current state-of-the-arts by large margins.
no code implementations • 22 May 2023 • Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj
In this paper, we introduce imprecise label learning (ILL), a framework for the unification of learning with various imprecise label configurations.
Ranked #1 on Learning with noisy labels on mini WebVision 1.0
1 code implementation • 22 May 2023 • Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, Chunhua Shen
In this work, we present Matcher, a novel perception paradigm that utilizes off-the-shelf vision foundation models to address various perception tasks.
no code implementations • 21 May 2023 • Yue Xu, Hao Chen, Zefan Wang, Jianwen Yin, Qijie Shen, Dimin Wang, Feiran Huang, Lixiang Lai, Tao Zhuang, Junfeng Ge, Xia Hu
Feed recommendation systems, which recommend a sequence of items for users to browse and interact with, have gained significant popularity in practical applications.
no code implementations • 21 May 2023 • Yue Xu, Qijie Shen, Jianwen Yin, Zengde Deng, Dimin Wang, Hao Chen, Lixiang Lai, Tao Zhuang, Junfeng Ge
Integrated recommendation, which aims at jointly recommending heterogeneous items from different channels in a main feed, has been widely applied to various online platforms.
no code implementations • 16 May 2023 • Hao Chen, Yiming Zhang, Qi Zhang, Hantao Yang, Xiaomeng Hu, Xuetao Ma, Yifan Yanggong, Junbo Zhao
Instruction tuning for large language models (LLMs) has gained attention from researchers due to its ability to unlock the potential of LLMs in following instructions.
1 code implementation • 14 May 2023 • Jianqi Chen, Hao Chen, Keyan Chen, Yilan Zhang, Zhengxia Zou, Zhenwei Shi
Many existing adversarial attacks generate $L_p$-norm perturbations on image RGB space.
no code implementations • 10 May 2023 • Yuyan Ruan, Dawei Yang, Ziqi Tang, An Ran Ran, Carol Y. Cheung, Hao Chen
The key difference between the proposed method and traditional RefSR models is that the textures used during inference are generated by the LTG instead of being searched from a single reference image.
1 code implementation • 8 May 2023 • Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xiaoyan Bai
This demonstrates the great potential of using prompt for unsupervised keyphrase extraction.
Ranked #1 on Keyphrase Extraction on NUS
no code implementations • 3 May 2023 • Tao Chen, Liang Lv, Di Wang, Jing Zhang, Yue Yang, Zeyang Zhao, Chen Wang, Xiaowei Guo, Hao Chen, Qingye Wang, Yufei Xu, Qiming Zhang, Bo Du, Liangpei Zhang, DaCheng Tao
With the world population rapidly increasing, transforming our agrifood systems to be more productive, efficient, safe, and sustainable is crucial to mitigate potential food shortages.
1 code implementation • 1 May 2023 • Yi Lin, Dong Zhang, Xiao Fang, Yufan Chen, Kwang-Ting Cheng, Hao Chen
Medical image segmentation is a fundamental task in the community of medical image analysis.
2 code implementations • NeurIPS 2023 • Qizhang Li, Yiwen Guo, WangMeng Zuo, Hao Chen
In particular, the proposed method, named intermediate-level perturbation decay (ILPD), encourages the intermediate-level perturbation to be in an effective adversarial direction and to possess a great magnitude simultaneously.
no code implementations • 19 Apr 2023 • Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu
As growing usage of social media websites in the recent decades, the amount of news articles spreading online rapidly, resulting in an unprecedented scale of potentially fraudulent information.
no code implementations • 19 Apr 2023 • Yang Yang, Weijie Ma, Hao Chen, Linlin Ou, Xinyi Yu
The combination of LiDAR and camera modalities is proven to be necessary and typical for 3D object detection according to recent studies.
1 code implementation • CVPR 2023 • Rui Li, Dong Gong, Wei Yin, Hao Chen, Yu Zhu, Kaixuan Wang, Xiaozhi Chen, Jinqiu Sun, Yanning Zhang
To let the geometric perception learned from multi-view cues in static areas propagate to the monocular representation in dynamic areas and let monocular cues enhance the representation of multi-view cost volume, we propose a cross-cue fusion (CCF) module, which includes the cross-cue attention (CCA) to encode the spatially non-local relative intra-relations from each source to enhance the representation of the other.
1 code implementation • 14 Apr 2023 • Zhipeng Deng, Luyang Luo, Hao Chen
Federated learning (FL) has been introduced to the healthcare domain as a decentralized learning paradigm that allows multiple parties to train a model collaboratively without privacy leakage.
no code implementations • 14 Apr 2023 • Jaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James Elder, Richard Bowden, Ali Anwar, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, Myungwoo Nam, Matteo Poggi, Xiaohua Qi, Jiahui Ren, Yang Tang, Fabio Tosi, Linh Trinh, S. M. Nadim Uddin, Khan Muhammad Umair, Kaixuan Wang, YuFei Wang, Yixing Wang, Mochu Xiang, Guangkai Xu, Wei Yin, Jun Yu, Qi Zhang, Chaoqiang Zhao
This paper discusses the results for the second edition of the Monocular Depth Estimation Challenge (MDEC).
no code implementations • 13 Apr 2023 • Luyang Luo, Xi Wang, Yi Lin, Xiaoqi Ma, Andong Tan, Ronald Chan, Varut Vardhanabhuti, Winnie CW Chu, Kwang-Ting Cheng, Hao Chen
Breast cancer has reached the highest incidence rate worldwide among all malignancies since 2020.
1 code implementation • CVPR 2023 • Hao Chen, Matt Gwilliam, Ser-Nam Lim, Abhinav Shrivastava
Such embedding largely limits the regression capacity and internal generalization for video interpolation.
Ranked #3 on Video Reconstruction on UVG
no code implementations • 5 Apr 2023 • Bo Qian, Hao Chen, Xiangning Wang, Haoxuan Che, Gitaek Kwon, Jaeyoung Kim, Sungjin Choi, Seoyoung Shin, Felix Krause, Markus Unterdechler, Junlin Hou, Rui Feng, Yihao Li, Mostafa El Habib Daho, Qiang Wu, Ping Zhang, Xiaokang Yang, Yiyu Cai, Weiping Jia, Huating Li, Bin Sheng
Computer-assisted automatic analysis of diabetic retinopathy (DR) is of great importance in reducing the risks of vision loss and even blindness.
1 code implementation • 4 Apr 2023 • Yidong Wang, Zhuohao Yu, Jindong Wang, Qiang Heng, Hao Chen, Wei Ye, Rui Xie, Xing Xie, Shikun Zhang
However, their performance on imbalanced dataset is relatively poor, where the distribution of classes in the training dataset is skewed, leading to poor performance in predicting minority classes.
no code implementations • 2 Apr 2023 • Yifeng Wang, Luyang Luo, Mingxiang Wu, Qiong Wang, Hao Chen
Learning segmentation networks from multi-source annotations remains a challenge due to the uncertainties brought by the variance of annotations and the quality of images.
1 code implementation • 1 Apr 2023 • Hao Chen, Chen Gong, Yizhe WANG, Xinwen Hou
This paper proposes the Recovery Triggered States (RTS) method, a novel approach that effectively protects the victim agents from backdoor attacks.
1 code implementation • 30 Mar 2023 • Wen Wang, Yan Jiang, Kangyang Xie, Zide Liu, Hao Chen, Yue Cao, Xinlong Wang, Chunhua Shen
Our vid2vid-zero leverages off-the-shelf image diffusion models, and doesn't require training on any video.
1 code implementation • 28 Mar 2023 • Hongyi Wang, Luyang Luo, Fang Wang, Ruofeng Tong, Yen-Wei Chen, Hongjie Hu, Lanfen Lin, Hao Chen
In ICMIL, we use category information in the bag-level classifier to guide the patch-level fine-tuning of the patch feature extractor.
1 code implementation • CVPR 2023 • Haoxuan Che, Siyu Chen, Hao Chen
Medical images usually suffer from image degradation in clinical practice, leading to decreased performance of deep learning-based models.
1 code implementation • CVPR 2023 • Hao Jiang, Rushan Zhang, Yanning Zhou, Yumeng Wang, Hao Chen
Cell instance segmentation in cytology images has significant importance for biology analysis and cancer screening, while remains challenging due to 1) the extensive overlapping translucent cell clusters that cause the ambiguous boundaries, and 2) the confusion of mimics and debris as nuclei.
1 code implementation • 24 Mar 2023 • Yi Lin, Yufan Chen, Kwang-Ting Cheng, Hao Chen
Our proposed network mines the correlations between the support image and query image, limiting them to focus only on useful foreground information and boosting the representation capacity of both the support prototype and query features.
no code implementations • 24 Mar 2023 • Junhao Dong, Junxi Chen, Xiaohua Xie, JianHuang Lai, Hao Chen
In this exposition, we present a comprehensive survey on recent advances in adversarial attack and defense for medical image analysis with a novel taxonomy in terms of the application scenario.
no code implementations • 24 Mar 2023 • Hao Chen, Linyan Li, Fan Lyu, Fuyuan Hu, Zhenping Xia, Fenglei Xu
Class-level graph network aims to mitigate the semantic conflict between prototype features of new classes and old classes.
no code implementations • CVPR 2023 • Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
Implicit neural representations (INR) have gained increasing attention in representing 3D scenes and images, and have been recently applied to encode videos (e. g., NeRV, E-NeRV).
no code implementations • 23 Mar 2023 • Yi Lin, Xiao Fang, Dong Zhang, Kwang-Ting Cheng, Hao Chen
Recently, the advent of vision Transformer (ViT) has brought substantial advancements in 3D dataset benchmarks, particularly in 3D volumetric medical image segmentation (Vol-MedSeg).
no code implementations • 23 Mar 2023 • Yi Lin, Zhongchen Zhao, Zhengjie ZHU, Lisheng Wang, Kwang-Ting Cheng, Hao Chen
Multiple instance learning (MIL) has emerged as a popular method for classifying histopathology whole slide images (WSIs).
no code implementations • 22 Mar 2023 • Cheng Jin, Zhengrui Guo, Yi Lin, Luyang Luo, Hao Chen
Thus, label-efficient deep learning methods are developed to make comprehensive use of the labeled data as well as the abundance of unlabeled and weak-labeled data.
no code implementations • 15 Mar 2023 • Zipeng Qi, Hao Chen, Chenyang Liu, Zhenwei Shi, Zhengxia Zou
In the first stage, we optimize a neural field to encode the color and 3D structure of the remote sensing scene based on multi-view images.
no code implementations • 12 Mar 2023 • Hao Chen, Zhe-Ming Lu, Jie Liu
This paper focuses on proposing a deep learning-based monkey swing counting algorithm.
no code implementations • ICCV 2023 • Hao Chen, Jiaze Wang, Kun Shao, Furui Liu, Jianye Hao, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng
Specifically, our Traj-MAE employs diverse masking strategies to pre-train the trajectory encoder and map encoder, allowing for the capture of social and temporal information among agents while leveraging the effect of environment from multiple granularities.