Search Results for author: Hao Chen

Found 440 papers, 170 papers with code

Enhanced Multi-Channel Graph Convolutional Network for Aspect Sentiment Triplet Extraction

1 code implementation • ACL 2022 • Hao Chen, Zepeng Zhai, Fangxiang Feng, Ruifan Li, Xiaojie Wang

Specifically, we first define ten types of relations for ASTE task, and then adopt a biaffine attention module to embed these relations as an adjacent tensor between words in a sentence.

Aspect Sentiment Triplet Extraction Relation +1

Paper
Code

Enhanced Representation with Contrastive Loss for Long-Tail Query Classification in e-commerce

no code implementations • ECNLP (ACL) 2022 • Lvxing Zhu, Hao Chen, Chao Wei, Weiru Zhang

To solve the above problem, we propose a novel method that leverages an auxiliary module to enhance the representations of long-tail queries by taking advantage of reliable supervised information of variant frequent queries.

Paper
Add Code

Reinforced Counterfactual Data Augmentation for Dual Sentiment Classification

1 code implementation • EMNLP 2021 • Hao Chen, Rui Xia, Jianfei Yu

Data augmentation and adversarial perturbation approaches have recently achieved promising results in solving the over-fitting problem in many natural language processing (NLP) tasks including sentiment classification.

Classification counterfactual +4

Paper
Code

MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis

no code implementations • 24 Apr 2024 • Jiaxin Zhuang, Linshan Wu, Qiong Wang, Varut Vardhanabhuti, Lin Luo, Hao Chen

We further scale up the MiM to large pre-training datasets with more than 10k volumes, showing that large-scale pre-training can further enhance the performance of downstream tasks.

Computed Tomography (CT) Representation Learning +2

Paper
Add Code

MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning

no code implementations • 23 Apr 2024 • Sunan He, Yuxiang Nie, Zhixuan Chen, Zhiyuan Cai, Hongmei Wang, Shu Yang, Hao Chen

The rapid advancement of large-scale vision-language models has showcased remarkable capabilities across various tasks.

Medical Diagnosis Medical Report Generation +3

Paper
Add Code

Intrusion Detection at Scale with the Assistance of a Command-line Language Model

no code implementations • 20 Apr 2024 • Jiongliang Lin, Yiwen Guo, Hao Chen

Intrusion detection is a long standing and crucial problem in security.

Intrusion Detection Language Modelling +1

Paper
Add Code

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

1 code implementation • Under review for Transaction 2024 • Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen

Our method benefits various applications including in-the-wild metrology monocular-SLAM, and 3D reconstruction, which highlight the versatility of Metric3D v2 models as geometric foundation models.

Ranked #1 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

3D Reconstruction Monocular Depth Estimation +3

648

Paper
Code

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

no code implementations • 15 Apr 2024 • Fangwei Zhong, Kui Wu, Hai Ci, Churan Wang, Hao Chen

We evaluate our tracker on several high-fidelity environments with challenging situations, such as distraction and occlusion.

Offline RL Q-Learning +2

Paper
Add Code

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

2 code implementations • 10 Apr 2024 • Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen

Finally, for MC3D-Det joint training, the elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.

3D Object Detection Autonomous Driving +1

159

Paper
Code

Concept-Attention Whitening for Interpretable Skin Lesion Diagnosis

no code implementations • 9 Apr 2024 • Junlin Hou, Jilan Xu, Hao Chen

In the former branch, we train the CNN with a CAW layer inserted to perform skin lesion diagnosis.

Concept Alignment Explainable artificial intelligence +1

Paper
Add Code

QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis

no code implementations • 8 Apr 2024 • Junlin Hou, Jilan Xu, Rui Feng, Hao Chen

Previous noise learning methods mainly considered noise arising from images being mislabeled, i. e. label noise, assuming that all mislabeled images are of high image quality.

Paper
Add Code

MedIAnomaly: A comparative study of anomaly detection in medical images

1 code implementation • 6 Apr 2024 • Yu Cai, Weiwen Zhang, Hao Chen, Kwang-Ting Cheng

Anomaly detection (AD) aims at detecting abnormal samples that deviate from the expected normal patterns.

Anomaly Classification Anomaly Detection +2

Paper
Code

Rethinking Self-training for Semi-supervised Landmark Detection: A Selection-free Approach

1 code implementation • 6 Apr 2024 • Haibo Jin, Haoxuan Che, Hao Chen

Self-training is a simple yet effective method for semi-supervised learning, during which pseudo-label selection plays an important role for handling confirmation bias.

Pseudo Label regression

Paper
Code

Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions

1 code implementation • 4 Apr 2024 • Yuting He, Fuxiang Huang, Xinrui Jiang, Yuxiang Nie, Minghao Wang, Jiguang Wang, Hao Chen

To answer these questions, a comprehensive and deep survey of the challenges, opportunities, and future directions of HFMs is presented in this survey.

Paper
Code

RS-Mamba for Large Remote Sensing Image Dense Prediction

1 code implementation • 3 Apr 2024 • Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, Wanli Ouyang

RSM is specifically designed to capture the global context of remote sensing images with linear complexity, facilitating the effective processing of large VHR images.

Ranked #1 on Road Segmentation on Massachusetts Roads Dataset (F1 metric)

Building change detection for remote sensing images Change Detection +1

126

Paper
Code

Cohort-Individual Cooperative Learning for Multimodal Cancer Survival Analysis

no code implementations • 3 Apr 2024 • Huajun Zhou, Fengtao Zhou, Hao Chen

In this paper, we propose a Cohort-individual Cooperative Learning (CCL) framework to advance cancer survival analysis by collaborating knowledge decomposition and cohort guidance.

Survival Analysis

Paper
Add Code

iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer

1 code implementation • 1 Apr 2024 • Fengtao Zhou, Yingxue Xu, Yanfen Cui, Shenyan Zhang, Yun Zhu, Weiyang He, Jiguang Wang, Xin Wang, Ronald Chan, Louis Ho Shing Lau, Chu Han, Dafu Zhang, Zhenhui Li, Hao Chen

The limited availability of modalities for each patient would cause information loss, adversely affecting predictive accuracy.

Data Integration Survival Analysis

Paper
Code

360+x: A Panoptic Multi-modal Scene Understanding Dataset

no code implementations • 1 Apr 2024 • Hao Chen, Yuqi Hou, Chenyuan Qu, Irene Testini, Xiaohan Hong, Jianbo Jiao

While many existing datasets focus on scene understanding from a certain perspective (e. g. egocentric or third-person views), our dataset offers a panoptic perspective (i. e. multiple viewpoints with multiple data modalities).

Scene Understanding

Paper
Add Code

Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training

no code implementations • 30 Mar 2024 • Tongkun Su, Jun Li, Xi Zhang, Haibo Jin, Hao Chen, Qiong Wang, Faqin Lv, Baoliang Zhao, Yin Hu

In this work, we leverage descriptions in medical reports to design multi-granular question-answer pairs associated with different diseases, which assist the framework in pre-training without requiring extra annotations from experts.

Contrastive Learning Question Answering +1

Paper
Add Code

Dia-LLaMA: Towards Large Language Model-driven CT Report Generation

no code implementations • 25 Mar 2024 • Zhixuan Chen, Luyang Luo, Yequan Bie, Hao Chen

Medical report generation has achieved remarkable advancements yet has still been faced with several challenges.

Language Modelling Large Language Model +2

Paper
Add Code

AC4: Algebraic Computation Checker for Circuit Constraints in ZKPs

no code implementations • 23 Mar 2024 • Hao Chen, Minyu Chen, Ruibang Liu, Guoqiang Li

ZKP systems have surged attention and held a fundamental role in contemporary cryptography.

Paper
Add Code

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

no code implementations • 22 Mar 2024 • Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, Hao Chen, Kaixuan Wang, Gang Yu, Chunhua Shen, Shaojie Shen

For metric depth estimation, we show that the key to a zero-shot single-view model lies in resolving the metric ambiguity from various camera models and large-scale data training.

Paper
Add Code

Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation

1 code implementation • 20 Mar 2024 • Linshan Wu, Zhun Zhong, Jiayi Ma, Yunchao Wei, Hao Chen, Leyuan Fang, Shutao Li

Based on the label distributions, we leverage the GMM to generate high-quality pseudo labels for more reliable supervision.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Paper
Code

Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification

no code implementations • 19 Mar 2024 • Yi Lin, Zhengjie ZHU, Kwang-Ting Cheng, Hao Chen

To address this issue, we propose PAMT, a novel Prompt-guided Adaptive Model Transformation framework that enhances MIL classification performance by seamlessly adapting pre-trained models to the specific characteristics of histopathology data.

Image Classification Multiple Instance Learning +1

Paper
Add Code

Advancing COVID-19 Detection in 3D CT Scans

no code implementations • 18 Mar 2024 • Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model.

Paper
Add Code

Domain Adaptation Using Pseudo Labels for COVID-19 Detection

no code implementations • 18 Mar 2024 • Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans.

COVID-19 Diagnosis Domain Adaptation +1

Paper
Add Code

Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

no code implementations • 17 Mar 2024 • Kangyang Xie, BinBin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen

Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks.

Image Generation

Paper
Add Code

3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models

no code implementations • 17 Mar 2024 • Yongtao Ge, Wenjia Wang, Yongfan Chen, Hao Chen, Chunhua Shen

In this work, we show that synthetic data created by generative models is complementary to computer graphics (CG) rendered data for achieving remarkable generalization performance on diverse real-world scenes for 3D human pose and shape estimation (HPS).

3D human pose and shape estimation 3D Human Reconstruction

Paper
Add Code

Self-Supervised Video Desmoking for Laparoscopic Surgery

1 code implementation • 17 Mar 2024 • Renlong Wu, Zhilu Zhang, Shuohao Zhang, Longfei Gou, Haobin Chen, Lei Zhang, Hao Chen, WangMeng Zuo

On the other hand, in order to enhance the desmoking performance, we further feed the valuable information from PS frame into models, where a masking strategy and a regularization term are presented to avoid trivial solutions.

Paper
Code

Histo-Genomic Knowledge Distillation For Cancer Prognosis From Histopathology Whole Slide Images

1 code implementation • 15 Mar 2024 • Zhikang Wang, Yumeng Zhang, Yingxue Xu, Seiya Imoto, Hao Chen, Jiangning Song

G-HANet is expected to be explored as a useful tool by the research community to address the current bottleneck of insufficient histo-genomic data pairing in the context of cancer prognosis and precision oncology.

Benchmarking Knowledge Distillation +1

Paper
Code

CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning

1 code implementation • 15 Mar 2024 • Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

Large pre-trained VLMs like CLIP have demonstrated superior zero-shot recognition ability, and a number of recent studies leverage this ability to mitigate catastrophic forgetting in CL, but they focus on closed-set CL in a single domain dataset.

Class Incremental Learning Incremental Learning +1

Paper
Code

XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization

no code implementations • 14 Mar 2024 • Yequan Bie, Luyang Luo, Zhixuan Chen, Hao Chen

Utilizing potent representations of the large vision-language models (VLMs) to accomplish various downstream tasks has attracted increasing attention.

Explainable artificial intelligence Explainable Artificial Intelligence (XAI) +1

Paper
Add Code

Rethinking Autoencoders for Medical Anomaly Detection from A Theoretical Perspective

no code implementations • 14 Mar 2024 • Yu Cai, Hao Chen, Kwang-Ting Cheng

To the best of our knowledge, this is the first effort to theoretically clarify the principles and design philosophy of AE for anomaly detection.

Anomaly Detection Philosophy

Paper
Add Code

Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification

no code implementations • 13 Mar 2024 • Shuhan LI, Yi Lin, Hao Chen, Kwang-Ting Cheng

In this paper, we introduce an Iterative Online Image Synthesis (IOIS) framework to address the class imbalance problem in medical image classification.

Image Classification Image Generation +3

Paper
Add Code

MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology

1 code implementation • 11 Mar 2024 • Shu Yang, Yihui Wang, Hao Chen

Multiple Instance Learning (MIL) has emerged as a dominant paradigm to extract discriminative feature representations within Whole Slide Images (WSIs) in computational pathology.

Multiple Instance Learning whole slide images

Paper
Code

Learning with Noisy Foundation Models

no code implementations • 11 Mar 2024 • Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning.

Paper
Add Code

Diffusion Models Trained with Large Data Are Transferable Visual Models

no code implementations • 10 Mar 2024 • Guangkai Xu, Yongtao Ge, MingYu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shen

We show that, simply initializing image understanding models using a pre-trained UNet (or transformer) of diffusion models, it is possible to achieve remarkable transferable performance on fundamental vision perception tasks using a moderate amount of target data (even synthetic data only), including monocular depth, surface normal, image segmentation, matting, human pose estimation, among virtually many others.

Image Matting Image Segmentation +2

Paper
Add Code

GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

2 code implementations • 9 Mar 2024 • Hao Lu, Xuesong Niu, Jiyao Wang, Yin Wang, Qingyong Hu, Jiaqi Tang, Yuting Zhang, Kaishen Yuan, Bin Huang, Zitong Yu, Dengbo He, Shuiguang Deng, Hao Chen, Yingcong Chen, Shiguang Shan

In conclusion, this paper provides valuable insights into the potential applications and challenges of MLLMs in human-centric computing.

Emotion Recognition Facial Action Unit Detection +4

Paper
Code

HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction

1 code implementation • 8 Mar 2024 • Zhengrui Guo, Jiabo Ma, Yingxue Xu, Yihui Wang, Liansheng Wang, Hao Chen

Histopathology serves as the gold standard in cancer diagnosis, with clinical reports being vital in interpreting and understanding this process, guiding cancer treatment and patient care.

Ranked #1 on Medical Report Generation on HistGen WSI-Report Dataset

Medical Report Generation Multiple Instance Learning +3

Paper
Code

$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

2 code implementations • 7 Mar 2024 • Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazak, Hao Chen, Xiaonan Huang, Bhiksha Raj

Referring perception, which aims at grounding visual objects with multimodal referring guidance, is essential for bridging the gap between humans, who provide instructions, and the environment where intelligent systems perceive.

Benchmarking

Paper
Code

MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition

1 code implementation • 6 Mar 2024 • Yufan Chen, Ching Ting Leung, Yong Huang, Jianwei Sun, Hao Chen, Hanyu Gao

In addition, it employs a series of novel augmentation algorithms to significantly enhance the robustness and performance of the model.

Data Augmentation

Paper
Code

PI-AstroDeconv: A Physics-Informed Unsupervised Learning Method for Astronomical Image Deconvolution

no code implementations • 4 Mar 2024 • Shulei Ni, Yisheng Qiu, YunChun Chen, Zihao Song, Hao Chen, Xuejian Jiang, Huaxi Chen

In the imaging process of an astronomical telescope, the deconvolution of its beam or Point Spread Function (PSF) is a crucial task.

Image Deconvolution

Paper
Add Code

Boosting Box-supervised Instance Segmentation with Pseudo Depth

no code implementations • 2 Mar 2024 • Xinyi Yu, Ling Yan, PengTao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ou

This innovative approach empowers the network to simultaneously predict masks and depth, enhancing its ability to capture nuanced depth-related information during the instance segmentation process.

Box-supervised Instance Segmentation Depth Estimation +4

Paper
Add Code

Data-efficient Event Camera Pre-training via Disentangled Masked Modeling

no code implementations • 1 Mar 2024 • Zhenpeng Huang, Chao Li, Hao Chen, Yongjian Deng, Yifeng Geng, LiMin Wang

Our pre-training overcomes the limitations of previous methods, which either sacrifice temporal information by converting event sequences into 2D images for utilizing pre-trained image models or directly employ paired image data for knowledge distillation to enhance the learning of event streams.

Knowledge Distillation Self-Supervised Learning

Paper
Add Code

Anatomy-guided fiber trajectory distribution estimation for cranial nerves tractography

no code implementations • 29 Feb 2024 • Lei Xie, Qingrun Zeng, Huajun Zhou, Guoqiang Xie, Mingchu Li, Jiahao Huang, Jianan Cui, Hao Chen, Yuanjing Feng

Diffusion MRI tractography is an important tool for identifying and analyzing the intracranial course of cranial nerves (CNs).

Anatomy

Paper
Add Code

A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation

1 code implementation • 29 Feb 2024 • Hanxi Li, Zhengxun Zhang, Hao Chen, Lin Wu, Bo Li, Deyin Liu, Mingwen Wang

Effectively addressing the challenge of industrial Anomaly Detection (AD) necessitates an ample supply of defective samples, a constraint often hindered by their scarcity in industrial contexts.

Anomaly Detection Image Generation

Paper
Code

VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

1 code implementation • 27 Feb 2024 • Linshan Wu, Jiaxin Zhuang, Hao Chen

Through this pretext task, VoCo implicitly encodes the contextual position priors into model representations without the guidance of annotations, enabling us to effectively improve the performance of downstream tasks that require high-level semantics.

Contrastive Learning Position +1

Paper
Code

Structure Guided Large Language Model for SQL Generation

no code implementations • 19 Feb 2024 • Qinggang Zhang, Junnan Dong, Hao Chen, Wentao Li, Feiran Huang, Xiao Huang

Existing models typically input queries and database schemas into the LLM and rely on the LLM to perform semantic-structure matching and generate structured SQL.

Language Modelling Large Language Model

Paper
Add Code

Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM

no code implementations • 18 Feb 2024 • Zijin Hong, Zheng Yuan, Hao Chen, Qinggang Zhang, Feiran Huang, Xiao Huang

Generating accurate SQL for user queries (text-to-SQL) is a long-standing problem since the generation of the SQL requires comprehending the query and database and retrieving the accurate data from the database accordingly.

Text-To-SQL

Paper
Add Code

Large Language Model Interaction Simulator for Cold-Start Item Recommendation

no code implementations • 14 Feb 2024 • Feiran Huang, Zhenghang Yang, Junyi Jiang, Yuanchen Bei, Yijie Zhang, Hao Chen

To address this challenge, we propose an LLM Interaction Simulator (LLM-InS) to model users' behavior patterns based on the content aspect.

Collaborative Filtering Language Modelling +2

Paper
Add Code

Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge

no code implementations • 14 Feb 2024 • Jiancheng Yang, Rui Shi, Liang Jin, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, PengFei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni

The resulting FracNet+ demonstrates competitive performance in rib fracture detection, which lays a foundation for further research and development in AI-assisted rib fracture detection and diagnosis.

Instance Segmentation Semantic Segmentation

Paper
Add Code

Multi-Behavior Collaborative Filtering with Partial Order Graph Convolutional Networks

no code implementations • 12 Feb 2024 • Yijie Zhang, Yuanchen Bei, Hao Chen, Qijie Shen, Zheng Yuan, Huan Gong, Senzhang Wang, Feiran Huang, Xiao Huang

POG defines the partial order relation of multiple behaviors and models behavior combinations as weighted edges to merge separate behavior graphs into a joint POG.

Collaborative Filtering Recommendation Systems

Paper
Add Code

Physics-Informed Neural Networks with Hard Linear Equality Constraints

1 code implementation • 11 Feb 2024 • Hao Chen, Gonzalo E. Constante Flores, Can Li

The incorporation of physics into neural networks can improve generalization and data efficiency.

Paper
Code

UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing

no code implementations • 4 Feb 2024 • Yifeng He, Jiabo Huang, Yuyang Rong, Yiwen Guo, Ethan Wang, Hao Chen

The remarkable capability of large language models (LLMs) in generating high-quality code has drawn increasing attention in the software testing community.

Paper
Add Code

ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast

1 code implementation • 2 Feb 2024 • Wanghan Xu, Kang Chen, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai

Data-driven weather forecast based on machine learning (ML) has experienced rapid development and demonstrated superior performance in the global medium-range forecast compared to traditional physics-based dynamical models.

Value prediction

Paper
Code

On Catastrophic Inheritance of Large Foundation Models

no code implementations • 2 Feb 2024 • Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang

Large foundation models (LFMs) are claiming incredible performances.

Paper
Add Code

A General Framework for Learning from Weak Supervision

1 code implementation • 2 Feb 2024 • Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment.

Weakly-supervised Learning

Paper
Code

Uncertainty-Aware Explainable Recommendation with Large Language Models

no code implementations • 31 Jan 2024 • Yicui Peng, Hao Chen, ChingSheng Lin, Guo Huang, Jinrong Hu, Hui Guo, Bin Kong, Shu Hu, Xi Wu, Xin Wang

Providing explanations within the recommendation system would boost user satisfaction and foster trust, especially by elaborating on the reasons for selecting recommended items tailored to the user.

Explainable Recommendation Multi-Task Learning

Paper
Add Code

Time Series Supplier Allocation via Deep Black-Litterman Model

1 code implementation • 30 Jan 2024 • Jiayuan Luo, Wentao Zhang, Yuchen Fang, Xiaowei Gao, Dingyi Zhuang, Hao Chen, Xinke Jiang

Time Series Supplier Allocation (TSSA) poses a complex NP-hard challenge, aimed at refining future order dispatching strategies to satisfy order demands with maximum supply efficiency fully.

Navigate Time Series

Paper
Code

Deep Joint Source-Channel Coding for Efficient and Reliable Cross-Technology Communication

no code implementations • 26 Jan 2024 • Shumin Yao, Xiaodong Xu, Hao Chen, Yaping Sun, Qinglin Zhao

Cross-technology communication (CTC) is a promising technique that enables direct communications among incompatible wireless technologies without needing hardware modification.

Paper
Add Code

Macro Graph Neural Networks for Online Billion-Scale Recommender Systems

1 code implementation • 26 Jan 2024 • Hao Chen, Yuanchen Bei, Qijie Shen, Yue Xu, Sheng Zhou, Wenbing Huang, Feiran Huang, Senzhang Wang, Xiao Huang

Predicting Click-Through Rate (CTR) in billion-scale recommender systems poses a long-standing challenge for Graph Neural Networks (GNNs) due to the overwhelming computational complexity involved in aggregating billions of neighbors.

Recommendation Systems

Paper
Code

Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method

no code implementations • 22 Jan 2024 • Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi

In this paper, we extend meteorological downscaling to arbitrary scattered station scales, establish a brand new benchmark and dataset, and retrieve meteorological states at any given station location from a coarse-resolution meteorological field.

Super-Resolution Weather Forecasting

Paper
Add Code

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning

no code implementations • 22 Jan 2024 • Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng

Sign language recognition (SLR) plays a vital role in facilitating communication for the hearing-impaired community.

Contrastive Learning Language Modelling +3

Paper
Add Code

Codebook-enabled Generative End-to-end Semantic Communication Powered by Transformer

no code implementations • 22 Jan 2024 • PeiGen Ye, Yaping Sun, Shumin Yao, Hao Chen, Xiaodong Xu, Shuguang Cui

Codebook-based generative semantic communication attracts increasing attention, since only indices are required to be transmitted when the codebook is shared between transmitter and receiver.

Image Generation

Paper
Add Code

Medical Image Debiasing by Learning Adaptive Agreement from a Biased Council

no code implementations • 22 Jan 2024 • Luyang Luo, Xin Huang, Minghao Wang, Zhuoyue Wan, Hao Chen

Specifically, the debiasing model is required to learn adaptive agreement with the biased council by agreeing on the correctly predicted samples and disagreeing on the wrongly predicted samples by the biased council.

Attribute Image Classification +1

Paper
Add Code

MDGNN: Multi-Relational Dynamic Graph Neural Network for Comprehensive and Dynamic Stock Investment Prediction

no code implementations • 19 Jan 2024 • Hao Qian, Hongting Zhou, Qian Zhao, Hao Chen, Hongxiang Yao, Jingwei Wang, Ziqi Liu, Fei Yu, Zhiqiang Zhang, Jun Zhou

The stock market is a crucial component of the financial system, but predicting the movement of stock prices is challenging due to the dynamic and intricate relations arising from various aspects such as economic indicators, financial reports, global news, and investor sentiment.

Paper
Add Code

Distributed Task-Oriented Communication Networks with Multimodal Semantic Relay and Edge Intelligence

no code implementations • 18 Jan 2024 • Jie Guo, Hao Chen, Bin Song, Yuhao Chi, Chau Yuen, Fei Richard Yu, Geoffrey Ye Li, Dusit Niyato

In this article, we present a novel framework, named distributed task-oriented communication networks (DTCN), based on recent advances in multimodal semantic transmission and edge intelligence.

Paper
Add Code

Learning to detect cloud and snow in remote sensing images from noisy labels

no code implementations • 17 Jan 2024 • Zili Liu, Hao Chen, Wenyuan Li, Keyan Chen, Zipeng Qi, Chenyang Liu, Zhengxia Zou, Zhenwei Shi

This paper is the first to consider the impact of label noise on the detection of clouds and snow in remote sensing images.

Semantic Segmentation

Paper
Add Code

MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment

1 code implementation • 16 Jan 2024 • Yequan Bie, Luyang Luo, Hao Chen

Black-box deep learning approaches have showcased significant potential in the realm of medical image analysis.

Concept Alignment Explainable artificial intelligence +1

Paper
Code

TAROT: A Hierarchical Framework with Multitask Co-Pretraining on Semi-Structured Data towards Effective Person-Job Fit

no code implementations • 15 Jan 2024 • Yihan Cao, Xu Chen, Lun Du, Hao Chen, Qiang Fu, Shi Han, Yushu Du, Yanbin Kang, Guangming Lu, Zi Li

Person-job fit is an essential part of online recruitment platforms in serving various downstream applications like Job Search and Candidate Recommendation.

Paper
Add Code

BoNuS: Boundary Mining for Nuclei Segmentation with Partial Point Labels

1 code implementation • 15 Jan 2024 • Yi Lin, Zeyu Wang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

To alleviate this problem, in this paper, we propose a weakly-supervised nuclei segmentation method that only requires partial point labels of nuclei.

Multiple Instance Learning Segmentation

Paper
Code

RHOBIN Challenge: Reconstruction of Human Object Interaction

no code implementations • 7 Jan 2024 • Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

Modeling the interaction between humans and objects has been an emerging research direction in recent years.

3D Reconstruction Human-Object Interaction Detection +3

Paper
Add Code

DeepPhysiNet: Bridging Deep Learning and Atmospheric Physics for Accurate and Continuous Weather Modeling

1 code implementation • 4 Jan 2024 • Wenyuan Li, Zili Liu, Keyan Chen, Hao Chen, Shunlin Liang, Zhengxia Zou, Zhenwei Shi

Next, we construct hyper-networks based on deep learning methods to directly learn weather patterns from a large amount of meteorological data.

Weather Forecasting

Paper
Code

Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction

1 code implementation • 3 Jan 2024 • Yilan Zhang, Yingxue Xu, Jianqi Chen, Fengying Xie, Hao Chen

Despite advantages of multimodal learning for cancer survival prediction, massive redundancy in multimodal data prevents it from extracting discriminative and compact information: (1) An extensive amount of intra-modal task-unrelated information blurs discriminability, especially for gigapixel whole slide images (WSIs) with many patches in pathology and thousands of pathways in genomic data, leading to an ``intra-modal redundancy" issue.

Disentanglement Survival Prediction +1

Paper
Code

MOC-RVQ: Multilevel Codebook-assisted Digital Generative Semantic Communication

no code implementations • 2 Jan 2024 • Yingbin Zhou, Yaping Sun, GuanYing Chen, Xiaodong Xu, Hao Chen, Binhong Huang, Shuguang Cui, Ping Zhang

Vector quantization-based image semantic communication systems have successfully boosted transmission efficiency, but face a challenge with conflicting requirements between codebook design and digital constellation modulation.

Quantization

Paper
Add Code

Distance Guided Generative Adversarial Network for Explainable Binary Classifications

1 code implementation • 29 Dec 2023 • Xiangyu Xiong, Yue Sun, Xiaohong Liu, Wei Ke, Chan-Tong Lam, Jiangang Chen, Mingfeng Jiang, Mingwei Wang, Hui Xie, Tong Tong, Qinquan Gao, Hao Chen, Tao Tan

Experimental results show that DisGAN consistently outperforms the GAN-based augmentation methods with explainable binary classification.

Binary Classification Classification +3

Paper
Code

Multimodal Sentiment Analysis with Missing Modality: A Knowledge-Transfer Approach

no code implementations • 28 Dec 2023 • Weide Liu, Huijing Zhan, Hao Chen, Fengmao Lv

Multimodal sentiment analysis aims to identify the emotions expressed by individuals through visual, language, and acoustic cues.

Multimodal Sentiment Analysis Transfer Learning

Paper
Add Code

Mobility and Cost Aware Inference Accelerating Algorithm for Edge Intelligence

no code implementations • 27 Dec 2023 • Xin Yuan, Ning li, Kang Wei, Wenchao Xu, Quan Chen, Hao Chen, Song Guo

The model segmentation without user mobility has been investigated deeply by previous works.

Segmentation

Paper
Add Code

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

2 code implementations • 26 Dec 2023 • Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin

Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL.

Decision Making Offline RL +2

Paper
Code

Mutual Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand Ride Pooling

no code implementations • 23 Dec 2023 • Xianjie Zhang, Jiahao Sun, Chen Gong, Kai Wang, Yifei Cao, Hao Chen, Yu Liu

The emergence of on-demand ride pooling services allows each vehicle to serve multiple passengers at a time, thus increasing drivers' income and enabling passengers to travel at lower prices than taxi/car on-demand services (only one passenger can be assigned to a car at a time like UberX and Lyft).

Reinforcement Learning (RL)

Paper
Add Code

Time Travelling Pixels: Bitemporal Features Integration with Foundation Model for Remote Sensing Image Change Detection

2 code implementations • 23 Dec 2023 • Keyan Chen, Chengyang Liu, Wenyuan Li, Zili Liu, Hao Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi

Change detection, a prominent research area in remote sensing, is pivotal in observing and analyzing surface transformations.

Ranked #6 on Change Detection on LEVIR-CD

Change Detection General Knowledge +1

379

Paper
Code

Professional Network Matters: Connections Empower Person-Job Fit

no code implementations • 19 Dec 2023 • Hao Chen, Lun Du, Yuxuan Lu, Qiang Fu, Xu Chen, Shi Han, Yanbin Kang, Guangming Lu, Zi Li

Online recruitment platforms typically employ Person-Job Fit models in the core service that automatically match suitable job seekers with appropriate job positions.

Paper
Add Code

Towards an end-to-end artificial intelligence driven global weather forecasting system

no code implementations • 18 Dec 2023 • Kun Chen, Lei Bai, Fenghua Ling, Peng Ye, Tao Chen, Jing-Jia Luo, Hao Chen, Yi Xiao, Kang Chen, Tao Han, Wanli Ouyang

Initial states are typically generated by traditional data assimilation components, which are computational expensive and time-consuming.

Weather Forecasting

Paper
Add Code

Model-Free Change Point Detection for Mixing Processes

no code implementations • 14 Dec 2023 • Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

In particular, we provide performance guarantees for the MMD-CUSUM test under $\alpha$, $\beta$, and $\phi$-mixing processes, which significantly expands its utility beyond the i. i. d.

Change Point Detection

Paper
Add Code

PromptBench: A Unified Library for Evaluation of Large Language Models

1 code implementation • 13 Dec 2023 • Kaijie Zhu, Qinlin Zhao, Hao Chen, Jindong Wang, Xing Xie

The evaluation of large language models (LLMs) is crucial to assess their performance and mitigate potential security risks.

Prompt Engineering

1,974

Paper
Code

KnowGPT: Knowledge Injection for Large Language Models

no code implementations • 11 Dec 2023 • Qinggang Zhang, Junnan Dong, Hao Chen, Daochen Zha, Zailiang Yu, Xiao Huang

Generative Large Language Models (LLMs), such as ChatGPT, offer interactive APIs that can answer common questions at a human-expert level.

Knowledge Graphs Question Answering +1

Paper
Add Code

Reinforcement Neighborhood Selection for Unsupervised Graph Anomaly Detection

no code implementations • 9 Dec 2023 • Yuanchen Bei, Sheng Zhou, Qiaoyu Tan, Hao Xu, Hao Chen, Zhao Li, Jiajun Bu

To address these issues, we utilize the advantages of reinforcement learning in adaptively learning in complex environments and propose a novel method that incorporates Reinforcement neighborhood selection for unsupervised graph ANomaly Detection (RAND).

Graph Anomaly Detection Representation Learning

Paper
Add Code

Shapley Values-enabled Progressive Pseudo Bag Augmentation for Whole Slide Image Classification

no code implementations • 9 Dec 2023 • Renao Yan, Qiehe Sun, Cheng Jin, Yiqing Liu, Yonghong He, Tian Guan, Hao Chen

While most of the conventional MIL methods use attention scores to estimate instance importance scores (IIS) which contribute to the prediction of the slide labels, these often lead to skewed attention distributions and inaccuracies in identifying crucial instances.

Image Classification Multiple Instance Learning

Paper
Add Code

GenDeF: Learning Generative Deformation Field for Video Generation

no code implementations • 7 Dec 2023 • Wen Wang, Kecheng Zheng, Qiuyu Wang, Hao Chen, Zifan Shi, Ceyuan Yang, Yujun Shen, Chunhua Shen

We offer a new perspective on approaching the task of video generation.

Disentanglement Video Editing +3

Paper
Add Code

Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Bag-Level Classifier is a Good Instance-Level Teacher

1 code implementation • 2 Dec 2023 • Hongyi Wang, Luyang Luo, Fang Wang, Ruofeng Tong, Yen-Wei Chen, Hongjie Hu, Lanfen Lin, Hao Chen

Based on this idea, we design Iteratively Coupled Multiple Instance Learning (ICMIL) to couple the embedder and the bag classifier at a low cost.

Image Classification Multiple Instance Learning

Paper
Code

Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels

no code implementations • 30 Nov 2023 • Xiangyu Gao, Yaping Sun, Dongyu Wei, Xiaodong Xu, Hao Chen, Hao Yin, Shuguang Cui

In this context, we address the problem of efficient remote object recognition by optimizing feature transmission between mobile devices and edge servers.

Autonomous Vehicles Decision Making +2

Paper
Add Code

HumanRecon: Neural Reconstruction of Dynamic Human Using Geometric Cues and Physical Priors

1 code implementation • 26 Nov 2023 • Junhui Yin, Wei Yin, Hao Chen, Xuqian Ren, Zhanyu Ma, Jun Guo, Yifan Liu

These priors ensure the color rendered along rays to be robust to view direction and reduce the inherent ambiguities of density estimated along rays.

Novel View Synthesis

Paper
Code

Hybrid Precoding and Combining for mmWave Full-Duplex Joint Radar and Communication Systems under Self-Interference

no code implementations • 25 Nov 2023 • Murat Bayraktar, Nuria González-Prelcic, Hao Chen

Specifically, we introduce a generalized eigenvalue-based precoder design that considers the downlink user rate, the radar gain, and the SI suppression.

Paper
Add Code

A Parameterized Generative Adversarial Network Using Cyclic Projection for Explainable Medical Image Classification

no code implementations • 24 Nov 2023 • Xiangyu Xiong, Yue Sun, Xiaohong Liu, Chan-Tong Lam, Tong Tong, Hao Chen, Qinquan Gao, Wei Ke, Tao Tan

Although current data augmentation methods are successful to alleviate the data insufficiency, conventional augmentation are primarily intra-domain while advanced generative adversarial networks (GANs) generate images remaining uncertain, particularly in small-scale datasets.

Data Augmentation Generative Adversarial Network +2

Paper
Add Code

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

1 code implementation • 21 Nov 2023 • Yunpeng Huang, Jingwei Xu, Junyu Lai, Zixu Jiang, Taolue Chen, Zenan Li, Yuan YAO, Xiaoxing Ma, Lijuan Yang, Hao Chen, Shupeng Li, Penghao Zhao

Transformer-based Large Language Models (LLMs) have been applied in diverse areas such as knowledge bases, human interfaces, and dynamic agents, and marking a stride towards achieving Artificial General Intelligence (AGI).

Navigate

193

Paper
Code

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort

no code implementations • 19 Nov 2023 • Wen Wang, Canyu Zhao, Hao Chen, Zhekai Chen, Kecheng Zheng, Chunhua Shen

We empirically find that sparse control conditions, such as bounding boxes, are suitable for layout planning, while dense control conditions, e. g., sketches and keypoints, are suitable for generating high-quality image content.

Image Generation Story Visualization

Paper
Add Code

Knowledge Graph Construction in Power Distribution Networks

no code implementations • 15 Nov 2023 • Xiang Li, Che Wang, Bing Li, Hao Chen, Sizhe Li

In this paper, we propose a method for knowledge graph construction in power distribution networks.

Entity Linking graph construction +1

Paper
Add Code

Alleviating Behavior Data Imbalance for Multi-Behavior Graph Collaborative Filtering

no code implementations • 12 Nov 2023 • Yijie Zhang, Yuanchen Bei, Shiqi Yang, Hao Chen, Zhiqing Li, Lijia Chen, Feiran Huang

To this end, we propose IMGCF, a simple but effective model to alleviate behavior data imbalance for multi-behavior graph collaborative filtering.

Collaborative Filtering Multi-Task Learning +1

Paper
Add Code

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

1 code implementation • 7 Nov 2023 • Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning.

Multi-Task Learning

Paper
Code

Channel Estimation and Training Design for Active RIS Aided Wireless Communications

no code implementations • 6 Nov 2023 • Hao Chen, Nanxi Li, Ruizhe Long, Ying-Chang Liang

To address this issue, we further investigate this ARIS-specific channel estimation problem and propose a least-square (LS) based channel estimator, whose performance can be further improved with the design on ARIS reflection patterns at the channel training phase.

Paper
Add Code

Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning

1 code implementation • NeurIPS 2023 • Shenzhi Wang, Qisen Yang, Jiawei Gao, Matthieu Gaetan Lin, Hao Chen, Liwei Wu, Ning Jia, Shiji Song, Gao Huang

Existing solutions tackle this problem by imposing a policy constraint on the policy improvement objective in both offline and online learning.

D4RL Reinforcement Learning (RL)

Paper
Code

CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents

no code implementations • 26 Oct 2023 • Qinlin Zhao, Jindong Wang, Yixuan Zhang, Yiqiao Jin, Kaijie Zhu, Hao Chen, Xing Xie

Large language models (LLMs) have been widely used as agents to complete different tasks, such as personal assistance or event planning.

Language Modelling Large Language Model

Paper
Add Code

Customising General Large Language Models for Specialised Emotion Recognition Tasks

no code implementations • 22 Oct 2023 • Liyizhe Peng, Zixing Zhang, Tao Pang, Jing Han, Huan Zhao, Hao Chen, Björn W. Schuller

This indicates the strong transferability and feasibility of LLMs in the field of emotion recognition.

Emotion Recognition Language Modelling +1

Paper
Add Code

De novo protein design using geometric vector field networks

no code implementations • 18 Oct 2023 • Weian Mao, Muzhi Zhu, Zheng Sun, Shuaike Shen, Lin Yuanbo Wu, Hao Chen, Chunhua Shen

Most prior encoders rely on atom-wise features, such as angles and distances between atoms, which are not available in this context.

Protein Design

Paper
Add Code

Object-aware Inversion and Reassembly for Image Editing

no code implementations • 18 Oct 2023 • Zhen Yang, Ganggui Ding, Wen Wang, Hao Chen, Bohan Zhuang, Chunhua Shen

Subsequently, we propose an additional reassembly step to seamlessly integrate the respective editing results and the non-editing region to obtain the final edited image.

Benchmarking Denoising +1

Paper
Add Code

RGM: A Robust Generalizable Matching Model

1 code implementation • 18 Oct 2023 • Songyan Zhang, Xinyu Sun, Hao Chen, Bo Li, Chunhua Shen

Finding corresponding pixels within a pair of images is a fundamental computer vision task with various applications.

Optical Flow Estimation

Paper
Code

Towards Intelligent Network Management: Leveraging AI for Network Service Detection

no code implementations • 14 Oct 2023 • Khuong N. Nguyen, Abhishek Sehgal, Yuming Zhu, Junsu Choi, Guanbo Chen, Hao Chen, Boon Loong Ng, Charlie Zhang

As the complexity and scale of modern computer networks continue to increase, there has emerged an urgent need for precise traffic analysis, which plays a pivotal role in cutting-edge wireless connectivity technologies.

Management Traffic Classification

Paper
Add Code

The Program Testing Ability of Large Language Models for Code

no code implementations • 9 Oct 2023 • Weimin Xiong, Yiwen Guo, Hao Chen

In this paper, we explore the ability of LLMs for testing programs/code.

Program Synthesis

Paper
Add Code

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

no code implementations • 8 Oct 2023 • Tianyang Zhong, Wei Zhao, Yutong Zhang, Yi Pan, Peixin Dong, Zuowei Jiang, Xiaoyan Kui, Youlan Shang, Li Yang, Yaonai Wei, Longtao Yang, Hao Chen, Huan Zhao, Yuxiao Liu, Ning Zhu, Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu, Haixing Dai, Zihao Wu, Lu Zhang, Shu Zhang, Xiaoyan Cai, Xintao Hu, Shijie Zhao, Xi Jiang, Xin Zhang, Xiang Li, Dajiang Zhu, Lei Guo, Dinggang Shen, Junwei Han, Tianming Liu, Jun Liu, Tuo Zhang

Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels.

Decision Making Language Modelling +1

Paper
Add Code

X-Transfer: A Transfer Learning-Based Framework for GAN-Generated Fake Image Detection

no code implementations • 7 Oct 2023 • Lei Zhang, Hao Chen, Shu Hu, Bin Zhu, Ching Sheng Lin, Xi Wu, Jinrong Hu, Xin Wang

Generative adversarial networks (GANs) have remarkably advanced in diverse domains, especially image generation and editing.

Fake Image Detection Image Generation +1

Paper
Add Code

Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

no code implementations • 4 Oct 2023 • Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM).

Paper
Add Code

Towards Domain-Specific Features Disentanglement for Domain Generalization

no code implementations • 4 Oct 2023 • Hao Chen, Qi Zhang, Zenan Huang, Haobo Wang, Junbo Zhao

Distributional shift between domains poses great challenges to modern machine learning algorithms.

Disentanglement Domain Generalization

Paper
Add Code

Completing Visual Objects via Bridging Generation and Segmentation

no code implementations • 1 Oct 2023 • Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu

This paper presents a novel approach to object completion, with the primary goal of reconstructing a complete object from its partially visible components.

Image Generation Object +1

Paper
Add Code

Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN

no code implementations • 29 Sep 2023 • Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen

For optical coherence tomography angiography (OCTA) images, a limited scanning rate leads to a trade-off between field-of-view (FOV) and imaging resolution.

Generative Adversarial Network Image Super-Resolution

Paper
Add Code

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

no code implementations • 29 Sep 2023 • Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

This paper aims to understand the nature of noise in pre-training datasets and to mitigate its impact on downstream tasks.

Paper
Add Code

Cross-Modal Translation and Alignment for Survival Analysis

1 code implementation • ICCV 2023 • Fengtao Zhou, Hao Chen

With the rapid advances in high-throughput sequencing technologies, the focus of survival analysis has shifted from examining clinical indicators to incorporating genomic profiles with pathological images.

Survival Analysis Survival Prediction +1

Paper
Code

Enabling Quartile-based Estimated-Mean Gradient Aggregation As Baseline for Federated Image Classifications

no code implementations • 21 Sep 2023 • Yusen Wu, Jamie Deng, Hao Chen, Phuong Nguyen, Yelena Yesha

Federated Learning (FL) has revolutionized how we train deep neural networks by enabling decentralized collaboration while safeguarding sensitive data and improving model performance.

Federated Learning

Paper
Add Code

Soft Merging: A Flexible and Robust Soft Model Merging Approach for Enhanced Neural Network Performance

no code implementations • 21 Sep 2023 • Hao Chen, Yusen Wu, Phuong Nguyen, Chao Liu, Yelena Yesha

This merging process not only enhances the model performance by converging to a better local optimum, but also minimizes computational costs, offering an efficient and explicit learning process integrated with stochastic gradient descent.

Paper
Add Code

Multi-view Self-supervised Disentanglement for General Image Denoising

1 code implementation • ICCV 2023 • Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao

It is understandable as the model is designed to learn paired mapping (e. g. from a noisy image to its clean version).

Ranked #1 on Denoising on CBSD68 sigm75

Disentanglement Image Denoising +1

Paper
Code

Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models

no code implementations • 9 Sep 2023 • Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix

As an effective way to alleviate the burden of data annotation, semi-supervised learning (SSL) provides an attractive solution due to its ability to leverage both labeled and unlabeled data to build a predictive model.

Paper
Add Code

Code Representation Pre-training with Complements from Program Executions

no code implementations • 4 Sep 2023 • Jiabo Huang, Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen

The test cases are obtained with the assistance of a customized fuzzer and are only required during pre-training.

Code Search Language Modelling

Paper
Add Code

DARC: Distribution-Aware Re-Coloring Model for Generalizable Nucleus Segmentation

1 code implementation • 1 Sep 2023 • Shengcong Chen, Changxing Ding, DaCheng Tao, Hao Chen

Second, we propose a new instance normalization method that is robust to the variation in foreground-background ratios.

Segmentation

Paper
Code

Using Large Language Models to Automate Category and Trend Analysis of Scientific Articles: An Application in Ophthalmology

no code implementations • 31 Aug 2023 • Hina Raja, Asim Munawar, Mohammad Delsoz, Mohammad Elahi, Yeganeh Madadi, Amr Hassan, Hashem Abu Serhan, Onur Inam, Luis Hermandez, Sang Tran, Wuqas Munir, Alaa Abd-Alrazaq, Hao Chen, SiamakYousefi

Moreover, the extendibility of the model to other scientific fields broadens its impact in facilitating research and trend analysis across diverse disciplines.

Zero-Shot Learning

Paper
Add Code

Unsupervised Domain Adaptation for Anatomical Landmark Detection

1 code implementation • 25 Aug 2023 • Haibo Jin, Haoxuan Che, Hao Chen

The framework leverages self-training and domain adversarial learning to address the domain gap during adaptation.

Unsupervised Domain Adaptation

Paper
Code

PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation

1 code implementation • 24 Aug 2023 • Haibo Jin, Haoxuan Che, Yi Lin, Hao Chen

To address these challenges, we propose diagnosis-driven prompts for medical report generation (PromptMRG), a novel framework that aims to improve the diagnostic accuracy of MRG with the guidance of diagnosis-aware prompts.

Medical Report Generation

Paper
Code

Diagnosing Infeasible Optimization Problems Using Large Language Models

no code implementations • 23 Aug 2023 • Hao Chen, Gonzalo E. Constante-Flores, Can Li

Decision-making problems can be represented as mathematical optimization models, finding wide applications in fields such as economics, engineering and manufacturing, transportation, and health care.

Chatbot Decision Making +1

Paper
Add Code

In-Rack Test Tube Pose Estimation Using RGB-D Data

no code implementations • 21 Aug 2023 • Hao Chen, Weiwei Wan, Masaki Matsushita, Takeyuki Kotaka, Kensuke Harada

Accurate robotic manipulation of test tubes in biology and medical industries is becoming increasingly important to address workforce shortages and improve worker safety.

Point Cloud Registration Pose Estimation

Paper
Add Code

Karma: Adaptive Video Streaming via Causal Sequence Modeling

no code implementations • 20 Aug 2023 • Bowei Xu, Hao Chen, Zhan Ma

Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.

Paper
Add Code

Interpretation on Multi-modal Visual Fusion

no code implementations • 19 Aug 2023 • Hao Chen, Haoran Zhou, Yongjian Deng

In this paper, we present an analytical framework and a novel metric to shed light on the interpretation of the multimodal vision community.

Paper
Add Code

Better Zero-Shot Reasoning with Role-Play Prompting

2 code implementations • 15 Aug 2023 • Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Enzhi Wang, Xiaohang Dong

This highlights its potential to augment the reasoning capabilities of LLMs.

Paper
Code

Target before Shooting: Accurate Anomaly Detection and Localization under One Millisecond via Cascade Patch Retrieval

1 code implementation • 13 Aug 2023 • Hanxi Li, Jianfei Hu, Bo Li, Hao Chen, Yongbin Zheng, Chunhua Shen

In this framework, the anomaly detection problem is solved via a cascade patch retrieval procedure that retrieves the nearest neighbors for each test image patch in a coarse-to-fine fashion.

Ranked #1 on Supervised Anomaly Detection on BTAD

Supervised Anomaly Detection

Paper
Code

When Monte-Carlo Dropout Meets Multi-Exit: Optimizing Bayesian Neural Networks on FPGA

1 code implementation • 13 Aug 2023 • Hongxiang Fan, Hao Chen, Liam Castelli, Zhiqiang Que, He Li, Kenneth Long, Wayne Luk

Bayesian Neural Networks (BayesNNs) have demonstrated their capability of providing calibrated prediction for safety-critical applications such as medical imaging and autonomous driving.

Autonomous Driving

Paper
Code

SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning

1 code implementation • ICCV 2023 • Muzhi Zhu, Hengtao Li, Hao Chen, Chengxiang Fan, Weian Mao, Chenchen Jing, Yifan Liu, Chunhua Shen

In this work, we propose a novel training mechanism termed SegPrompt that uses category information to improve the model's class-agnostic segmentation ability for both known and unknown categories.

Open-World Instance Segmentation Segmentation +1

108

Paper
Code

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

1 code implementation • NeurIPS 2023 • Weijia Wu, Yuzhong Zhao, Hao Chen, YuChao Gu, Rui Zhao, Yefei He, Hong Zhou, Mike Zheng Shou, Chunhua Shen

To showcase the power of the proposed approach, we generate datasets with rich dense pixel-wise labels for a wide range of downstream tasks, including semantic segmentation, instance segmentation, and depth estimation.

Depth Estimation Domain Generalization +5

281

Paper
Code

FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models

no code implementations • ICCV 2023 • Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, Feng Zhao

3D scene reconstruction is a long-standing vision task.

3D Scene Reconstruction Monocular Depth Estimation

Paper
Add Code

Phase Matching for Out-of-Distribution Generalization

no code implementations • 24 Jul 2023 • Chengming Hu, Yeqian Du, Rui Wang, Hao Chen

In this paper, we aim to clarify the relationships between Domain Generalization (DG) and the frequency components, and explore the spatial relationships of the phase spectrum.

Domain Generalization Out-of-Distribution Generalization +1

Paper
Add Code

CTVIS: Consistent Training for Online Video Instance Segmentation

1 code implementation • ICCV 2023 • Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen

The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS).

Ranked #2 on Video Instance Segmentation on Youtube-VIS 2022 Validation (using extra training data)

Instance Segmentation Semantic Segmentation +1

Paper
Code

Collaborative Graph Neural Networks for Attributed Network Embedding

1 code implementation • 22 Jul 2023 • Qiaoyu Tan, Xin Zhang, Xiao Huang, Hao Chen, Jundong Li, Xia Hu

Graph neural networks (GNNs) have shown prominent performance on attributed network embedding.

Attribute Network Embedding

Paper
Code

Improving Transferability of Adversarial Examples via Bayesian Attacks

no code implementations • 21 Jul 2023 • Qizhang Li, Yiwen Guo, Xiaochen Yang, WangMeng Zuo, Hao Chen

Our ICLR work advocated for enhancing transferability in adversarial examples by incorporating a Bayesian formulation into model parameters, which effectively emulates the ensemble of infinitely many deep neural networks, while, in this paper, we introduce a novel extension by incorporating the Bayesian formulation into the model input as well, enabling the joint diversification of both the model input and model parameters.

Paper
Add Code

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image

1 code implementation • ICCV 2023 • Wei Yin, Chi Zhang, Hao Chen, Zhipeng Cai, Gang Yu, Kaixuan Wang, Xiaozhi Chen, Chunhua Shen

State-of-the-art (SOTA) monocular metric depth estimation methods can only handle a single camera model and are unable to perform mixed-data training due to the metric ambiguity.

Ranked #19 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Image Reconstruction Monocular Depth Estimation +1

648

Paper
Code

Image Captions are Natural Prompts for Text-to-Image Models

1 code implementation • 17 Jul 2023 • Shiye Lei, Hao Chen, Sen Zhang, Bo Zhao, DaCheng Tao

With the rapid development of Artificial Intelligence Generated Content (AIGC), it has become common practice in many learning tasks to train or fine-tune large models on synthetic data due to the data-scarcity and privacy leakage problems.

Image Captioning Image Generation

Paper
Code

Dense Affinity Matching for Few-Shot Segmentation

no code implementations • 17 Jul 2023 • Hao Chen, Yonghan Dong, Zheming Lu, Yunlong Yu, Yingming Li, Jungong Han, Zhongfei Zhang

Few-Shot Segmentation (FSS) aims to segment the novel class images with a few annotated samples.

Few-Shot Semantic Segmentation

Paper
Add Code

Towards Generalizable Diabetic Retinopathy Grading in Unseen Domains

1 code implementation • 10 Jul 2023 • Haoxuan Che, YuHan Cheng, Haibo Jin, Hao Chen

Diabetic Retinopathy (DR) is a common complication of diabetes and a leading cause of blindness worldwide.

Diabetic Retinopathy Grading Domain Generalization

Paper
Code

A Survey on Evaluation of Large Language Models

1 code implementation • 6 Jul 2023 • Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang Wang, Yidong Wang, Wei Ye, Yue Zhang, Yi Chang, Philip S. Yu, Qiang Yang, Xing Xie

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications.

Ethics

1,225

Paper
Code

Low-Light Enhancement in the Frequency Domain

no code implementations • 29 Jun 2023 • Hao Chen, Zhi Jin

Hence, in this work, we propose a novel residual recurrent multi-wavelet convolutional neural network R2-MWCNN learned in the frequency domain that can simultaneously increase the image contrast and reduce noise signals well.

Image Enhancement object-detection +1

Paper
Add Code

RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model

1 code implementation • 28 Jun 2023 • Keyan Chen, Chenyang Liu, Hao Chen, Haotian Zhang, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

We also propose several ongoing derivatives for instance segmentation tasks, drawing on recent advancements within the SAM community, and compare their performance with RSPrompter.

Image Segmentation Instance Segmentation +2

437

Paper
Code

Deep Omni-supervised Learning for Rib Fracture Detection from Chest Radiology Images

1 code implementation • 23 Jun 2023 • Zhizhong Chai, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen

To tackle this challenge, the literature on object detection has witnessed an increase of weakly-supervised and semi-supervised approaches, yet still lacks a unified framework that leverages various forms of fully-labeled, weakly-labeled, and unlabeled data.

object-detection Object Detection

Paper
Code

Distributed Localization and Tracking Control for Nonholonomic Agents with Time-varying Bearing Formation

no code implementations • 19 Jun 2023 • Huiming Li, Hao Chen, Xiangke Wang, Mengge Zhang, Lincheng Shen

This paper studies the bearing-based time-varying formation control problem for unicycle-type agents without bearing rigidity conditions.

Paper
Add Code

Uncertainty Quantification via Spatial-Temporal Tweedie Model for Zero-inflated and Long-tail Travel Demand Prediction

1 code implementation • 16 Jun 2023 • Xinke Jiang, Dingyi Zhuang, Xianghui Zhang, Hao Chen, Jiayuan Luo, Xiaowei Gao

Understanding Origin-Destination (O-D) travel demand is crucial for transportation management.

Management Uncertainty Quantification

Paper
Code

Advancing Volumetric Medical Image Segmentation via Global-Local Masked Autoencoder

no code implementations • 15 Jun 2023 • Jia-Xin Zhuang, Luyang Luo, Hao Chen

Masked autoencoder (MAE) is a promising self-supervised pre-training technique that can improve the representation learning of a neural network without human intervention.

Image Segmentation Representation Learning +2

Paper
Add Code

Multimodal Optimal Transport-based Co-Attention Transformer with Global Structure Consistency for Survival Prediction

1 code implementation • ICCV 2023 • Yingxue Xu, Hao Chen

Survival prediction is a complicated ordinal regression task that aims to predict the ranking risk of death, which generally benefits from the integration of histology and genomic data.

Survival Analysis Survival Prediction +1

Paper
Code

A Dynamic Feature Interaction Framework for Multi-task Visual Perception

no code implementations • 8 Jun 2023 • Yuling Xi, Hao Chen, Ning Wang, Peng Wang, Yanning Zhang, Chunhua Shen, Yifan Liu

In particular, one feature merge branch is designed for instance-level recognition the other for dense predictions.

Autonomous Driving Depth Estimation +3

Paper
Add Code

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization

2 code implementations • 8 Jun 2023 • Yidong Wang, Zhuohao Yu, Zhengran Zeng, Linyi Yang, Cunxiang Wang, Hao Chen, Chaoya Jiang, Rui Xie, Jindong Wang, Xing Xie, Wei Ye, Shikun Zhang, Yue Zhang

To ensure the reliability of PandaLM, we collect a diverse human-annotated test dataset, where all contexts are generated by humans and labels are aligned with human preferences.

Language Modelling Large Language Model

840

Paper
Code

PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

1 code implementation • 7 Jun 2023 • Kaijie Zhu, Jindong Wang, Jiaheng Zhou, Zichen Wang, Hao Chen, Yidong Wang, Linyi Yang, Wei Ye, Yue Zhang, Neil Zhenqiang Gong, Xing Xie

The increasing reliance on Large Language Models (LLMs) across academia and industry necessitates a comprehensive understanding of their robustness to prompts.

Cross-Lingual Paraphrase Identification Machine Translation +5

1,974

Paper
Code

Efficient Anomaly Detection with Budget Annotation Using Semi-Supervised Residual Transformer

no code implementations • 6 Jun 2023 • Hanxi Li, Jingqi Wu, Hao Chen, Mingwen Wang, Chunhua Shen

Thus the sliding transformer can attain even higher accuracy with much less annotation labor.

Ranked #1 on Anomaly Detection on MVTec AD (Segmentation AUROC metric)

Supervised Anomaly Detection Unsupervised Anomaly Detection

Paper
Add Code

Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification

no code implementations • 4 Jun 2023 • Jintao Rong, Hao Chen, Tianxiao Chen, Linlin Ou, Xinyi Yu, Yifan Liu

Prompt learning has become a popular approach for adapting large vision-language models, such as CLIP, to downstream tasks.

Classification Domain Generalization +3

Paper
Add Code

Medication Recommendation via Domain Knowledge Informed Deep Learning

no code implementations • 31 May 2023 • Sicen Liu, Xiaolong Wang, Xianbing Zhao, Hao Chen

However, most of them neglect incorporating domain knowledge according to the clinical manifestations in the EHR of the patient.

Paper
Add Code

Few-Shot Speaker Identification Using Lightweight Prototypical Network with Feature Grouping and Interaction

no code implementations • 31 May 2023 • Yanxiong Li, Hao Chen, Wenchang Cao, Qisheng Huang, Qianhua He

In the proposed embedding module, audio feature of each speech sample is split into several low-dimensional feature subsets that are transformed by a recurrent convolutional block in parallel.

Speaker Identification

Paper
Add Code

Machine learning with tree tensor networks, CP rank constraints, and tensor dropout

no code implementations • 30 May 2023 • Hao Chen, Thomas Barthel

As suggested in [arXiv:2205. 15296] in the context of quantum many-body physics, computation costs can be further substantially reduced by imposing constraints on the canonical polyadic (CP) rank of the tensors in such networks.

Image Classification Tensor Networks

Paper
Add Code

Scale-aware Super-resolution Network with Dual Affinity Learning for Lesion Segmentation from Medical Images

no code implementations • 30 May 2023 • Yanwen Li, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen

To guide the segmentation branch to learn from richer high-resolution features, we propose a feature affinity module and a scale affinity module to enhance the multi-task learning of the dual branches.

Image Segmentation Image Super-Resolution +4

Paper
Add Code

Learning Conditional Attributes for Compositional Zero-Shot Learning

1 code implementation • CVPR 2023 • Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, Peng Wang, Chunhua Shen

Compositional Zero-Shot Learning (CZSL) aims to train models to recognize novel compositional concepts based on learned concepts such as attribute-object combinations.

Ranked #1 on Compositional Zero-Shot Learning on MIT-States

Attribute Compositional Zero-Shot Learning

Paper
Code

LoRAPrune: Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning

no code implementations • 28 May 2023 • Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang

This is due to their utilization of unstructured pruning on LPMs, impeding the merging of LoRA weights, or their dependence on the gradients of pre-trained weights to guide pruning, which can impose significant memory overhead.

Model Compression Network Pruning

Paper
Add Code

Continuous Cross-resolution Remote Sensing Image Change Detection

1 code implementation • 24 May 2023 • Hao Chen, Haotian Zhang, Keyan Chen, Chenyao Zhou, Song Chen, Zhengxia Zou, Zhenwei Shi

Toward continuous cross-resolution CD, we propose scale-invariant learning to enforce the model consistently predicting HR results given synthesized samples of varying resolution differences.

Change Detection

Paper
Code

Understanding Programs by Exploiting (Fuzzing) Test Cases

1 code implementation • 23 May 2023 • Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen

The effectiveness of the proposed method is verified on two program understanding tasks including code clone detection and code classification, and it outperforms current state-of-the-arts by large margins.

Clone Detection Code Classification +2

Paper
Code

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations

no code implementations • 22 May 2023 • Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

In this paper, we introduce imprecise label learning (ILL), a framework for the unification of learning with various imprecise label configurations.

Ranked #1 on Learning with noisy labels on mini WebVision 1.0

Learning with noisy labels Partial Label Learning

Paper
Add Code

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

1 code implementation • 22 May 2023 • Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, Chunhua Shen

In this work, we present Matcher, a novel perception paradigm that utilizes off-the-shelf vision foundation models to address various perception tasks.

Segmentation Semantic Segmentation

358

Paper
Code

Multi-factor Sequential Re-ranking with Perception-Aware Diversification

no code implementations • 21 May 2023 • Yue Xu, Hao Chen, Zefan Wang, Jianwen Yin, Qijie Shen, Dimin Wang, Feiran Huang, Lixiang Lai, Tao Zhuang, Junfeng Ge, Xia Hu

Feed recommendation systems, which recommend a sequence of items for users to browse and interact with, have gained significant popularity in practical applications.

Graph Clustering Recommendation Systems +1

Paper
Add Code

Multi-channel Integrated Recommendation with Exposure Constraints

no code implementations • 21 May 2023 • Yue Xu, Qijie Shen, Jianwen Yin, Zengde Deng, Dimin Wang, Hao Chen, Lixiang Lai, Tao Zhuang, Junfeng Ge

Integrated recommendation, which aims at jointly recommending heterogeneous items from different channels in a main feed, has been widely applied to various online platforms.

Recommendation Systems

Paper
Add Code

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning

no code implementations • 16 May 2023 • Hao Chen, Yiming Zhang, Qi Zhang, Hantao Yang, Xiaomeng Hu, Xuetao Ma, Yifan Yanggong, Junbo Zhao

Instruction tuning for large language models (LLMs) has gained attention from researchers due to its ability to unlock the potential of LLMs in following instructions.

Paper
Add Code

Diffusion Models for Imperceptible and Transferable Adversarial Attack

1 code implementation • 14 May 2023 • Jianqi Chen, Hao Chen, Keyan Chen, Yilan Zhang, Zhengxia Zou, Zhenwei Shi

Many existing adversarial attacks generate $L_p$-norm perturbations on image RGB space.

Adversarial Attack

106

Paper
Code

Reference-based OCT Angiogram Super-resolution with Learnable Texture Generation

no code implementations • 10 May 2023 • Yuyan Ruan, Dawei Yang, Ziqi Tang, An Ran Ran, Carol Y. Cheung, Hao Chen

The key difference between the proposed method and traditional RefSR models is that the textures used during inference are generated by the LTG instead of being searched from a single reference image.

Reference-based Super-Resolution Texture Synthesis

Paper
Add Code

PromptRank: Unsupervised Keyphrase Extraction Using Prompt

1 code implementation • 8 May 2023 • Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xiaoyan Bai

This demonstrates the great potential of using prompt for unsupervised keyphrase extraction.

Ranked #1 on Keyphrase Extraction on NUS

Keyphrase Extraction Language Modelling

Paper
Code

Revolutionizing Agrifood Systems with Artificial Intelligence: A Survey

no code implementations • 3 May 2023 • Tao Chen, Liang Lv, Di Wang, Jing Zhang, Yue Yang, Zeyang Zhao, Chen Wang, Xiaowei Guo, Hao Chen, Qingye Wang, Yufei Xu, Qiming Zhang, Bo Du, Liangpei Zhang, DaCheng Tao

With the world population rapidly increasing, transforming our agrifood systems to be more productive, efficient, safe, and sustainable is crucial to mitigate potential food shortages.

Paper
Add Code

Rethinking Boundary Detection in Deep Learning Models for Medical Image Segmentation

1 code implementation • 1 May 2023 • Yi Lin, Dong Zhang, Xiao Fang, Yufan Chen, Kwang-Ting Cheng, Hao Chen

Medical image segmentation is a fundamental task in the community of medical image analysis.

Boundary Detection Image Segmentation +3

Paper
Code

Improving Adversarial Transferability via Intermediate-level Perturbation Decay

2 code implementations • NeurIPS 2023 • Qizhang Li, Yiwen Guo, WangMeng Zuo, Hao Chen

In particular, the proposed method, named intermediate-level perturbation decay (ILPD), encourages the intermediate-level perturbation to be in an effective adversarial direction and to possess a great magnitude simultaneously.

136

Paper
Code

Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online Misinformation

no code implementations • 19 Apr 2023 • Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu

As growing usage of social media websites in the recent decades, the amount of news articles spreading online rapidly, resulting in an unprecedented scale of potentially fraudulent information.

Contrastive Learning Misinformation +1

Paper
Add Code

CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection

no code implementations • 19 Apr 2023 • Yang Yang, Weijie Ma, Hao Chen, Linlin Ou, Xinyi Yu

The combination of LiDAR and camera modalities is proven to be necessary and typical for 3D object detection according to recent studies.

3D Object Detection Depth Estimation +1

Paper
Add Code

Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes

1 code implementation • CVPR 2023 • Rui Li, Dong Gong, Wei Yin, Hao Chen, Yu Zhu, Kaixuan Wang, Xiaozhi Chen, Jinqiu Sun, Yanning Zhang

To let the geometric perception learned from multi-view cues in static areas propagate to the monocular representation in dynamic areas and let monocular cues enhance the representation of multi-view cost volume, we propose a cross-cue fusion (CCF) module, which includes the cross-cue attention (CCA) to encode the spatially non-local relative intra-relations from each source to enhance the representation of the other.

Autonomous Driving Depth Estimation

111

Paper
Code

Scale Federated Learning for Label Set Mismatch in Medical Image Classification

1 code implementation • 14 Apr 2023 • Zhipeng Deng, Luyang Luo, Hao Chen

Federated learning (FL) has been introduced to the healthcare domain as a decentralized learning paradigm that allows multiple parties to train a model collaboratively without privacy leakage.

Federated Learning Image Classification +2

Paper
Code

The Second Monocular Depth Estimation Challenge

no code implementations • 14 Apr 2023 • Jaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James Elder, Richard Bowden, Ali Anwar, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, Myungwoo Nam, Matteo Poggi, Xiaohua Qi, Jiahui Ren, Yang Tang, Fabio Tosi, Linh Trinh, S. M. Nadim Uddin, Khan Muhammad Umair, Kaixuan Wang, YuFei Wang, Yixing Wang, Mochu Xiang, Guangkai Xu, Wei Yin, Jun Yu, Qi Zhang, Chaoqiang Zhao

This paper discusses the results for the second edition of the Monocular Depth Estimation Challenge (MDEC).

Monocular Depth Estimation

Paper
Add Code

Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions

no code implementations • 13 Apr 2023 • Luyang Luo, Xi Wang, Yi Lin, Xiaoqi Ma, Andong Tan, Ronald Chan, Varut Vardhanabhuti, Winnie CW Chu, Kwang-Ting Cheng, Hao Chen

Breast cancer has reached the highest incidence rate worldwide among all malignancies since 2020.

Paper
Add Code

HNeRV: A Hybrid Neural Representation for Videos

1 code implementation • CVPR 2023 • Hao Chen, Matt Gwilliam, Ser-Nam Lim, Abhinav Shrivastava

Such embedding largely limits the regression capacity and internal generalization for video interpolation.

Ranked #3 on Video Reconstruction on UVG

Denoising regression +3

106

Paper
Code

DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images

no code implementations • 5 Apr 2023 • Bo Qian, Hao Chen, Xiangning Wang, Haoxuan Che, Gitaek Kwon, Jaeyoung Kim, Sungjin Choi, Seoyoung Shin, Felix Krause, Markus Unterdechler, Junlin Hou, Rui Feng, Yihao Li, Mostafa El Habib Daho, Qiang Wu, Ping Zhang, Xiaokang Yang, Yiyu Cai, Weiping Jia, Huating Li, Bin Sheng

Computer-assisted automatic analysis of diabetic retinopathy (DR) is of great importance in reducing the risks of vision loss and even blindness.

Benchmarking Data Augmentation +1

Paper
Add Code

Exploring Vision-Language Models for Imbalanced Learning

1 code implementation • 4 Apr 2023 • Yidong Wang, Zhuohao Yu, Jindong Wang, Qiang Heng, Hao Chen, Wei Ye, Rui Xie, Xing Xie, Shikun Zhang

However, their performance on imbalanced dataset is relatively poor, where the distribution of classes in the training dataset is skewed, leading to poor performance in predicting minority classes.

Zero-Shot Learning

110

Paper
Code

Learning Robust Medical Image Segmentation from Multi-source Annotations

no code implementations • 2 Apr 2023 • Yifeng Wang, Luyang Luo, Mingxiang Wu, Qiong Wang, Hao Chen

Learning segmentation networks from multi-source annotations remains a challenge due to the uncertainties brought by the variance of annotations and the quality of images.

Image Segmentation MRI segmentation +2

Paper
Add Code

Recover Triggered States: Protect Model Against Backdoor Attack in Reinforcement Learning

1 code implementation • 1 Apr 2023 • Hao Chen, Chen Gong, Yizhe WANG, Xinwen Hou

This paper proposes the Recovery Triggered States (RTS) method, a novel approach that effectively protects the victim agents from backdoor attacks.

Backdoor Attack reinforcement-learning

Paper
Code

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

1 code implementation • 30 Mar 2023 • Wen Wang, Yan Jiang, Kangyang Xie, Zide Liu, Hao Chen, Yue Cao, Xinlong Wang, Chunhua Shen

Our vid2vid-zero leverages off-the-shelf image diffusion models, and doesn't require training on any video.

Image Generation Video Alignment +1

319

Paper
Code

Iteratively Coupled Multiple Instance Learning from Instance to Bag Classifier for Whole Slide Image Classification

1 code implementation • 28 Mar 2023 • Hongyi Wang, Luyang Luo, Fang Wang, Ruofeng Tong, Yen-Wei Chen, Hongjie Hu, Lanfen Lin, Hao Chen

In ICMIL, we use category information in the bag-level classifier to guide the patch-level fine-tuning of the patch feature extractor.

Classification Image Classification +1

Paper
Code

Image Quality-aware Diagnosis via Meta-knowledge Co-embedding

1 code implementation • CVPR 2023 • Haoxuan Che, Siyu Chen, Hao Chen

Medical images usually suffer from image degradation in clinical practice, leading to decreased performance of deep learning-based models.

Image Quality Assessment Meta-Learning

Paper
Code

DoNet: Deep De-overlapping Network for Cytology Instance Segmentation

1 code implementation • CVPR 2023 • Hao Jiang, Rushan Zhang, Yanning Zhou, Yumeng Wang, Hao Chen

Cell instance segmentation in cytology images has significant importance for biology analysis and cancer screening, while remains challenging due to 1) the extensive overlapping translucent cell clusters that cause the ambiguous boundaries, and 2) the confusion of mimics and debris as nuclei.

Instance Segmentation Region Proposal +2

Paper
Code

Few Shot Medical Image Segmentation with Cross Attention Transformer

1 code implementation • 24 Mar 2023 • Yi Lin, Yufan Chen, Kwang-Ting Cheng, Hao Chen

Our proposed network mines the correlations between the support image and query image, limiting them to focus only on useful foreground information and boosting the representation capacity of both the support prototype and query features.

Few-Shot Learning Image Segmentation +3

Paper
Code

Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications

no code implementations • 24 Mar 2023 • Junhao Dong, Junxi Chen, Xiaohua Xie, JianHuang Lai, Hao Chen

In this exposition, we present a comprehensive survey on recent advances in adversarial attack and defense for medical image analysis with a novel taxonomy in terms of the application scenario.

Adversarial Attack Medical Diagnosis

Paper
Add Code

Two-level Graph Network for Few-Shot Class-Incremental Learning

no code implementations • 24 Mar 2023 • Hao Chen, Linyan Li, Fan Lyu, Fuyuan Hu, Zhenping Xia, Fenglei Xu

Class-level graph network aims to mitigate the semantic conflict between prototype features of new classes and old classes.

Few-Shot Class-Incremental Learning Incremental Learning +1

Paper
Add Code

Towards Scalable Neural Representation for Diverse Videos

no code implementations • CVPR 2023 • Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava

Implicit neural representations (INR) have gained increasing attention in representing 3D scenes and images, and have been recently applied to encode videos (e. g., NeRV, E-NeRV).

Action Recognition Video Compression

Paper
Add Code

Boosting Convolution with Efficient MLP-Permutation for Volumetric Medical Image Segmentation

no code implementations • 23 Mar 2023 • Yi Lin, Xiao Fang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

Recently, the advent of vision Transformer (ViT) has brought substantial advancements in 3D dataset benchmarks, particularly in 3D volumetric medical image segmentation (Vol-MedSeg).

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Exploring Visual Prompts for Whole Slide Image Classification with Multiple Instance Learning

no code implementations • 23 Mar 2023 • Yi Lin, Zhongchen Zhao, Zhengjie ZHU, Lisheng Wang, Kwang-Ting Cheng, Hao Chen

Multiple instance learning (MIL) has emerged as a popular method for classifying histopathology whole slide images (WSIs).

Image Classification Multiple Instance Learning +1

Paper
Add Code

Label-Efficient Deep Learning in Medical Image Analysis: Challenges and Future Directions

no code implementations • 22 Mar 2023 • Cheng Jin, Zhengrui Guo, Yi Lin, Luyang Luo, Hao Chen

Thus, label-efficient deep learning methods are developed to make comprehensive use of the labeled data as well as the abundance of unlabeled and weak-labeled data.

Paper
Add Code

Implicit Ray-Transformers for Multi-view Remote Sensing Image Segmentation

no code implementations • 15 Mar 2023 • Zipeng Qi, Hao Chen, Chenyang Liu, Zhenwei Shi, Zhengxia Zou

In the first stage, we optimize a neural field to encode the color and 3D structure of the remote sensing scene based on multi-view images.

Image Segmentation Scene Segmentation +1

Paper
Add Code

A Monkey Swing Counting Algorithm Based on Object Detection

no code implementations • 12 Mar 2023 • Hao Chen, Zhe-Ming Lu, Jie Liu

This paper focuses on proposing a deep learning-based monkey swing counting algorithm.

object-detection Object Detection

Paper
Add Code

Traj-MAE: Masked Autoencoders for Trajectory Prediction

no code implementations • ICCV 2023 • Hao Chen, Jiaze Wang, Kun Shao, Furui Liu, Jianye Hao, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng

Specifically, our Traj-MAE employs diverse masking strategies to pre-train the trajectory encoder and map encoder, allowing for the capture of social and temporal information among agents while leveraging the effect of environment from multiple granularities.

Autonomous Driving Trajectory Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.