no code implementations • ICLR 2019 • Yaohua Tang, Kaixiang Mo, Qian Xu, Chao Zhang, Qiang Yang
When building models for novel natural language domains, a major challenge is the lack of data in the new domains, whether annotated or not.
1 code implementation • NAACL 2022 • Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang
We develop AcTune, a new framework that improves the label efficiency of active PLM fine-tuning by unleashing the power of unlabeled data via self-training.
1 code implementation • ACL 2022 • Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, Chao Zhang
Weakly-supervised learning (WSL) has shown promising results in addressing label scarcity on many NLP tasks, but manually designing a comprehensive, high-quality labeling rule set is tedious and difficult.
no code implementations • 27 May 2024 • Chao Zhang, Haoxin Zhang, Shiwei Wu, Di Wu, Tong Xu, Yan Gao, Yao Hu, Enhong Chen
We propose two ways to enhance the focus on visual information.
no code implementations • 26 May 2024 • Yawen Zou, Chunzhi Gu, Jun Yu, Shangce Gao, Chao Zhang
Black-box unsupervised domain adaptation (BBUDA) learns only from the source model's predictions on target data, without access to either the source data or the source model, which alleviates concerns about data privacy and security.
no code implementations • 23 May 2024 • Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang
We propose an unsupervised adaptation framework, Self-TAught Recognizer (STAR), which leverages unlabeled data to enhance the robustness of automatic speech recognition (ASR) systems in diverse target domains, such as noise and accents.
Automatic Speech Recognition (ASR) +1
no code implementations • 22 May 2024 • Guangzhi Sun, Potsawee Manakul, Adian Liusie, Kunat Pipatanakul, Chao Zhang, Phil Woodland, Mark Gales
Multimodal foundation models are prone to hallucination, generating outputs that either contradict the input or are not grounded by factual information.
1 code implementation • 14 May 2024 • Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, Zheng Fang, Weiyan Wang, Jinbao Xue, Yangyu Tao, Jianchen Zhu, Kai Liu, Sihuan Lin, Yifu Sun, Yun Li, Dongdong Wang, Mingtao Chen, Zhichao Hu, Xiao Xiao, Yan Chen, Yuhong Liu, Wei Liu, Di Wang, Yong Yang, Jie Jiang, Qinglin Lu
For fine-grained language understanding, we train a Multimodal Large Language Model to refine the captions of the images.
no code implementations • 13 May 2024 • Yifei Sun, Hang Zou, Chao Zhang, Samson Lasaulce, Michel Kieffer
Motivated by applications such as the smart grid, this paper focuses on a goal function which is of $L_p$-norm-type.
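The entry above names only the norm family; as general background (a standard definition, not a formula taken from the paper), an $L_p$-norm-type goal function has the form:

```latex
% Standard L_p-norm definition (general background, not the paper's exact goal function)
f_p(x) = \|x\|_p = \Big( \sum_{i=1}^{n} |x_i|^{p} \Big)^{1/p}, \qquad p \ge 1
```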
1 code implementation • 29 Apr 2024 • Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl Yang
Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks, but remains challenging due to the lack of sufficient publicly annotated biomedical data and computational resources.
no code implementations • 26 Apr 2024 • Shun Maeda, Chunzhi Gu, Jun Yu, Shogo Tokai, Shangce Gao, Chao Zhang
We introduce the task of human action anomaly detection (HAAD), which aims to identify anomalous motions in an unsupervised manner given only the pre-determined normal category of training action samples.
no code implementations • 23 Apr 2024 • Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang
Large language models (LLMs) can adapt to new tasks through in-context learning (ICL) based on a few examples presented in dialogue history without any model parameter update.
no code implementations • 22 Apr 2024 • Husnain Shahid, Carla Amatetti, Riccardo Campana, Sorya Tong, Dorin Panaitopol, Alessandro Vanelli Coralli, Abdelhamed Mohamed, Chao Zhang, Ebraam Khalifa, Eduardo Medeiros, Estefania Recayte, Fatemeh Ghasemifard, Ji Lianghai, Juan Bucheli, Karthik Anantha Swamy, Marius Caus, Mehmet Gurelli, Miguel A. Vazquez, Musbah Shaat, Nathan Borios, Per-Erik Eriksson, Sebastian Euler, Zheng Li, Xiaotian Fu
The efforts on the development, standardization and improvements to communication systems towards 5G Advanced and 6G are on track to provide benefits such as an unprecedented level of connectivity and performance, enabling a diverse range of vertical services.
no code implementations • 12 Apr 2024 • Siqi Han, Chao Zhang, Jiaxin Lei, Qingquan Han, Yuhui Du, Anhe Wang, Shuo Bai, Milin Zhang
The proposed method achieves an accuracy of 99.62% on the artifact detection task and an 82.79% accuracy on the 6-category eye movement classification task.
no code implementations • 6 Apr 2024 • Juan Wen, Yawei Li, Chao Zhang, Weiyan Hou, Radu Timofte, Luc van Gool
Integration of attention mechanisms across feature and positional dimensions further enhances the recovery of fine details.
1 code implementation • 28 Mar 2024 • Chengzu Li, Chao Zhang, Simone Teufel, Rama Sanand Doddipatla, Svetlana Stoyanchev
In this paper, we propose a new approach to navigation instruction generation by framing the problem as an image captioning task using semantic maps as visual input.
1 code implementation • 25 Mar 2024 • Rui Zhong, Yuefeng Xu, Chao Zhang, Jun Yu
In this paper, we employ the large language model (LLM) ChatGPT-3.5 to automatically and quickly design a new metaheuristic algorithm (MA) with only a small amount of input.
no code implementations • 21 Mar 2024 • Zhe Chen, Heyang Liu, Wenyi Yu, Guangzhi Sun, Hongcheng Liu, Ji Wu, Chao Zhang, Yu Wang, Yanfeng Wang
Although multiple academic video datasets have been constructed and released, few of them support both multimodal content recognition and understanding tasks, which is partially due to the lack of high-quality human annotations.
1 code implementation • 17 Mar 2024 • Yuzhao Heng, Chunyuan Deng, Yitong Li, Yue Yu, Yinghao Li, Rongzhi Zhang, Chao Zhang
Although Large Language Models (LLMs) exhibit remarkable adaptability across domains, these models often fall short in structured knowledge extraction tasks such as named entity recognition (NER).
1 code implementation • 15 Mar 2024 • Rui Zhong, Yuefeng Xu, Chao Zhang, Jun Yu
This paper introduces a novel metaheuristic algorithm, known as the efficient multiplayer battle game optimizer (EMBGO), specifically designed for addressing complex numerical optimization tasks.
no code implementations • 11 Mar 2024 • Chao Zhang, Mohan Li, Ignas Budvytis, Stephan Liwicki
However, most existing works in embodied dialog research focus on navigation and leave the localization task understudied.
no code implementations • 4 Mar 2024 • Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen
Indeed, learning to generate hashtags/categories can potentially enhance note embeddings, since both compress key note information into limited content.
1 code implementation • 3 Mar 2024 • Boyang Wang, Fengyu Yang, Xihang Yu, Chao Zhang, Hanbin Zhao
In addition, we identify two anime-specific challenges of distorted and faint hand-drawn lines and unwanted color artifacts.
1 code implementation • 29 Feb 2024 • Pranav Shetty, Aishat Adeboye, Sonakshi Gupta, Chao Zhang, Rampi Ramprasad
We present a natural language processing pipeline that was used to extract polymer solar cell property data from the literature and simulate various active learning strategies.
no code implementations • 28 Feb 2024 • Lingkai Kong, Yuanqi Du, Wenhao Mu, Kirill Neklyudov, Valentin De Bortoli, Haorui Wang, Dongxia Wu, Aaron Ferber, Yi-An Ma, Carla P. Gomes, Chao Zhang
To constrain the optimization process to the data manifold, we reformulate the original optimization problem as a sampling problem from the product of the Boltzmann distribution defined by the objective function and the data distribution learned by the diffusion model.
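In symbols, the reformulation described above samples from a product distribution; the sketch below is consistent with the sentence, with $\lambda$ an assumed temperature-like weight and $p_{\theta}$ the data distribution learned by the diffusion model:

```latex
% Target distribution: Boltzmann factor of the objective f times the learned data distribution
\pi(x) \;\propto\; \underbrace{e^{-\lambda f(x)}}_{\text{Boltzmann term}} \,\cdot\, \underbrace{p_{\theta}(x)}_{\text{diffusion model}}
```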
1 code implementation • 26 Feb 2024 • Hao Wang, Zeyu Gao, Chao Zhang, Zihan Sha, Mingyang Sun, Yuchen Zhou, Wenyu Zhu, Wenju Sun, Han Qiu, Xi Xiao
At its core, our approach achieves superior transfer learning capabilities by effectively aligning binary code with its semantic explanations (in natural language), resulting in a model able to generate better embeddings for binary code.
no code implementations • 21 Feb 2024 • Lingxi Zhang, Yue Yu, Kuan Wang, Chao Zhang
Retrieval-augmented generation enhances large language models (LLMs) by incorporating relevant information from external knowledge sources.
1 code implementation • 20 Feb 2024 • Yinghao Li, Rampi Ramprasad, Chao Zhang
It breaks the generation into a two-step pipeline: initially, LLMs generate answers in natural language as intermediate responses.
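A minimal sketch of such a two-step pipeline, with the LLM call stubbed out for illustration; the second (parsing) step and all function names are assumptions, not the paper's actual implementation:

```python
import re

def llm_generate(question: str) -> str:
    # Step 1 stand-in: an LLM call that answers in natural language
    # as an intermediate response (stubbed here with a fixed string).
    return "The melting point of water ice is 0 degrees Celsius."

def parse_structured(nl_answer: str) -> dict:
    # Assumed step 2: convert the natural-language intermediate response
    # into a structured record via deterministic parsing.
    m = re.search(r"(-?\d+(?:\.\d+)?)\s*degrees Celsius", nl_answer)
    return {"value": float(m.group(1)), "unit": "C"} if m else {}

answer = parse_structured(llm_generate("What is the melting point of ice?"))
```

The point of the split is that the LLM stays in its natural register, while structure is imposed afterwards by a cheap deterministic step.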
no code implementations • 20 Feb 2024 • Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland
The evidential uncertainty measure is extended to quantify the uncertainty in emotion distribution estimation.
1 code implementation • 13 Feb 2024 • Haotian Sun, Yuchen Zhuang, Wei Wei, Chao Zhang, Bo Dai
BBox-Adapter distinguishes target and source domain data by treating target data as positive and source data as negative.
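The positive/negative treatment of target and source data described above resembles a noise-contrastive objective; the following is a minimal sketch under that assumption, not the paper's exact loss:

```python
import math

def adapter_nce_loss(pos_scores, neg_scores):
    # Hedged sketch: target-domain examples act as positives, source-domain
    # examples as negatives, in a softmax-style contrastive objective.
    loss = 0.0
    for p in pos_scores:
        denom = math.exp(p) + sum(math.exp(n) for n in neg_scores)
        loss += -math.log(math.exp(p) / denom)
    return loss / len(pos_scores)
```

Scoring target data higher than source data drives the loss toward zero, which is the intended adaptation signal.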
1 code implementation • 26 Jan 2024 • Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang
In this paper, we reconsider speculative sampling and derive two key observations.
no code implementations • 25 Jan 2024 • Huminhao Zhu, Fangyikang Wang, Chao Zhang, Hanbin Zhao, Hui Qian
We utilize the velocity field matching training scheme in NSGF, which only requires samples from the source and target distribution to compute an empirical velocity field approximation.
no code implementations • 24 Jan 2024 • Yiqiao Liao, Chao Zhang, Milin Zhang, Zhihua Wang, Xiang Xie
This paper proposes LightSleepNet, a lightweight, 1-D convolutional neural network (CNN) based personalized architecture for real-time sleep staging, which can be implemented on various mobile platforms with limited hardware resources.
no code implementations • 24 Jan 2024 • Vidit Jain, Mukund Rungta, Yuchen Zhuang, Yue Yu, Zeyu Wang, Mu Gao, Jeffrey Skolnick, Chao Zhang
The best-performing models aim to learn a static representation by combining document and hierarchical label information.
no code implementations • 24 Jan 2024 • Haorui Wang, Rongzhi Zhang, Yinghao Li, Lingkai Kong, Yuchen Zhuang, Xiusi Chen, Chao Zhang
The teacher LLM generates problem-solving instructions and corrective principles based on the student LLM's errors.
1 code implementation • 19 Jan 2024 • Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng
To this end, we propose to extract a language-space noise embedding from the N-best list to represent the noise conditions of source speech, which can promote the denoising process in GER.
Automatic Speech Recognition (ASR) +6
no code implementations • 19 Jan 2024 • Chao Zhang, Yuren Mao, Yijiang Fan, Yu Mi, Yunjun Gao, Lu Chen, Dongfang Lou, Jinshu Lin
Text-to-SQL, which provides a zero-code interface for operating relational databases, has gained much attention in financial analysis, because financial professionals may not be well-skilled in SQL programming.
no code implementations • 12 Jan 2024 • Shangqing Xu, Chao Zhang
In each step, it analyzes a pool of candidate examples and identifies the ones most likely to challenge the LLM's current understanding, measured by a new metric called misconfidence.
no code implementations • 7 Jan 2024 • Chao Zhang, Yiqiao Liao, Siqi Han, Milin Zhang, Zhihua Wang, Xiang Xie
The proposed algorithm achieves a state-of-the-art single-channel sleep staging accuracy of 86.5%, with only 0.6% deterioration from the state-of-the-art multi-channel model.
no code implementations • 7 Jan 2024 • Chao Zhang, Yongxiang Guo, Dawid Sheng, Zhixiong Ma, Chao Sun, Yuwei Zhang, Wenxin Zhao, Fenyan Zhang, Tongfei Wang, Xing Sheng, Milin Zhang
This work presents the first fabricated electrophysiology-optogenetic closed-loop bidirectional brain-machine interface (CL-BBMI) system-on-chip (SoC) with electrical neural signal recording, on-chip sleep staging and optogenetic stimulation.
1 code implementation • 6 Jan 2024 • Zeju Li, Chao Zhang, Xiaoyan Wang, Ruilong Ren, Yifan Xu, Ruifei Ma, Xiangde Liu
The remarkable potential of multi-modal large language models (MLLMs) in comprehending both vision and language information has been widely acknowledged.
no code implementations • 3 Jan 2024 • Wei Qian, Chenxu Zhao, Yangyi Li, Fenglong Ma, Chao Zhang, Mengdi Huai
To tackle the aforementioned challenges, in this paper we design a novel uncertainty modeling framework for self-explaining networks. It not only demonstrates strong distribution-free uncertainty modeling performance for the generated explanations in the interpretation layer, but also excels in producing efficient and effective prediction sets for the final predictions based on the informative high-level basis explanations.
no code implementations • 31 Dec 2023 • Yuefeng Xu, Rui Zhong, Chao Zhang, Jun Yu
Various popular multiplayer battle royale games share a lot of common elements.
1 code implementation • 29 Dec 2023 • Derong Xu, Wei Chen, Wenjun Peng, Chao Zhang, Tong Xu, Xiangyu Zhao, Xian Wu, Yefeng Zheng, Enhong Chen
Information extraction (IE) aims to extract structural knowledge (such as entities, relations, and events) from plain natural language texts.
no code implementations • 27 Dec 2023 • Fangyikang Wang, Huminhao Zhu, Chao Zhang, Hanbin Zhao, Hui Qian
Particle-based Variational Inference (ParVI) methods approximate the target distribution by iteratively evolving finite weighted particle systems.
no code implementations • International Conference on Communication, Image and Signal Processing (CCISP) 2023 • Di Wu, Zhihui Xin, Chao Zhang
Experiments show that the proposed algorithm recovers image edges and texture-complex regions better than traditional gradient algorithms such as BI, Cok, Hibbard, Laroche, and Hamilton, with higher PSNR and SSIM values and better subjective visual perception. Moreover, the algorithm involves only add-subtract and shift operations, making it suitable for implementation on hardware platforms.
no code implementations • 30 Nov 2023 • Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
To fully exploit saliency guidance, on each map, we select a pixel pair from the cluster with the highest centroid saliency to form a patch pair.
no code implementations • 29 Nov 2023 • Rudra P. K. Poudel, Harit Pandya, Chao Zhang, Roberto Cipolla
Furthermore, our proposed technique of explicit language-grounded visual representation learning has the potential to improve models for human-robot interaction because our extracted visual features are language grounded.
Model-based Reinforcement Learning • Out-of-Distribution Generalization +2
1 code implementation • 21 Nov 2023 • Zeyu Gao, Hao Wang, Yuchen Zhou, Wenyu Zhu, Chao Zhang
Given the significant successes of large language models (LLMs) in various tasks, there is growing anticipation of their efficacy in vulnerability detection.
1 code implementation • 13 Nov 2023 • Jerry Junyang Cheung, Yuchen Zhuang, Yinghao Li, Pranav Shetty, Wantian Zhao, Sanjeev Grampurohit, Rampi Ramprasad, Chao Zhang
Scientific information extraction (SciIE), which aims to automatically extract information from scientific literature, is becoming more important than ever.
no code implementations • 13 Nov 2023 • Yue Yu, Jiaming Shen, Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, Michael Bendersky
To fully unleash the power of explanations, we propose EASE, an Explanation-Aware Soft Ensemble framework to empower in-context learning with LLMs.
no code implementations • 13 Nov 2023 • Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gašić, Philip C. Woodland
Recently, advancements in large language models (LLMs) have shown an unprecedented ability across various language tasks.
1 code implementation • 13 Nov 2023 • Yinghao Li, Haorui Wang, Chao Zhang
Large Language Models (LLMs) have shown remarkable proficiency in language understanding and have been successfully applied to a variety of real-world tasks through task-specific fine-tuning or prompt engineering.
no code implementations • 7 Nov 2023 • Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
We present PD-REAL, a novel large-scale dataset for unsupervised anomaly detection (AD) in the 3D domain.
no code implementations • 1 Nov 2023 • Chao Zhang, Hang Zou, Samson Lasaulce, Lucas Saludjian
Estimating the channel state is known to be an important problem in wireless networks.
1 code implementation • 25 Oct 2023 • Zichong Li, Yanbo Xu, Simiao Zuo, Haoming Jiang, Chao Zhang, Tuo Zhao, Hongyuan Zha
We conduct extensive experiments in both event type prediction and uncertainty quantification of arrival time.
no code implementations • 23 Oct 2023 • Chunzhi Gu, Chao Zhang, Shigeru Kuriyama
Specifically, we follow a two-stage forecasting strategy: we first employ the motion diffusion model to generate the target motion with a specified future action, and then produce the in-betweening to smoothly connect the observation and prediction, eventually addressing motion prediction.
no code implementations • 20 Oct 2023 • Yuchen Zhuang, Xiang Chen, Tong Yu, Saayan Mitra, Victor Bursztyn, Ryan A. Rossi, Somdeb Sarkhel, Chao Zhang
It formulates the entire action space as a decision tree, where each node represents a possible API function call involved in a solution plan.
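A minimal sketch of the decision-tree action space described above, where each node is one possible API function call in a solution plan; the class and method names are hypothetical, chosen only for illustration:

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class APICallNode:
    # Each node represents one possible API function call in a solution plan.
    func_name: str
    args: dict
    children: List["APICallNode"] = field(default_factory=list)

    def expand(self, child: "APICallNode") -> "APICallNode":
        # Attach a follow-up API call as a child branch and return it.
        self.children.append(child)
        return child

    def plan(self) -> list:
        # Walk one root-to-leaf path (greedily taking the first child)
        # to read out a candidate solution plan.
        node, path = self, []
        while node:
            path.append(node.func_name)
            node = node.children[0] if node.children else None
        return path

root = APICallNode("search", {"q": "weather"})
leaf = root.expand(APICallNode("summarize", {}))
```

In this framing, planning amounts to searching the tree for a root-to-leaf path whose sequence of calls solves the task.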
1 code implementation • 20 Oct 2023 • Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang
Hearing is arguably an essential ability of artificial intelligence (AI) agents in the physical world, which refers to the perception and understanding of general auditory information consisting of at least three types of sounds: speech, audio events, and music.
1 code implementation • 17 Oct 2023 • Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodríguez, Chao Zhang, B. Aditya Prakash
We close both of these gaps and propose PROFHiT, a fully probabilistic hierarchical forecasting model that jointly models the forecast distribution of the entire hierarchy.
no code implementations • 10 Oct 2023 • Tong Guo, Xuanping Li, Haitao Yang, Xiao Liang, Yong Yuan, Jingyou Hou, Bingqing Ke, Chao Zhang, Junlin He, Shunyu Zhang, Enyun Yu, Wen Wu
Overall historical behaviors are diverse but noisy, while search behaviors are always sparse.
2 code implementations • 9 Oct 2023 • Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang
Audio-visual large language models (LLM) have drawn significant attention, yet the fine-grained combination of both input streams is rather under-explored, which is challenging but necessary for LLMs to understand general video inputs.
no code implementations • 7 Oct 2023 • Theodor Nguyen, Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C Woodland
For the reverse-time process, a parametrised score function is conditioned on a target speaker embedding to extract the target speaker from the mixture of sources.
1 code implementation • 6 Oct 2023 • Wei Lv, Chao Zhang, Huaxiong Li, Xiuyi Jia, Chunlin Chen
We further consider the graph noise of projected data caused by missing samples and use a tensor-decomposition-based graph filter for robust clustering. JPLTD decomposes the original tensor into an intrinsic tensor and a sparse tensor.
no code implementations • 6 Oct 2023 • Ziyun Cui, Wen Wu, Wei-Qiang Zhang, Ji Wu, Chao Zhang
Apart from the knowledge from speech-generic representations, this paper also proposes to simultaneously transfer the knowledge from a speech depression detection task based on the high comorbidity rates of depression and AD.
no code implementations • 4 Oct 2023 • Guoxin Wang, Xuyang Cao, Shan An, Fengmei Fan, Chao Zhang, Jinsong Wang, Feng Yu, Zhiren Wang
In this work, we proposed a multi-dimension-embedding-aware modality fusion transformer (MFFormer) for schizophrenia and bipolar disorder classification using rs-fMRI and T1 weighted structural MRI (T1w sMRI).
no code implementations • 1 Oct 2023 • Kuan Wang, Yadong Lu, Michael Santacroce, Yeyun Gong, Chao Zhang, Yelong Shen
To optimize agent interactions for task-specific learning with our universal buffer and pipeline, we introduce diverse communication patterns tailored for both single-agent and multi-agent environments.
1 code implementation • 30 Sep 2023 • Wen Wu, Wenlin Chen, Chao Zhang, Philip C. Woodland
Human annotator simulation (HAS) serves as a cost-effective substitute for human evaluation such as data annotation and system assessment.
no code implementations • 25 Sep 2023 • Katsuya Hotta, Chao Zhang, Yoshihiro Hagihara, Takuya Akashi
In this paper, we propose a novel subspace-guided feature reconstruction framework to pursue adaptive feature approximation for anomaly localization.
no code implementations • 25 Sep 2023 • Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang
Q-Former-based LLMs can generalise well to out-of-domain datasets, where 12% relative WER reductions over the Whisper baseline ASR model were achieved on the Eval2000 test set without using any in-domain training data from Switchboard.
Automatic Speech Recognition (ASR) +3
no code implementations • 22 Sep 2023 • Shutong Feng, Guangzhi Sun, Nurul Lubis, Chao Zhang, Milica Gašić
This study delves into the capacity of large language models (LLMs) to recognise human affect in conversations, with a focus on both open-domain chit-chat dialogues and task-oriented dialogues.
Automatic Speech Recognition (ASR) +2
1 code implementation • 17 Sep 2023 • Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng
Recent end-to-end automatic speech recognition (ASR) models have become increasingly larger, making them particularly challenging to deploy on resource-constrained devices.
Automatic Speech Recognition (ASR) +2
no code implementations • 14 Sep 2023 • Yu Ding, Jun Yu, Chunzhi Gu, Shangce Gao, Chao Zhang
Recently, a novel mathematical ANN model, known as the dendritic neuron model (DNM), has been proposed to address nonlinear problems by more accurately reflecting the structure of real neurons.
no code implementations • 13 Sep 2023 • Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang
Language-level adaptation experiments using Chinese dialects showed that when applying SICL to isolated word ASR, consistent and considerable relative WER reductions can be achieved using Whisper models of any size on two dialects, averaging 32.3%.
Automatic Speech Recognition (ASR) +3
1 code implementation • 13 Sep 2023 • Yuhui Li, Fangyun Wei, Jinjing Zhao, Chao Zhang, Hongyang Zhang
We discover that by integrating self-evaluation and rewind mechanisms, unaligned LLMs can directly produce responses consistent with human preferences via self-boosting.
no code implementations • 12 Sep 2023 • Tao Ma, Chao Zhang, Min Lu, Lin Luo
Renal pathology, as the gold standard of kidney disease diagnosis, requires doctors to analyze a series of tissue slices stained by H&E staining and special staining like Masson, PASM, and PAS, respectively.
no code implementations • 8 Sep 2023 • Yang Li, Cheng Yu, Guangzhi Sun, Weiqin Zu, Zheng Tian, Ying Wen, Wei Pan, Chao Zhang, Jun Wang, Yang Yang, Fanglei Sun
Experimental results on the LibriTTS datasets demonstrate that our proposed models significantly enhance speech synthesis and editing, producing more natural and expressive speech.
no code implementations • 1 Sep 2023 • Rui Feng, Huan Tran, Aubrey Toland, Binghong Chen, Qi Zhu, Rampi Ramprasad, Chao Zhang
Machine learning (ML) forcefields have been developed to achieve both the accuracy of ab initio methods and the efficiency of empirical force fields.
no code implementations • 27 Aug 2023 • Zining Zhu, Haoming Jiang, Jingfeng Yang, Sreyashi Nag, Chao Zhang, Jie Huang, Yifan Gao, Frank Rudzicz, Bing Yin
Situated NLE provides a perspective and facilitates further research on the generation and evaluation of explanations.
1 code implementation • 24 Aug 2023 • Wenyu Zhu, Hao Wang, Yuchen Zhou, JiaMing Wang, Zihan Sha, Zeyu Gao, Chao Zhang
By feeding explicit knowledge as additional inputs to the Transformer, and fusing implicit knowledge with a novel pre-training task, kTrans provides a new perspective to incorporating domain knowledge into a Transformer framework.
1 code implementation • 14 Aug 2023 • Wen Wu, Chao Zhang, Philip C. Woodland
Two metrics are proposed to evaluate AER performance with automatic segmentation based on time-weighted emotion and speaker classification errors.
1 code implementation • ICCV 2023 • Jianshuo Dong, Han Qiu, Yiming Li, Tianwei Zhang, Yuanjie Li, Zeqi Lai, Chao Zhang, Shu-Tao Xia
We propose a training-assisted bit flip attack, in which the adversary is involved in the training stage to build a high-risk model to release.
no code implementations • 11 Aug 2023 • Lingkai Kong, Wenhao Mu, Jiaming Cui, Yuchen Zhuang, B. Aditya Prakash, Bo Dai, Chao Zhang
However, existing end-to-end DFL methods are hindered by three significant bottlenecks: model mismatch error, sample average approximation error, and gradient approximation error.
no code implementations • 9 Aug 2023 • Hangjie Shi, Leslie Ball, Govind Thattai, Desheng Zhang, Lucy Hu, Qiaozi Gao, Suhaila Shakiah, Xiaofeng Gao, Aishwarya Padmakumar, Bofei Yang, Cadence Chung, Dinakar Guthy, Gaurav Sukhatme, Karthika Arumugam, Matthew Wen, Osman Ipek, Patrick Lange, Rohan Khanna, Shreyas Pansare, Vasu Sharma, Chao Zhang, Cris Flagg, Daniel Pressel, Lavina Vaz, Luke Dai, Prasoon Goyal, Sattvik Sahai, Shaohua Liu, Yao Lu, Anna Gottardi, Shui Hu, Yang Liu, Dilek Hakkani-Tur, Kate Bland, Heather Rocker, James Jeun, Yadunandana Rao, Michael Johnston, Akshaya Iyengar, Arindam Mandal, Prem Natarajan, Reza Ghanadan
The Alexa Prize program has empowered numerous university students to explore, experiment, and showcase their talents in building conversational agents through challenges like the SocialBot Grand Challenge and the TaskBot Challenge.
no code implementations • 2 Aug 2023 • Yan Ma, Weicong Liang, Bohan Chen, Yiduo Hao, BoJian Hou, Xiangyu Yue, Chao Zhang, Yuhui Yuan
Motivated by the remarkable achievements of DETR-based approaches on COCO object detection and segmentation benchmarks, recent endeavors have been directed towards elevating their performance through self-supervised pre-training of Transformers while preserving a frozen backbone.
no code implementations • 1 Aug 2023 • Chao Zhang, Xingyue Pu, Mihai Cucuringu, Xiaowen Dong
We present a novel methodology for modeling and forecasting multivariate realized volatilities using customized graph neural networks to incorporate spillover effects across stocks.
no code implementations • 26 Jul 2023 • Chao Zhang, Xinyu Chen, Wensheng Li, Lixue Liu, Wei Wu, Dacheng Tao
In this paper, we measure the linear separability of hidden layer outputs to study the characteristics of deep neural networks.
1 code implementation • 17 Jul 2023 • Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, Chao Zhang
However, existing diffusion-based graph generative models are mostly one-shot generative models that apply Gaussian diffusion in the dequantized adjacency matrix space.
1 code implementation • 14 Jul 2023 • XueMei Dong, Chao Zhang, Yuhang Ge, Yuren Mao, Yunjun Gao, Lu Chen, Jinshu Lin, Dongfang Lou
This paper proposes a ChatGPT-based zero-shot Text-to-SQL method, dubbed C3, which achieves 82.3% execution accuracy on the holdout test set of Spider and becomes the state-of-the-art zero-shot Text-to-SQL method on the Spider Challenge.
Ranked #4 on Text-to-SQL on Spider
no code implementations • 10 Jul 2023 • Longbin Li, Chao Zhang, Sen Li, Yun Zhong, Qingwen Liu, Xiaoyi Zeng
Graph-based CF methods improve personalization by modeling collaborative signal within the user click graph.
no code implementations • 4 Jul 2023 • Guangzhi Sun, Chao Zhang, Ivan Vulić, Paweł Budzianowski, Philip C. Woodland
In this work, we propose a Knowledge-Aware Audio-Grounded generative slot-filling framework, termed KA2G, that focuses on few-shot and zero-shot slot filling for ToD with speech input.
Automatic Speech Recognition (ASR) +6
no code implementations • 29 Jun 2023 • Jiahao Xie, Chao Zhang, Weijie Liu, Wensong Bai, Hui Qian
The vulnerability of deep neural network models to adversarial example attacks is a practical challenge in many artificial intelligence applications.
1 code implementation • NeurIPS 2023 • Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen, Chao Zhang
Large language models (LLMs) have been recently leveraged as training data generators for various natural language processing (NLP) tasks.
1 code implementation • 26 Jun 2023 • Chao Zhang, Shiwei Wu, Sirui Zhao, Tong Xu, Enhong Chen
In this paper, we present a solution for enhancing video alignment to improve multi-step inference.
no code implementations • 25 Jun 2023 • Yuchen Zhuang, Xin Shen, Yan Zhao, Chaosheng Dong, Ming Wang, Jin Li, Chao Zhang
The detection of the underlying shopping intentions of users based on their historical interactions is a crucial aspect for e-commerce platforms, such as Amazon, to enhance the convenience and efficiency of their customers' shopping experiences.
1 code implementation • NeurIPS 2023 • Yuchen Zhuang, Yue Yu, Kuan Wang, Haotian Sun, Chao Zhang
To address this issue, we introduce a new dataset called ToolQA, which is designed to faithfully evaluate LLMs' ability to use external tools for question answering.
1 code implementation • 15 Jun 2023 • Ziyang Ma, Zhisheng Zheng, Guanrou Yang, Yu Wang, Chao Zhang, Xie Chen
Our models outperform other SSL models significantly on the LibriSpeech benchmark without the need for iterative re-clustering and re-training.
2 code implementations • 14 Jun 2023 • Yinghao Li, Lingkai Kong, Yuanqi Du, Yue Yu, Yuchen Zhuang, Wenhao Mu, Chao Zhang
While some studies have included UQ to improve molecular pre-trained models, the process of selecting suitable backbone and UQ methods for reliable molecular uncertainty estimation remains underexplored.
1 code implementation • 11 Jun 2023 • Wen Wu, Chao Zhang, Philip C. Woodland
In automatic emotion recognition (AER), labels assigned by different human annotators to the same utterance are often inconsistent due to the inherent complexity of emotion and the subjectivity of perception.
no code implementations • 11 Jun 2023 • Wensong Bai, Chao Zhang, Yichao Fu, Lingwei Peng, Hui Qian, Bin Dai
In this paper, we propose the first fully push-forward-based Distributional Reinforcement Learning algorithm, called Push-forward-based Actor-Critic EncourageR (PACER).
1 code implementation • ICCV 2023 • Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li
3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots.
no code implementations • 8 Jun 2023 • Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Yijin Li, Hongwei Qin, Jifeng Dai, Xiaogang Wang, Hongsheng Li
This paper introduces a novel transformer-based network architecture, FlowFormer, along with the Masked Cost Volume AutoEncoding (MCVA) for pretraining it to tackle the problem of optical flow estimation.
no code implementations • 5 Jun 2023 • Rongzhi Zhang, Yue Yu, Jiaming Shen, Xiquan Cui, Chao Zhang
In this work, we show that the standard implementation of the convex combination of base learners can hardly work due to the presence of noisy labels.
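The convex combination of base learners that the abstract says breaks down under noisy labels has a simple form; the sketch below is illustrative (function and variable names are not from the paper):

```python
import numpy as np

def convex_combination(probs_list, weights):
    """Combine base learners' class-probability outputs with convex weights
    (non-negative, summing to one) -- the standard ensemble form referred to
    above."""
    weights = np.asarray(weights, dtype=float)
    assert np.all(weights >= 0) and np.isclose(weights.sum(), 1.0)
    stacked = np.stack(probs_list)                 # (K, n_samples, n_classes)
    return np.tensordot(weights, stacked, axes=1)  # (n_samples, n_classes)

# two toy base learners on three samples, two classes
p1 = np.array([[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]])
p2 = np.array([[0.7, 0.3], [0.4, 0.6], [0.1, 0.9]])
combined = convex_combination([p1, p2], [0.5, 0.5])
```

Because the weights are convex, each combined row remains a valid probability distribution.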
1 code implementation • 2 Jun 2023 • Guangzhi Sun, Xianrui Zheng, Chao Zhang, Philip C. Woodland
End-to-end automatic speech recognition (ASR) and large language models, such as Whisper and GPT-2, have recently been scaled to use vast amounts of training data.
Automatic Speech Recognition (ASR) +1
no code implementations • 30 May 2023 • Shiyang Li, Yifan Gao, Haoming Jiang, Qingyu Yin, Zheng Li, Xifeng Yan, Chao Zhang, Bing Yin
State-of-the-art methods often utilize entities in questions to retrieve local subgraphs, which are then fed into a KG encoder, e.g., graph neural networks (GNNs), to model their local structures before being integrated into language models for question answering.
1 code implementation • 30 May 2023 • Yuchen Zhuang, Yue Yu, Lingkai Kong, Xiang Chen, Chao Zhang
Most existing methods for learning from noisy labels use static input features for denoising, but these methods are limited by the information they can provide on true label distributions and can result in biased or incorrect predictions.
1 code implementation • 30 May 2023 • Guangzhi Sun, Chao Zhang, Phil Woodland
The incorporation of biasing words obtained through contextual knowledge is of paramount importance in automatic speech recognition (ASR) applications.
Automatic Speech Recognition (ASR) +1
1 code implementation • NeurIPS 2023 • Haotian Sun, Yuchen Zhuang, Lingkai Kong, Bo Dai, Chao Zhang
We propose a closed-loop approach, AdaPlanner, which allows the LLM agent to refine its self-generated plan adaptively in response to environmental feedback.
no code implementations • 23 May 2023 • Yinghao Li, Colin Lockard, Prashant Shiralkar, Chao Zhang
To establish such connections, we propose to extract PTs from the Web pages containing hand-crafted PT recommendations for SIs.
no code implementations • 20 May 2023 • Wen Wu, Chao Zhang, Philip C. Woodland
This paper proposes handling training data sparsity in speech-based automatic depression detection (SDD) using foundation models pre-trained with self-supervised learning (SSL).
Automatic Speech Recognition (ASR) +5
no code implementations • 19 May 2023 • Jie Huang, Yifan Gao, Zheng Li, Jingfeng Yang, Yangqiu Song, Chao Zhang, Zining Zhu, Haoming Jiang, Kevin Chen-Chuan Chang, Bing Yin
We propose and study Complementary Concept Generation (CCGen): given a concept of interest, e.g., "Digital Cameras", generating a list of complementary concepts, e.g., 1) Camera Lenses 2) Batteries 3) Camera Cases 4) Memory Cards 5) Battery Chargers.
1 code implementation • 18 May 2023 • Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen, Chao Zhang
With the development of large language models (LLMs), zero-shot learning has attracted much attention for various NLP tasks.
Ranked #1 on Zero-Shot Text Classification on AG News
no code implementations • 9 May 2023 • Ziheng Geng, Chao Zhang, Yuhao Ren, Minxiang Zhu, Renpeng Chen, Hongzhan Cheng
The real-time information refers to the real-time operating parameters of the EPB shield, which are input into a random forest to provide a real-time prediction of ground properties.
no code implementations • 8 May 2023 • Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Jialu Liu, Michael Bendersky, Marc Najork, Chao Zhang
In this work, we argue that such a learning objective is sub-optimal because there exists a discrepancy between the teacher's output distribution and the ground truth label distribution.
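The learning objective being critiqued is the standard distillation loss, a convex mix of ground-truth cross-entropy and KL divergence to the teacher's softened distribution; a minimal sketch of that baseline objective (names and defaults are illustrative, not the paper's):

```python
import math

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def kd_loss(student_logits, teacher_probs, true_label, alpha=0.5, T=2.0):
    """Standard knowledge-distillation objective: cross-entropy with the
    one-hot ground truth plus KL divergence to the teacher's
    temperature-softened distribution. The abstract above argues this
    target is sub-optimal when the teacher's distribution deviates from
    the true label distribution."""
    p_student_T = softmax([z / T for z in student_logits])
    ce = -math.log(softmax(student_logits)[true_label])
    kl = sum(t * (math.log(t) - math.log(s))
             for t, s in zip(teacher_probs, p_student_T) if t > 0)
    return alpha * ce + (1 - alpha) * (T ** 2) * kl
```

When the teacher's distribution matches the student's softened one, the KL term vanishes and only the ground-truth cross-entropy remains.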
no code implementations • 23 Apr 2023 • Chao Zhang, Hui Qian, Jiahao Xie
Wasserstein Barycenter Problem (WBP) has recently received much attention in the field of artificial intelligence.
no code implementations • 23 Apr 2023 • Zebang Shen, Hui Qian, Tongzhou Mu, Chao Zhang
Nowadays, algorithms with fast convergence, small memory footprints, and low per-iteration complexity are particularly favorable for artificial intelligence applications.
no code implementations • 16 Apr 2023 • Peilin Chen, Hong Wen, Jing Zhang, Fuyu Lv, Zhao Li, Qijie Shen, Wanjie Tao, Ying Zhou, Chao Zhang
Online travel platforms (OTPs), e.g., Ctrip.com or Fliggy.com, can effectively provide travel-related products or services to users.
no code implementations • 27 Mar 2023 • Ruijie Wang, Zheng Li, Jingfeng Yang, Tianyu Cao, Chao Zhang, Bing Yin, Tarek Abdelzaher
This paper investigates the cross-lingual temporal knowledge graph reasoning problem, which aims to facilitate reasoning on Temporal Knowledge Graphs (TKGs) in low-resource languages by transferring knowledge from TKGs in high-resource ones.
no code implementations • 20 Mar 2023 • Xiaoyu Yang, Qiujia Li, Chao Zhang, Philip C. Woodland
The performance of the student model can be further enhanced when multiple teachers are used jointly, achieving word error rate reductions (WERRs) of 17.5% and 10.6%.
Automatic Speech Recognition (ASR) +3
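The WERR figures quoted above are relative reductions in word error rate; the metric can be sketched as:

```python
def werr(baseline_wer, new_wer):
    """Relative word error rate reduction (WERR), in percent: the metric
    behind figures such as the 17.5% and 10.6% reported above."""
    return 100.0 * (baseline_wer - new_wer) / baseline_wer
```

For example, reducing a 10.0% WER to 8.25% is a 17.5% WERR.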
no code implementations • 27 Feb 2023 • Zhengqing Yuan, Huiwen Xue, Chao Zhang, Yongming Liu
Large deep learning models have shown great potential for delivering exceptional results in various applications.
no code implementations • 22 Feb 2023 • Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Shuo-Yiin Chang
Consequently, the UML enables the interpretation of each output node to switch depending on the language of the input speech.
Automatic Speech Recognition (ASR) +1
no code implementations • 8 Feb 2023 • Zhengqing Yuan, Huiwen Xue, Chao Zhang, Yongming Liu
EvoText enables the model to learn up-to-date knowledge through a self-escalation process that builds on a priori knowledge.
no code implementations • CVPR 2023 • Hao Li, Dingwen Zhang, Nian Liu, Lechao Cheng, Yalun Dai, Chao Zhang, Xinggang Wang, Junwei Han
Inspired by the recent success of the Prompting technique, we introduce a new pre-training method that boosts QEIS models by giving Saliency Prompt for queries/kernels.
no code implementations • 1 Feb 2023 • Zheqi Yu, Chao Zhang, Pedro Machado, Adnan Zahid, Tim. Fernandez-Hart, Muhammad A. Imran, Qammer H. Abbasi
The growing demand for optimal and low-power energy consumption paradigms for Internet of Things (IoT) devices has garnered significant attention due to their cost-effectiveness, simplicity, and intelligibility.
1 code implementation • 10 Jan 2023 • ran Xu, Yue Yu, Hejie Cui, Xuan Kan, Yanqiao Zhu, Joyce Ho, Chao Zhang, Carl Yang
Our further analysis demonstrates that our proposed data selection strategy reduces the noise of pseudo labels by 36.8% and saves 57.3% of the time when compared with the best baseline.
no code implementations • 12 Dec 2022 • Junke Wang, Zhenxin Li, Chao Zhang, Jingjing Chen, Zuxuan Wu, Larry S. Davis, Yu-Gang Jiang
Online media data, in the forms of images and videos, are becoming mainstream communication channels.
no code implementations • 2 Dec 2022 • Hao Wang, Lixue Liu, Xueguan Song, Chao Zhang, DaCheng Tao
In tunnel boring machine (TBM) underground projects, an accurate description of the rock-soil types distributed in the tunnel can decrease the construction risk (e.g., surface settlement and landslide) and improve the efficiency of construction.
1 code implementation • 26 Nov 2022 • Yuhui Li, Zejia Wu, Chao Zhang, Hongyang Zhang
In this work, we introduce the concepts of direct and indirect effects from causal inference to the domain generalization problem.
1 code implementation • 25 Nov 2022 • Lingkai Kong, Jiaming Cui, Yuchen Zhuang, Rui Feng, B. Aditya Prakash, Chao Zhang
Decision-focused learning (DFL) was recently proposed for stochastic optimization problems that involve unknown parameters.
no code implementations • 16 Nov 2022 • Chao Zhang, Siqi Han, Milin Zhang
Electroencephalogram (EEG) signals can easily become incomplete due to packet loss, electrode detachment, and other factors.
no code implementations • 10 Nov 2022 • Chao Zhang, Hang Zou, Samson Lasaulce, Walid Saad, Marios Kountouris, Mehdi Bennis
Internet of Things (IoT) devices will play an important role in emerging applications, since their sensing, actuation, processing, and wireless communication capabilities stimulate data collection, transmission and decision processes of smart applications.
1 code implementation • 9 Nov 2022 • Wen Wu, Chao Zhang, Philip C. Woodland
Automatic emotion recognition in conversation (ERC) is crucial for emotion-aware conversational artificial intelligence.
no code implementations • 1 Nov 2022 • Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara Sainath, Chao Zhang, Yanzhang He
In this work, we propose a method to jointly train the ASR and EP tasks in a single end-to-end (E2E) multitask model, improving EP quality by optionally leveraging information from the ASR audio encoder.
Automatic Speech Recognition (ASR) +1
1 code implementation • 1 Nov 2022 • Yue Yu, Xuan Kan, Hejie Cui, ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang
To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on Task-aware Brain connectivity DAG (short for Directed Acyclic Graph) Structure generation for fMRI analysis.
no code implementations • 31 Oct 2022 • Jianjian Qin, Chunzhi Gu, Jun Yu, Chao Zhang
Moreover, our method only requires very few normal samples to train the student network due to the teacher-student distillation mechanism.
1 code implementation • 29 Oct 2022 • Guangzhi Sun, Chao Zhang, Philip C. Woodland
Specifically, a tree-constrained pointer generator (TCPGen), a powerful and efficient biasing model component, is studied, which leverages a slot shortlist with corresponding entities to extract biasing lists.
1 code implementation • 28 Oct 2022 • Zihan Zhang, Jinfeng Li, Ning Shi, Bo Yuan, Xiangyu Liu, Rong Zhang, Hui Xue, Donghong Sun, Chao Zhang
Despite their superb performance on a wide range of tasks, pre-trained language models (e.g., BERT) have been proven vulnerable to adversarial texts.
1 code implementation • 27 Oct 2022 • Yue Yu, Chenyan Xiong, Si Sun, Chao Zhang, Arnold Overwijk
We present a new zero-shot dense retrieval (ZeroDR) method, COCO-DR, to improve the generalization ability of dense retrieval by combating the distribution shifts between source training tasks and target scenarios.
Ranked #1 on Zero-shot Text Search on CQADupStack
no code implementations • 26 Oct 2022 • Yanbo Xu, Alind Khare, Glenn Matlin, Monish Ramadoss, Rishikesan Kamaleswaran, Chao Zhang, Alexey Tumanov
It achieves accuracy within 0.1% of the highest-performing multi-class baseline, while saving close to 20X on the spatio-temporal cost of inference and predicting disease onset earlier (by 3.5 hrs).
1 code implementation • 26 Oct 2022 • Yuchen Zhuang, Yinghao Li, Jerry Junyang Cheung, Yue Yu, Yingjun Mou, Xiang Chen, Le Song, Chao Zhang
We study the problem of extracting N-ary relation tuples from scientific articles.
no code implementations • 26 Oct 2022 • Wei Wang, Chao Zhang, Xiaopei Wu
In this paper, we use limited code-switching data as driving material and explore a shortcut to quickly develop intra-sentential code-switching recognition on a commissioned native-language acoustic model. We propose a data-driven method to build a seed lexicon, which is used to train a grapheme-to-phoneme model that predicts pronunciations for foreign-language words in code-switching sentences.
no code implementations • 9 Oct 2022 • Yukun Zheng, Jiang Bian, Guanghao Meng, Chao Zhang, Honggang Wang, Zhixuan Zhang, Sen Li, Tao Zhuang, Qingwen Liu, Xiaoyi Zeng
These problems prompt us to further strengthen the capabilities of our EBR model in both relevance estimation and personalized retrieval.
4 code implementations • 3 Oct 2022 • Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, WeiHong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu
Vision transformers have recently achieved competitive results across various vision tasks but still suffer from heavy computation costs when processing a large number of tokens.
no code implementations • 30 Sep 2022 • Hang Zou, Chao Zhang, Samson Lasaulce, Lucas Saludjian, Vincent Poor
The task is modeled by the minimization problem of a general goal function $f(x;g)$ for which the decision $x$ has to be taken from a quantized version of the parameters $g$.
1 code implementation • 27 Sep 2022 • Pranav Shetty, Arunkumar Chitteth Rajan, Christopher Kuenneth, Sonkakshi Gupta, Lakshmi Prerana Panchumarti, Lauren Holm, Chao Zhang, Rampi Ramprasad
The ever-increasing number of materials science articles makes it hard to infer chemistry-structure-property relations from published literature.
no code implementations • 15 Sep 2022 • Simiao Zuo, Qingyu Yin, Haoming Jiang, Shaohui Xi, Bing Yin, Chao Zhang, Tuo Zhao
The model subsequently calculates session representations by combining the contextual information with the instant search query using an aggregation network.
1 code implementation • 15 Sep 2022 • Yue Yu, Rongzhi Zhang, ran Xu, Jieyu Zhang, Jiaming Shen, Chao Zhang
Large Language Models have demonstrated remarkable few-shot performance, but the performance can be sensitive to the selection of few-shot instances.
no code implementations • 13 Sep 2022 • Chao Zhang, Bo Li, Tara Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani
Language identification is critical for many downstream tasks in automatic speech recognition (ASR), and is beneficial to integrate into multilingual end-to-end ASR as an additional task.
Automatic Speech Recognition (ASR) +2
no code implementations • 3 Sep 2022 • Chao Zhang, Zijian Tang, Taoming Guo, Jiaxin Lei, Jiaxin Xiao, Anhe Wang, Shuo Bai, Milin Zhang
This paper proposes SaleNet - an end-to-end convolutional neural network (CNN) for sustained attention level evaluation using prefrontal electroencephalogram (EEG).
no code implementations • 29 Aug 2022 • Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He
This makes doing speech recognition with conversational speech, including one with multiple queries, a challenging task.
1 code implementation • 7 Aug 2022 • Mengyang Liu, Haozheng Luo, Leonard Thong, Yinghao Li, Chao Zhang, Le Song
Compared to frequently used text annotation tools, our annotation tool allows for the development of weak labels in addition to providing a manual annotation experience.
8 code implementations • CVPR 2023 • Ding Jia, Yuhui Yuan, Haodi He, Xiaopei Wu, Haojun Yu, WeiHong Lin, Lei Sun, Chao Zhang, Han Hu
One-to-one set matching is a key design for DETR to establish its end-to-end capability, so that object detection does not require a hand-crafted NMS (non-maximum suppression) to remove duplicate detections.
1 code implementation • 16 Jul 2022 • Zizheng Huang, Haoxing Chen, Ziqi Wen, Chao Zhang, Huaxiong Li, Bo wang, Chunlin Chen
Contrastive learning (CL) continuously achieves significant breakthroughs across multiple domains.
no code implementations • 8 Jul 2022 • Xianrui Zheng, Chao Zhang, Philip C. Woodland
Self-supervised-learning-based pre-trained models for speech data, such as Wav2Vec 2.0 (W2V2), have become the backbone of many speech tasks.
no code implementations • 4 Jul 2022 • Chunzhi Gu, Jun Yu, Chao Zhang
Specifically, the inductive bias imposed by the extra CVAE path encourages two latent variables in two paths to respectively govern separate representations for each partial-body motion.
no code implementations • 2 Jul 2022 • Guangzhi Sun, Chao Zhang, Philip C. Woodland
Incorporating biasing words obtained as contextual knowledge is critical for many automatic speech recognition (ASR) applications.
Automatic Speech Recognition (ASR) +1
no code implementations • 28 Jun 2022 • Rongzhi Zhang, Rebecca West, Xiquan Cui, Chao Zhang
We develop AMRule, a multi-view rule discovery framework that can (1) adaptively and iteratively discover novel rules that complement the current weakly-supervised model to improve compatibility prediction; and (2) discover interpretable rules from both structured attribute tables and unstructured product descriptions.
no code implementations • 20 Jun 2022 • Guile Wu, Chao Zhang, Stephan Liwicki
In global consistent quantization, we employ contrastive learning for both embedding and quantized representations and fuse these representations for consistent contrastive regularization between instances.
1 code implementation • 16 Jun 2022 • Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodríguez, Chao Zhang, B. Aditya Prakash
We close both of these gaps and propose PROFHiT, a fully probabilistic hierarchical forecasting model that jointly models the forecast distribution of the entire hierarchy.
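Hierarchical forecasting carries the aggregation constraint that each parent series equals the sum of its children; a minimal coherence check (illustrative, not PROFHiT's actual mechanism) looks like:

```python
import numpy as np

def hierarchy_coherent(parent, children, tol=1e-8):
    """Check the aggregation constraint of hierarchical time series:
    the parent series should equal the element-wise sum of its children."""
    gap = np.abs(np.asarray(parent, dtype=float) - np.sum(children, axis=0))
    return bool(np.all(gap < tol))
```

A probabilistic hierarchical model must keep its forecast distributions consistent with this constraint rather than just the point forecasts.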
1 code implementation • 27 May 2022 • Yinghao Li, Le Song, Chao Zhang
Weakly supervised named entity recognition methods train label models to aggregate the token annotations of multiple noisy labeling functions (LFs) without seeing any manually annotated labels.
no code implementations • 18 May 2022 • Guangzhi Sun, Chao Zhang, Philip C Woodland
MBWE and BLMD further improved the effectiveness of TCPGen and achieved more significant WER reductions on the biasing words.
1 code implementation • 18 May 2022 • Wensheng Li, Chao Zhang, Chuncheng Wang, Hanting Guan, DaCheng Tao
Physics-informed neural networks (PINNs) provide a deep learning framework for numerically solving partial differential equations (PDEs), and have been widely used in a variety of PDE problems.
no code implementations • NAACL 2022 • Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang
User sessions empower many search and recommendation tasks on a daily basis.
1 code implementation • 30 Mar 2022 • Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li
We introduce the optical Flow transFormer, dubbed FlowFormer, a transformer-based neural network architecture for learning optical flow.
Ranked #1 on Optical Flow Estimation on Sintel-final
no code implementations • CVPR 2022 • Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li
Specifically, we map a series of related partial point clouds into multiple complete shape and occlusion code pairs and fuse the codes to obtain their representations in the unified latent space.
1 code implementation • 18 Mar 2022 • Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, Chao Zhang
Weakly-supervised learning (WSL) has shown promising results in addressing label scarcity on many NLP tasks, but manually designing a comprehensive, high-quality labeling rule set is tedious and difficult.
no code implementations • 18 Mar 2022 • Jun Quan, Ze Wei, Qiang Gan, Jingqi Yao, Jingyi Lu, Yuchen Dong, Yiming Liu, Yi Zeng, Chao Zhang, Yongzhi Li, Huang Hu, Yingying He, Yang Yang, Daxin Jiang
The conversational recommender systems (CRSs) have received extensive attention in recent years.
1 code implementation • CVPR 2022 • Xingbo Dong, Wanyan Xu, Zhihui Miao, Lan Ma, Chao Zhang, Jiewen Yang, Zhe Jin, Andrew Beng Jin Teoh, Jiajun Shen
Next, a fully convolutional network is proposed to achieve the low-light image enhancement by fusing colored raw data with synthesized monochrome raw data.
no code implementations • 8 Mar 2022 • Wen Wu, Chao Zhang, Xixin Wu, Philip C. Woodland
In this paper, a novel Bayesian training loss based on per-utterance Dirichlet prior distributions is proposed for verbal emotion recognition, which models the uncertainty in one-hot labels created when human annotators assign the same utterance to different emotion classes.
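One common way to see why disagreeing annotations invite a Dirichlet treatment is the posterior-mean soft label over annotator votes; the sketch below is illustrative and not the paper's exact loss:

```python
import numpy as np

def soft_label_from_votes(votes, alpha=1.0):
    """Posterior-mean soft label under a symmetric Dirichlet(alpha) prior
    given per-class annotator vote counts -- one way to turn disagreeing
    one-hot annotations into a target distribution."""
    votes = np.asarray(votes, dtype=float)
    return (votes + alpha) / (votes.sum() + alpha * len(votes))

# four emotion classes; three annotators voted: 2x class 0, 1x class 2
label = soft_label_from_votes([2, 0, 1, 0])
```

The prior smooths the empirical vote proportions, so classes no annotator chose still receive non-zero probability mass.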
no code implementations • 7 Mar 2022 • Qi Zhu, Chao Zhang, Chanyoung Park, Carl Yang, Jiawei Han
Then a shift-robust classifier is optimized on the training graph and on adversarial samples of the target graph, which are generated by a cluster GNN.
no code implementations • 3 Mar 2022 • Rama Cont, Mihai Cucuringu, Renyuan Xu, Chao Zhang
The estimation of loss distributions for dynamic portfolios requires the simulation of scenarios representing realistic joint dynamics of their components, with particular importance devoted to the simulation of tail risk scenarios.
1 code implementation • 11 Feb 2022 • Jieyu Zhang, Cheng-Yu Hsieh, Yue Yu, Chao Zhang, Alexander Ratner
Labeling training data has become one of the major roadblocks to using machine learning.
no code implementations • 8 Feb 2022 • Chao Zhang, Yihuang Zhang, Mihai Cucuringu, Zhongmin Qian
We apply machine learning models to forecast intraday realized volatility (RV), by exploiting commonality in intraday volatility via pooling stock data together, and by incorporating a proxy for the market volatility.
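The forecasting target, realized volatility, is conventionally estimated from squared intraday returns; a minimal sketch:

```python
import numpy as np

def realized_volatility(intraday_returns):
    """Realized volatility (RV): the square root of the sum of squared
    intraday returns -- the standard estimator that serves as the
    forecasting target above."""
    r = np.asarray(intraday_returns, dtype=float)
    return float(np.sqrt(np.sum(r ** 2)))
```

Pooling stocks, as the abstract describes, amounts to fitting one model on RV series computed this way across many instruments.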
no code implementations • 6 Feb 2022 • Weijie Liu, Chao Zhang, Nenggan Zheng, Hui Qian
In this paper, we propose a novel criterion to measure the graph matching accuracy, structural inconsistency (SI), which is defined based on the network topological structure.
no code implementations • 25 Jan 2022 • Chao Zhang, Bo Li, Zhiyun Lu, Tara N. Sainath, Shuo-Yiin Chang
The recurrent neural network transducer (RNN-T) has recently become the mainstream end-to-end approach for streaming automatic speech recognition (ASR).
Automatic Speech Recognition (ASR) +1
1 code implementation • 4 Jan 2022 • Fangcheng Liu, Chao Zhang, Hongyang Zhang
Extensive experiments verify the effectiveness of our framework on balancing imperceptibility and transferability of the crafted adversarial examples.
no code implementations • CVPR 2022 • Jiewen Yang, Xingbo Dong, Liujun Liu, Chao Zhang, Jiajun Shen, Dahai Yu
Besides, the proposed RViT can work properly on both fixed-length and variable-length video clips without requiring large GPU memory, thanks to its frame-by-frame processing flow.
no code implementations • 25 Dec 2021 • Rama Cont, Mihai Cucuringu, Chao Zhang
We investigate the impact of order flow imbalance (OFI) on price movements in equity markets in a multi-asset setting.
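For reference, best-level OFI over a window of quote updates can be sketched as follows, following the event-level definition from earlier work by Cont et al.; this is a sketch of the quantity studied, not the paper's multi-asset estimator:

```python
def order_flow_imbalance(bid_p, bid_q, ask_p, ask_q):
    """Best-level order flow imbalance accumulated over quote updates.
    Inputs are sequences of best bid/ask prices and sizes; each update
    contributes positively when bid-side depth grows or ask-side depth
    shrinks, and negatively in the opposite cases."""
    ofi = 0.0
    for n in range(1, len(bid_p)):
        e = 0.0
        if bid_p[n] >= bid_p[n - 1]:
            e += bid_q[n]
        if bid_p[n] <= bid_p[n - 1]:
            e -= bid_q[n - 1]
        if ask_p[n] <= ask_p[n - 1]:
            e -= ask_q[n]
        if ask_p[n] >= ask_p[n - 1]:
            e += ask_q[n - 1]
        ofi += e
    return ofi
```

With unchanged prices, an increase in bid size registers as positive imbalance (buying pressure), which is the signal related to price moves above.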
no code implementations • 17 Dec 2021 • Yiyuan She, Jiahui Shen, Chao Zhang
In this paper, new information-theoretical limits are presented to reveal the intrinsic cost of seeking for clusters, as well as the blessing from dimensionality in multivariate learning.
1 code implementation • 16 Dec 2021 • Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang
We propose AcTune, a new framework that leverages unlabeled data to improve the label efficiency of active PLM fine-tuning.
no code implementations • 2 Dec 2021 • Chao Zhang, Zhijian Li, Hui Qian, Xin Du
We develop a general Dynamic-weight Particle-based Variational Inference (DPVI) framework according to a novel continuous composite flow, which evolves the positions and weights of particles simultaneously.
2 code implementations • NeurIPS 2021 • Yuhui Yuan, Rao Fu, Lang Huang, WeiHong Lin, Chao Zhang, Xilin Chen, Jingdong Wang
We present a High-Resolution Transformer (HRFormer) that learns high-resolution representations for dense prediction tasks, in contrast to the original Vision Transformer that produces low-resolution representations and has high memory and computational cost.
no code implementations • 24 Nov 2021 • Katsuya Hotta, Takuya Akashi, Shogo Tokai, Chao Zhang
Subspace clustering methods, which embrace a self-expressive model that represents each data point as a linear combination of the other data points in the dataset, provide powerful unsupervised learning techniques.
no code implementations • 17 Nov 2021 • Chao Zhang, Zihao Zhang, Mihai Cucuringu, Stefan Zohren
The designed framework circumvents the traditional forecasting step and avoids the estimation of the covariance matrix, lifting the bottleneck for generalizing to a large amount of instruments.
no code implementations • 12 Nov 2021 • Jaswanth Yella, Chao Zhang, Sergei Petrov, Yu Huang, Xiaoye Qian, Ali A. Minai, Sthitie Bom
Over the last few decades, modern industrial processes have investigated several cost-effective methodologies to improve the productivity and yield of semiconductor manufacturing.
no code implementations • 12 Nov 2021 • Weijie Liu, Chao Zhang, Nenggan Zheng, Hui Qian
Optimal transport (OT) naturally arises in a wide range of machine learning applications but may often become the computational bottleneck.
no code implementations • 12 Nov 2021 • Yu Huang, Chao Zhang, Jaswanth Yella, Sergei Petrov, Xiaoye Qian, Yufei Tang, Xingquan Zhu, Sthitie Bom
In the era of big data, data-driven based classification has become an essential method in smart manufacturing to guide production and optimize inspection.
no code implementations • 12 Nov 2021 • Xiaoye Qian, Chao Zhang, Jaswanth Yella, Yu Huang, Ming-Chun Huang, Sthitie Bom
To understand how the proposed model works, the deep visualization approach is applied.
no code implementations • 12 Nov 2021 • Chunzhi Gu, Shuofeng Zhao, Chao Zhang
In this paper, we present a deep generative model based method to generate diverse human motion interpolation results.
1 code implementation • 10 Nov 2021 • Chao Zhang, Jaswanth Yella, Yu Huang, Xiaoye Qian, Sergei Petrov, Andrey Rzhetsky, Sthitie Bom
We demonstrate the challenges and effectiveness of modeling industrial big data by a Soft Sensing Transformer model on these data sets.
1 code implementation • 2 Nov 2021 • Wenyu Zhu, Zhiyao Feng, Zihan Zhang, Jianjun Chen, Zhijian Ou, Min Yang, Chao Zhang
Recovering binary programs' call graphs is crucial for inter-procedural analysis tasks and the applications based on them. One of the core challenges is recognizing the targets of indirect calls (i.e., indirect callees).
no code implementations • 28 Oct 2021 • Chao Zhang, Hanxin Zhang, Atif Khan, Ted Kim, Olasubomi Omoleye, Oluwamayomikun Abiona, Amy Lehman, Christopher O. Olopade, Olufunmilayo I. Olopade, Pedro Lopes, Andrey Rzhetsky
Importance: Lower-resource areas in Africa and Asia face a unique set of healthcare challenges: the dual high burden of communicable and non-communicable diseases; a paucity of highly trained primary healthcare providers in both rural and densely populated urban areas; and a lack of reliable, inexpensive internet connections.
1 code implementation • 18 Oct 2021 • Yuhui Yuan, Rao Fu, Lang Huang, WeiHong Lin, Chao Zhang, Xilin Chen, Jingdong Wang
We present a High-Resolution Transformer (HRFormer) that learns high-resolution representations for dense prediction tasks, in contrast to the original Vision Transformer that produces low-resolution representations and has high memory and computational cost.
Ranked #3 on Pose Estimation on AIC
1 code implementation • 17 Oct 2021 • Yuefeng Chen, Xiaofeng Mao, Yuan He, Hui Xue, Chao Li, Yinpeng Dong, Qi-An Fu, Xiao Yang, Tianyu Pang, Hang Su, Jun Zhu, Fangcheng Liu, Chao Zhang, Hongyang Zhang, Yichi Zhang, Shilong Liu, Chang Liu, Wenzhao Xiang, Yajie Wang, Huipeng Zhou, Haoran Lyu, Yidan Xu, Zixuan Xu, Taoyu Zhu, Wenjun Li, Xianfeng Gao, Guoqiu Wang, Huanqian Yan, Ying Guo, Chaoning Zhang, Zheng Fang, Yang Wang, Bingyang Fu, Yunfei Zheng, Yekui Wang, Haorong Luo, Zhen Yang
Many works have investigated the adversarial attacks or defenses under the settings where a bounded and imperceptible perturbation can be added to the input.
no code implementations • 8 Oct 2021 • Zhiyun Lu, Yanwei Pan, Thibault Doutre, Parisa Haghani, Liangliang Cao, Rohit Prabhavalkar, Chao Zhang, Trevor Strohman
Our experiments show that for both losses, the WER on long-form speech decreases substantially as the training utterance length increases.
Automatic Speech Recognition (ASR) +1
no code implementations • Findings (NAACL) 2022 • Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha
In self-training, the student contributes to the prediction performance, and the teacher controls the training process by generating pseudo-labels.
1 code implementation • 15 Sep 2021 • Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodríguez, Chao Zhang, B. Aditya Prakash
We use CAMul for multiple domains with varied sources and modalities and show that CAMul outperforms other state-of-the-art probabilistic forecasting models by over 25% in accuracy and calibration.
no code implementations • 13 Sep 2021 • Chunzhi Gu, Yan Zhao, Chao Zhang
Human motion prediction, which plays a key role in computer vision, generally requires a past motion sequence as input.
no code implementations • Findings (EMNLP) 2021 • Yaqing Wang, Haoda Chu, Chao Zhang, Jing Gao
In this work, we study the problem of named entity recognition (NER) in a low resource scenario, focusing on few-shot and zero-shot settings.
no code implementations • 8 Sep 2021 • Yunqi Shao, Florian M. Dietrich, Carl Nettelblad, Chao Zhang
Here we compare the performance of two popular training algorithms, the adaptive moment estimation algorithm (Adam) and the Extended Kalman Filter algorithm (EKF), using the Behler-Parrinello neural network (BPNN) and two publicly accessible datasets of liquid water [Proc.