Search Results for author: Jing Bai

Found 24 papers, 8 papers with code

Enhancing Self-Attention with Knowledge-Assisted Attention Maps

no code implementations • NAACL 2022 • Jiangang Bai, Yujing Wang, Hong Sun, Ruonan Wu, Tianmeng Yang, Pengfei Tang, Defu Cao, Mingliang Zhang, Yunhai Tong, Yaming Yang, Jing Bai, Ruofei Zhang, Hao Sun, Wei Shen

Large-scale pre-trained language models have attracted extensive attention in the research community and shown promising results on various natural language processing tasks.

Multi-Task Learning, Natural Language Understanding

NutePrune: Efficient Progressive Pruning with Numerous Teachers for Large Language Models

no code implementations • 15 Feb 2024 • Shengrui Li, Xueting Han, Jing Bai

Structured pruning offers an effective means to compress LLMs, reducing storage costs and enhancing inference speed for more efficient utilization.

Knowledge Distillation
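The abstract above describes structured pruning only at a high level. As a generic illustration (not the NutePrune algorithm; the function names and the norm-based importance criterion are assumptions), the sketch below prunes the least-important hidden units of a feed-forward block:

```python
# Minimal structured-pruning sketch (illustrative only, not the NutePrune method):
# drop the hidden units of a 2-layer feed-forward block with the smallest weight norms.
import torch
import torch.nn as nn

def prune_ffn(ffn_in: nn.Linear, ffn_out: nn.Linear, keep_ratio: float = 0.5):
    """Remove hidden units with the smallest L2 norm (a simple importance proxy)."""
    # Importance of each hidden unit = norm of its incoming plus outgoing weights.
    importance = ffn_in.weight.norm(dim=1) + ffn_out.weight.norm(dim=0)
    k = max(1, int(keep_ratio * importance.numel()))
    keep = importance.topk(k).indices.sort().values

    new_in = nn.Linear(ffn_in.in_features, k)
    new_out = nn.Linear(k, ffn_out.out_features)
    with torch.no_grad():
        new_in.weight.copy_(ffn_in.weight[keep])
        new_in.bias.copy_(ffn_in.bias[keep])
        new_out.weight.copy_(ffn_out.weight[:, keep])
        new_out.bias.copy_(ffn_out.bias)
    return new_in, new_out

# Usage: shrink a 1024-unit hidden layer to 512 units.
fc1, fc2 = nn.Linear(256, 1024), nn.Linear(1024, 256)
fc1, fc2 = prune_ffn(fc1, fc2, keep_ratio=0.5)
```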

Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions

no code implementations • 16 Jun 2023 • Dongshuo Yin, Xueting Han, Bin Li, Hao Feng, Jing Bai

We provide a gradient backpropagation highway for low-rank adapters which eliminates the need for expensive backpropagation through the frozen pre-trained model, resulting in substantial savings of training memory and training time.

Transfer Learning
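One hedged reading of the "gradient backpropagation highway" idea is a trainable side path that consumes features from the frozen backbone without keeping its autograd graph, so the backward pass never traverses the frozen model. The sketch below illustrates that assumption only; the module names and sizes are hypothetical, not the paper's architecture:

```python
# Illustrative sketch: a trainable low-rank side adapter fed with backbone features
# computed under no_grad, so gradients never flow through the frozen model.
import torch
import torch.nn as nn

class FrozenBackbone(nn.Module):
    def __init__(self, dim=256):
        super().__init__()
        self.layers = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))
        for p in self.parameters():
            p.requires_grad_(False)

    def forward(self, x):
        return self.layers(x)

class LowRankSideAdapter(nn.Module):
    def __init__(self, dim=256, rank=8, num_classes=10):
        super().__init__()
        self.down = nn.Linear(dim, rank)
        self.up = nn.Linear(rank, dim)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, feats):
        # feats carry no autograd graph, so the backward pass stays inside this tiny module.
        return self.head(feats + self.up(torch.relu(self.down(feats))))

backbone, adapter = FrozenBackbone(), LowRankSideAdapter()
x = torch.randn(4, 256)
with torch.no_grad():        # no activations are stored for the frozen backbone
    feats = backbone(x)
logits = adapter(feats)      # only the adapter's activations are stored
logits.sum().backward()      # cheap backward pass over the adapter alone
```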

Time-aware Graph Structure Learning via Sequence Prediction on Temporal Graphs

1 code implementation • 13 Jun 2023 • Haozhen Zhang, Xueting Han, Xi Xiao, Jing Bai

To address these issues, we propose a Time-aware Graph Structure Learning (TGSL) approach via sequence prediction on temporal graphs, which learns better graph structures for downstream tasks by adding potential temporal edges.

Contrastive Learning, Data Augmentation, +3
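As a toy illustration of "adding potential temporal edges" for graph structure learning (the dot-product scoring rule, the top-k selection, and all names below are assumptions; TGSL itself scores edges via sequence prediction on temporal graphs):

```python
# Toy sketch of graph structure learning by adding candidate temporal edges.
# The similarity-based scoring and top-k selection are illustrative assumptions.
import torch

def add_temporal_edges(node_emb, edge_index, candidates, top_k=2):
    """Score candidate (src, dst) pairs and append the top-k as new edges."""
    src, dst = candidates[:, 0], candidates[:, 1]
    scores = (node_emb[src] * node_emb[dst]).sum(dim=-1)   # dot-product similarity
    keep = scores.topk(min(top_k, scores.numel())).indices
    new_edges = candidates[keep].t()                        # shape (2, top_k)
    return torch.cat([edge_index, new_edges], dim=1)

node_emb = torch.randn(5, 16)                               # 5 nodes, 16-dim embeddings
edge_index = torch.tensor([[0, 1], [1, 2]])                 # existing edges, shape (2, E)
candidates = torch.tensor([[0, 2], [3, 4], [1, 4]])         # potential temporal edges
print(add_temporal_edges(node_emb, edge_index, candidates).shape)
```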

AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in GNNs

1 code implementation • 19 Apr 2023 • Shengrui Li, Xueting Han, Jing Bai

AdapterGNN preserves the knowledge of the large pre-trained model and leverages highly expressive adapters for GNNs, which can adapt to downstream tasks effectively with only a few parameters, while also improving the model's generalization ability.

Generalization Bounds
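A minimal sketch of the general adapter idea next to a frozen GNN layer, assuming a residual bottleneck module and a toy mean-aggregation layer (both illustrative, not the exact AdapterGNN design):

```python
# Minimal sketch: a bottleneck adapter applied after a frozen GNN layer.
# The GNN layer here is a toy mean-aggregation layer; names and sizes are illustrative.
import torch
import torch.nn as nn

class ToyGNNLayer(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.lin = nn.Linear(dim, dim)

    def forward(self, x, adj):
        # mean aggregation over neighbours followed by a linear transform
        deg = adj.sum(dim=1, keepdim=True).clamp(min=1)
        return torch.relu(self.lin(adj @ x / deg))

class Adapter(nn.Module):
    def __init__(self, dim, bottleneck=8):
        super().__init__()
        self.down, self.up = nn.Linear(dim, bottleneck), nn.Linear(bottleneck, dim)

    def forward(self, x):
        return x + self.up(torch.relu(self.down(x)))   # residual bottleneck

dim = 32
gnn, adapter = ToyGNNLayer(dim), Adapter(dim)
for p in gnn.parameters():          # pre-trained GNN weights stay frozen
    p.requires_grad_(False)

x, adj = torch.randn(6, dim), torch.eye(6)
out = adapter(gnn(x, adj))          # only the adapter's few parameters are trained
```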

An Automated Question-Answering Framework Based on Evolution Algorithm

no code implementations • 26 Jan 2022 • Sinan Tan, Hui Xue, Qiyu Ren, Huaping Liu, Jing Bai

Our framework is based on an innovative evolution algorithm, which is stable and well suited to multi-dataset scenarios.

Question Answering

Learning Multi-granularity User Intent Unit for Session-based Recommendation

1 code implementation • 25 Dec 2021 • Jiayan Guo, Yaming Yang, Xiangchen Song, Yuan Zhang, Yujing Wang, Jing Bai, Yan Zhang

Specifically, we propose a Multi-granularity Intent Heterogeneous Session Graph, which captures the interactions between intent units of different granularities and relieves the burden of long-range dependencies.

Session-Based Recommendations

Graph Pointer Neural Networks

no code implementations • 3 Oct 2021 • Tianmeng Yang, Yujing Wang, Zhihan Yue, Yaming Yang, Yunhai Tong, Jing Bai

On the one hand, multi-hop-based approaches do not explicitly distinguish relevant nodes from a large number of multi-hop neighborhoods, leading to a severe over-smoothing problem.

Node Classification

Attentive Knowledge-aware Graph Convolutional Networks with Collaborative Guidance for Personalized Recommendation

no code implementations • 5 Sep 2021 • Yankai Chen, Yaming Yang, Yujing Wang, Jing Bai, Xiangchen Song, Irwin King

However, simply integrating KGs into current KG-based RS models does not necessarily improve recommendation performance, and may even weaken the holistic model capability.

Click-Through Rate Prediction, Knowledge-Aware Recommendation, +1

Adaptive Transfer Learning on Graph Neural Networks

1 code implementation • 19 Jul 2021 • Xueting Han, Zhenhuan Huang, Bang An, Jing Bai

We design an adaptive auxiliary loss weighting model to learn the weights of auxiliary tasks by quantifying the consistency between auxiliary tasks and the target task.

Meta-Learning, Multi-Task Learning
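One plausible, but assumed, reading of "quantifying the consistency between auxiliary tasks and the target task" is gradient similarity; the sketch below weights an auxiliary loss by the cosine similarity between its gradient and the target-task gradient (illustrative only, not the paper's weighting model):

```python
# Illustrative sketch: weight an auxiliary loss by how consistent its gradient
# is with the target-task gradient (cosine similarity, clamped to be non-negative).
# This is an assumed proxy for "task consistency", not the paper's exact model.
import torch
import torch.nn as nn
import torch.nn.functional as F

model = nn.Linear(16, 4)
x, y_target, y_aux = torch.randn(8, 16), torch.randn(8, 4), torch.randn(8, 4)

def flat_grad(loss):
    grads = torch.autograd.grad(loss, model.parameters(), retain_graph=True)
    return torch.cat([g.reshape(-1) for g in grads])

target_loss = F.mse_loss(model(x), y_target)
aux_loss = F.mse_loss(model(x), y_aux)

g_target, g_aux = flat_grad(target_loss), flat_grad(aux_loss)
weight = F.cosine_similarity(g_target, g_aux, dim=0).clamp(min=0.0)  # consistency score

total_loss = target_loss + weight.detach() * aux_loss
total_loss.backward()
```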

Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees

1 code implementation • EACL 2021 • Jiangang Bai, Yujing Wang, Yiren Chen, Yaming Yang, Jing Bai, Jing Yu, Yunhai Tong

Pre-trained language models like BERT achieve superior performance on various NLP tasks without explicit consideration of syntactic information.

Natural Language Understanding

Evolving Attention with Residual Convolutions

2 code implementations • 20 Feb 2021 • Yujing Wang, Yaming Yang, Jiangang Bai, Mingliang Zhang, Jing Bai, Jing Yu, Ce Zhang, Gao Huang, Yunhai Tong

In this paper, we propose a novel and generic mechanism based on evolving attention to improve the performance of transformers.

Image Classification, Machine Translation, +2

Predictive Attention Transformer: Improving Transformer with Attention Map Prediction

no code implementations • 1 Jan 2021 • Yujing Wang, Yaming Yang, Jiangang Bai, Mingliang Zhang, Jing Bai, Jing Yu, Ce Zhang, Yunhai Tong

Instead, we model their dependencies via a chain of prediction models that take previous attention maps as input to predict the attention maps of a new layer through convolutional neural networks.

Machine Translation
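A rough sketch of the mechanism described above, assuming a small 2-D convolution maps the previous layer's attention maps to a prediction for the next layer, which is then mixed with the usual scaled dot-product scores (the convolution shape and mixing coefficient are illustrative, not the paper's exact design):

```python
# Sketch: predict a layer's attention maps from the previous layer's maps with a
# small CNN, then mix them with the standard scaled dot-product scores.
import torch
import torch.nn as nn

class PredictiveAttention(nn.Module):
    def __init__(self, dim, heads=4, alpha=0.5):
        super().__init__()
        self.heads, self.scale, self.alpha = heads, (dim // heads) ** -0.5, alpha
        self.qkv = nn.Linear(dim, 3 * dim)
        # conv over (heads, seq, seq): previous maps in, predicted maps out
        self.predict = nn.Conv2d(heads, heads, kernel_size=3, padding=1)

    def forward(self, x, prev_attn=None):
        b, n, d = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q = q.view(b, n, self.heads, -1).transpose(1, 2)
        k = k.view(b, n, self.heads, -1).transpose(1, 2)
        v = v.view(b, n, self.heads, -1).transpose(1, 2)
        scores = (q @ k.transpose(-2, -1)) * self.scale          # (b, heads, n, n)
        if prev_attn is not None:                                 # mix in the CNN prediction
            scores = (1 - self.alpha) * scores + self.alpha * self.predict(prev_attn)
        attn = scores.softmax(dim=-1)
        out = (attn @ v).transpose(1, 2).reshape(b, n, d)
        return out, attn                                          # pass attn to the next layer

layer1, layer2 = PredictiveAttention(64), PredictiveAttention(64)
x = torch.randn(2, 10, 64)
h, attn1 = layer1(x)
h, attn2 = layer2(h, prev_attn=attn1)                             # chained attention maps
```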

Multivariate Time-series Anomaly Detection via Graph Attention Network

2 code implementations • 4 Sep 2020 • Hang Zhao, Yujing Wang, Juanyong Duan, Congrui Huang, Defu Cao, Yunhai Tong, Bixiong Xu, Jing Bai, Jie Tong, Qi Zhang

Anomaly detection on multivariate time-series is of great importance in both data mining research and industrial applications.

Anomaly Detection, Graph Attention, +3

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

no code implementations • COLING 2020 • Yihuan Mao, Yujing Wang, Chufan Wu, Chen Zhang, Yang Wang, Yaming Yang, Quanlu Zhang, Yunhai Tong, Jing Bai

BERT is a cutting-edge language representation model pre-trained on a large corpus, which achieves superior performance on various natural language understanding tasks.

Blocking, Knowledge Distillation, +2

Contractive De-noising Auto-encoder

no code implementations • 17 May 2013 • Fu-qiang Chen, Yan Wu, Guo-dong Zhao, Jun-ming Zhang, Ming Zhu, Jing Bai

An auto-encoder is a special kind of neural network based on reconstruction.
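In that spirit, a minimal contractive de-noising auto-encoder can be sketched as follows: reconstruct clean inputs from noise-corrupted ones while penalizing the Frobenius norm of the sigmoid encoder's Jacobian, which has a closed form (the layer sizes and coefficients below are illustrative, not taken from the paper):

```python
# Minimal contractive de-noising auto-encoder sketch (illustrative sizes/coefficients).
import torch
import torch.nn as nn
import torch.nn.functional as F

class CDAE(nn.Module):
    def __init__(self, n_in=784, n_hidden=128):
        super().__init__()
        self.enc = nn.Linear(n_in, n_hidden)
        self.dec = nn.Linear(n_hidden, n_in)

    def forward(self, x):
        h = torch.sigmoid(self.enc(x))           # encoder
        return self.dec(h), h                    # reconstruction and hidden code

    def contractive_penalty(self, h):
        # For a sigmoid encoder, the Jacobian Frobenius norm has a closed form:
        # sum_j (h_j * (1 - h_j))^2 * ||W_j||^2, where W_j is the j-th encoder row.
        w_norm = (self.enc.weight ** 2).sum(dim=1)            # (n_hidden,)
        return ((h * (1 - h)) ** 2 @ w_norm).mean()

model = CDAE()
x = torch.rand(16, 784)
x_noisy = x + 0.1 * torch.randn_like(x)           # de-noising: corrupt the input
recon, h = model(x_noisy)
loss = F.mse_loss(recon, x) + 1e-3 * model.contractive_penalty(h)
loss.backward()
```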
