Search Results for author: Junqiu Wei

Found 4 papers, 2 papers with code

ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer

no code implementations • ACL 2022 • Ningning Wang, Guobing Gan, Peng Zhang, Shuai Zhang, Junqiu Wei, Qun Liu, Xin Jiang

Other sparse methods use clustering patterns to select words, but the clustering process is separate from the training process of the target task, which causes a decrease in effectiveness.

Clustering Machine Translation +4

Paper
Add Code

Training Multilingual Pre-trained Language Model with Byte-level Subwords

1 code implementation • 23 Jan 2021 • Junqiu Wei, Qun Liu, Yinpeng Guo, Xin Jiang

The pre-trained language models have achieved great successes in various natural language understanding (NLU) tasks due to its capacity to capture the deep contextualized information in text by pre-training on large-scale corpora.

Language Modelling Natural Language Understanding

2,977

Paper
Code

TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling

no code implementations • 28 Jul 2020 • Shuai Zhang, Peng Zhang, Xindian Ma, Junqiu Wei, Ningning Wang, Qun Liu

Transformer has been widely-used in many Natural Language Processing (NLP) tasks and the scaled dot-product attention between tokens is a core module of Transformer.

Language Modelling Machine Translation +2

Paper
Add Code

NEZHA: Neural Contextualized Representation for Chinese Language Understanding

10 code implementations • 31 Aug 2019 • Junqiu Wei, Xiaozhe Ren, Xiaoguang Li, Wenyong Huang, Yi Liao, Yasheng Wang, Jiashu Lin, Xin Jiang, Xiao Chen, Qun Liu

named-entity-recognition Named Entity Recognition +6

11,628

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.