1 code implementation • 30 Oct 2023 • Tianwen Wei, Liang Zhao, Lichang Zhang, Bo Zhu, Lijie Wang, Haihua Yang, Biye Li, Cheng Cheng, Weiwei Lü, Rui Hu, Chenxia Li, Liu Yang, Xilin Luo, Xuejie Wu, Lunan Liu, Wenjun Cheng, Peng Cheng, Jianhao Zhang, XiaoYu Zhang, Lei Lin, Xiaokun Wang, Yutuan Ma, Chuanhai Dong, Yanqi Sun, Yifu Chen, Yongyi Peng, Xiaojuan Liang, Shuicheng Yan, Han Fang, Yahui Zhou
In this technical report, we present Skywork-13B, a family of large language models (LLMs) trained on a corpus of over 3. 2 trillion tokens drawn from both English and Chinese texts.
1 code implementation • 25 Oct 2023 • Liu Yang, Haihua Yang, Wenjun Cheng, Lei Lin, Chenxia Li, Yifu Chen, Lunan Liu, Jianfei Pan, Tianwen Wei, Biye Li, Liang Zhao, Lijie Wang, Bo Zhu, Guoliang Li, Xuejie Wu, Xilin Luo, Rui Hu
Large language models (LLMs) have shown great potential to solve varieties of natural language processing (NLP) tasks, including mathematical reasoning.
no code implementations • 29 Jun 2023 • Tianwen Wei, Jian Luan, Wei Liu, Shuang Dong, Bin Wang
We present the Chinese Elementary School Math Word Problems (CMATH) dataset, comprising 1. 7k elementary school-level math word problems with detailed annotations, source from actual Chinese workbooks and exams.
1 code implementation • ACL 2022 • Tianwen Wei, Jianwei Qi, Shenghuan He
In this demonstration, we present an efficient BERT-based multi-task (MT) framework that is particularly suitable for iterative and incremental development of the tasks.
3 code implementations • NAACL 2021 • Tianwen Wei, Jianwei Qi, Shenghuan He, Songtao Sun
Conditional Random Field (CRF) based neural models are among the most performant methods for solving sequence labeling problems.
no code implementations • 8 Jun 2015 • Stephane Chretien, Tianwen Wei
The subdifferential of convex functions of the singular spectrum of real matrices has been widely studied in matrix analysis, optimization and automatic control theory.
no code implementations • 26 May 2015 • Tianwen Wei
This contribution summarizes the results on the asymptotic performance of several variants of the FastICA algorithm.
no code implementations • 28 Aug 2014 • Tianwen Wei
In the first part of this work, we are interested in the relationship between demixing vectors, local optimizers of the contrast function and (attractive or unattractive) fixed points of FastICA algorithm.
no code implementations • 1 Aug 2014 • Tianwen Wei
This contribution deals with the generalized symmetric FastICA algorithm in the domain of Independent Component Analysis (ICA).