Methods > General > Skip Connections

Residual Connection

Introduced by He et al. in Deep Residual Learning for Image Recognition

Residual Connections are a type of skip-connection that learn residual functions with reference to the layer inputs, instead of learning unreferenced functions.

Formally, denoting the desired underlying mapping as $\mathcal{H}({x})$, we let the stacked nonlinear layers fit another mapping of $\mathcal{F}({x}):=\mathcal{H}({x})-{x}$. The original mapping is recast into $\mathcal{F}({x})+{x}$.

The intuition is that it is easier to optimize the residual mapping than to optimize the original, unreferenced mapping. To the extreme, if an identity mapping were optimal, it would be easier to push the residual to zero than to fit an identity mapping by a stack of nonlinear layers.

Source: Deep Residual Learning for Image Recognition

Latest Papers

PAPER DATE
A Sample-Based Training Method for Distantly Supervised Relation Extraction with Pre-Trained Transformers
Mehrdad NasserMohamad Bagher SajadiBehrouz Minaei-Bidgoli
2021-04-15
Emotion Dynamics Modeling via BERT
Haiqin YangJianping Shen
2021-04-15
Are Multilingual BERT models robust? A Case Study on Adversarial Attacks for Multilingual Question Answering
Sara RosenthalMihaela BorneaAvirup Sil
2021-04-15
SINA-BERT: A pre-trained Language Model for Analysis of Medical Texts in Persian
Nasrin TaghizadehEhsan DoostmohammadiElham SeifossadatHamid R. RabieeMaedeh S. Tahaei
2021-04-15
Privacy-Adaptive BERT for Natural Language Understanding
Chen QuWeize KongLiu YangMingyang ZhangMichael BenderskyMarc Najork
2021-04-15
UHD-BERT: Bucketed Ultra-High Dimensional Sparse Representations for Full Ranking
Kyoung-Rok JangJunmo KangGiwon HongSung-Hyon MyaengJoohee ParkTaewon YoonHeecheol Seo
2021-04-15
ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
| Swarnadeep SahaPrateek YadavLisa BauerMohit Bansal
2021-04-15
Demystify Optimization Challenges in Multilingual Transformers
Xian LiHongyu Gong
2021-04-15
Text Guide: Improving the quality of long text classification by a text selection method based on feature importance
Krzysztof FiokWaldemar KarwowskiEdgar GutierrezMohammad Reza DavahliMaciej WilamowskiTareq AhramAwad Al-JuaidJozef Zurada
2021-04-15
Self-supervised Video Object Segmentation by Motion Grouping
Charig YangHala LamdouarErika LuAndrew ZissermanWeidi Xie
2021-04-15
Vision Transformer using Low-level Chest X-ray Feature Corpus for COVID-19 Diagnosis and Severity Quantification
Sangjoon ParkGwanghyun KimYujin OhJoon Beom SeoSang Min LeeJin Hwan KimSungjun MoonJae-Kwang LimJong Chul Ye
2021-04-15
Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching
Wenxin HouJindong WangXu TanTao QinTakahiro Shinozaki
2021-04-15
UIT-E10dot3 at SemEval-2021 Task 5: Toxic Spans Detection with Named Entity Recognition and Question-Answering Approaches
Phu Gia HoangLuan Thanh NguyenKiet Van Nguyen
2021-04-15
BERT based Transformers lead the way in Extraction of Health Information from Social Media
Sidharth RAbhiraj TiwariParthivi ChoubeySaisha KashyapSahil KhoseKumud LakaraNishesh SinghUjjwal Verma
2021-04-15
NT5?! Training T5 to Perform Numerical Reasoning
| Peng-Jian YangYing Ting ChenYuechan ChenDaniel Cer
2021-04-15
TorontoCL at CMCL 2021 Shared Task: RoBERTa with Multi-Stage Fine-Tuning for Eye-Tracking Prediction
| Bai LiFrank Rudzicz
2021-04-15
Points as Queries: Weakly Semi-supervised Object Detection by Points
Liangyu ChenTong YangXiangyu ZhangWei zhangJian Sun
2021-04-15
Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Yixuan ZhouChanghe SongJingbei LiZhiyong WuHelen Meng
2021-04-14
An Introduction of mini-AlphaStar
| Ruo-Ze LiuWenhai WangYanjie ShenZhiqi LiYang YuTong Lu
2021-04-14
NAREOR: The Narrative Reordering Problem
Varun GangalSteven Y. FengEduard HovyTeruko Mitamura
2021-04-14
Decoupled Spatial-Temporal Transformer for Video Inpainting
Rui LiuHanming DengYangyi HuangXiaoyu ShiLewei LuWenxiu SunXiaogang WangJifeng DaiHongsheng Li
2021-04-14
Sparse Attention with Linear Units
Biao ZhangIvan TitovRico Sennrich
2021-04-14
Knowledge-driven Answer Generation for Conversational Search
Mariana LeiteRafael FerreiraDavid SemedoJoão Magalhães
2021-04-14
Non-autoregressive sequence-to-sequence voice conversion
Tomoki HayashiWen-Chin HuangKazuhiro KobayashiTomoki Toda
2021-04-14
On the Robustness of Goal Oriented Dialogue Systems to Real-world Noise
Jason KroneSailik SenguptaSaab Mansoor
2021-04-14
Disentangling Representations of Text by Masking Transformers
Xiongyi ZhangJan-Willem van de MeentByron C. Wallace
2021-04-14
An Interpretability Illusion for BERT
Tolga BolukbasiAdam PearceAnn YuanAndy CoenenEmily ReifFernanda ViégasMartin Wattenberg
2021-04-14
Static Embeddings as Efficient Knowledge Bases?
| Philipp DufterNora KassnerHinrich Schütze
2021-04-14
TWEAC: Transformer with Extendable QA Agent Classifiers
| Gregor GeigleNils ReimersAndreas RückléIryna Gurevych
2021-04-14
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
| Michihiro YasunagaHongyu RenAntoine BosselutPercy LiangJure Leskovec
2021-04-13
UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for Detecting and Rating Humor and Offense
Răzvan-Alexandru SmăduDumitru-Clementin CercelMihai Dascalu
2021-04-13
Semantic maps and metrics for science Semantic maps and metrics for science using deep transformer encoders
Brendan ChambersJames Evans
2021-04-13
Lite-HRNet: A Lightweight High-Resolution Network
| Changqian YuBin XiaoChangxin GaoLu YuanLei ZhangNong SangJingdong Wang
2021-04-13
Learning and Planning in Complex Action Spaces
Thomas HubertJulian SchrittwieserIoannis AntonoglouMohammadamin BarekatainSimon SchmittDavid Silver
2021-04-13
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian SchrittwieserThomas HubertAmol MandhaneMohammadamin BarekatainIoannis AntonoglouDavid Silver
2021-04-13
Understanding Transformers for Bot Detection in Twitter
| Andres Garcia-SilvaCristian BerrioJose Manuel Gomez-Perez
2021-04-13
1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed
Conglong LiAmmar Ahmad AwanHanlin TangSamyam RajbhandariYuxiong He
2021-04-13
Mediators in Determining what Processing BERT Performs First
| Aviv SlobodkinLeshem ChoshenOmri Abend
2021-04-13
Transformer-based Methods for Recognizing Ultra Fine-grained Entities (RUFES)
Emanuela BorosAntoine Doucet
2021-04-13
Discourse Probing of Pretrained Language Models
Fajri KotoJey Han LauTimothy Baldwin
2021-04-13
Large-Scale Contextualised Language Modelling for Norwegian
| Andrey KutuzovJeremy BarnesErik VelldalLilja ØvrelidStephan Oepen
2021-04-13
MS2: Multi-Document Summarization of Medical Studies
| Jay DeYoungIz BeltagyMadeleine van ZuylenBailey KuehlLucy Lu Wang
2021-04-13
ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration
| Junyu ChenYufan HeEric C. FreyYe LiYong Du
2021-04-13
Can a Transformer Pass the Wug Test? Tuning Copying Bias in Neural Morphological Inflection Models
Ling LiuMans Hulden
2021-04-13
Fighting the COVID-19 Infodemic with a Holistic BERT Ensemble
| Giorgos TziafasKonstantinos KogkalidisTommaso Caselli
2021-04-12
Multilingual Language Models Predict Human Reading Behavior
| Nora HollensteinFederico PirovanoCe ZhangLena JägerLisa Beinborn
2021-04-12
Updater-Extractor Architecture for Inductive World State Representations
Arseny MoskvichevJames A. Liu
2021-04-12
Learning dynamic and hierarchical traffic spatiotemporal features with Transformer
Haoyang YanXiaolei Ma
2021-04-12
Escaping the Big Data Paradigm with Compact Transformers
| Ali HassaniSteven WaltonNikhil ShahAbulikemu AbuduweiliJiachen LiHumphrey Shi
2021-04-12
Cloth Interactive Transformer for Virtual Try-On
Bin RenHao TangFanyang MengRunwei DingLing ShaoPhilip H. S. TorrNicu Sebe
2021-04-12
ENOS: Energy-Aware Network Operator Search for Hybrid Digital and Compute-in-Memory DNN Accelerators
Shamma NasrinAhish ShylendraYuti KadakiaNick IlievWilfred GomesTheja TulabandhulaAmit Ranjan Trivedi
2021-04-12
WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Smoothed Labels
Nan BaiRenqian LuoPirouz NourianAna Pereira Roders
2021-04-12
Fine-Tuning Transformers for Identifying Self-Reporting Potential Cases and Symptoms of COVID-19 in Tweets
| Max FlemingPriyanka DondetiCaitlin N. DreisbachAdam Poliak
2021-04-12
Learning to Remove: Towards Isotropic Pre-trained BERT Embedding
| Yuxin LiangRui CaoJie ZhengJie RenLing Gao
2021-04-12
Family of Origin and Family of Choice: Massively Parallel Lexiconized Iterative Pretraining for Severely Low Resource Machine Translation
Zhong ZhouAlex Waibel
2021-04-12
A Recipe for Global Convergence Guarantee in Deep Neural Networks
Kenji KawaguchiQingyun Sun
2021-04-12
On Representation Learning for Scientific News Articles Using Heterogeneous Knowledge Graphs
Angelika RomanouPanayiotis SmerosKarl Aberer
2021-04-12
Learning to Synthesize Data for Semantic Parsing
| Bailin WangWenpeng YinXi Victoria LinCaiming Xiong
2021-04-12
Paragraph-level Simplification of Medical Texts
Ashwin DevarajIain J. MarshallByron C. WallaceJunyi Jessy Li
2021-04-12
One Ring to Rule Them All: a simple solution to multi-view 3D-Reconstruction of shapes with unknown BRDF via a small Recurrent ResNet
Ziang ChengHongdong LiRichard HartleyYinqiang ZhengImari Sato
2021-04-11
Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa
| Junqi DaiHang YanTianxiang SunPengFei LiuXipeng Qiu
2021-04-11
Innovative Bert-based Reranking Language Models for Speech Recognition
Shih-Hsuan ChiuBerlin Chen
2021-04-11
UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost
Zhen WuLijun WuQi MengYingce XiaShufang XieTao QinXinyu DaiTie-Yan Liu
2021-04-11
Research on Optimization Method of Multi-scale Fish Target Fast Detection Network
Yang LiuShengmao ZhangFei WangWei FanGuohua ZouJing Bo
2021-04-11
MIPT-NSU-UTMN at SemEval-2021 Task 5: Ensembling Learning with Pre-trained Language Models for Toxic Spans Detection
| Mikhail KotyushevAnna GlazkovaDmitry Morozov
2021-04-10
Meta-tuning Language Models to Answer Prompts Better
Ruiqi ZhongKristy LeeZheng ZhangDan Klein
2021-04-10
ZS-BERT: Towards Zero-Shot Relation Extraction with Attribute Representation Learning
| Chih-Yao ChenCheng-Te Li
2021-04-10
Non-autoregressive Transformer-based End-to-end ASR using BERT
Fu-Hao YuKuan-Yu Chen
2021-04-10
Knowledge-Aware Graph-Enhanced GPT-2 for Dialogue State Tracking
Weizhe LinBo-Hsian TsengBill Byrne
2021-04-09
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai QiZizheng PanYicong HongMing-Hsuan YangAnton Van Den HengelQi Wu
2021-04-09
Combined Depth Space based Architecture Search For Person Re-identification
Hanjun LiGaojie WuWei-Shi Zheng
2021-04-09
Towards Fine-grained Visual Representations by Combining Contrastive Learning with Image Reconstruction and Attention-weighted Pooling
| Jonas DippelSteffen DippelJohannes Höhne
2021-04-09
Text2Chart: A Multi-Staged Chart Generator from Natural Language Text
Md. Mahinur RashidHasin Kawsar JahanAnnysha HuzzatRiyasaat Ahmed RahulTamim Bin ZakirFarhana MeemMd. Saddam Hossain MuktaSwakkhar Shatabda
2021-04-09
Deep Transformer Networks for Time Series Classification: The NPP Safety Case
Bing ZhaAlessandro VanniYassin HassanTunc AldemirAlper Yilmaz
2021-04-09
DenResCov-19: A deep transfer learning network for robust automatic classification of COVID-19, pneumonia, and tuberculosis from X-rays
Michail MamalakisAndrew J. SwiftBart VorselaarsSurajit RaySimonne WeeksWeiping DingRichard H. ClaytonLouise S. MackenzieAbhirup Banerjee
2021-04-08
Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic
Yakoob KhanWeicheng MaSoroush Vosoughi
2021-04-08
Revisiting Simple Neural Probabilistic Language Models
Simeng SunMohit Iyyer
2021-04-08
Uppsala NLP at SemEval-2021 Task 2: Multilingual Language Models for Fine-tuning and Feature Extraction in Word-in-Context Disambiguation
Huiling YouXingran ZhuSara Stymne
2021-04-08
Probing BERT in Hyperbolic Spaces
| Boli ChenYao FuGuangwei XuPengjun XieChuanqi TanMosha ChenLiping Jing
2021-04-08
Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions
Abhijit Guha RoyJie RenShekoofeh AziziAaron LohVivek NatarajanBasil MustafaNick PawlowskiJan FreybergYuAn LiuZach BeaverNam VoPeggy BuiSamantha WinterPatricia MacWilliamsGreg S. CorradoUmesh TelangYun LiuTaylan CemgilAlan KarthikesalingamBalaji LakshminarayananJim Winkens
2021-04-08
Graph Attention Networks for Anti-Spoofing
Hemlata TakJee-weon JungJose PatinoMassimiliano TodiscoNicholas Evans
2021-04-08
Rethinking and Improving the Robustness of Image Style Transfer
Pei WangYijun LiNuno Vasconcelos
2021-04-08
Facial Attribute Transformers for Precise and Robust Makeup Transfer
Zhaoyi WanHaoran ChenJielei ZhangWentao JiangCong YaoJiebo Luo
2021-04-07
LI-Net: Large-Pose Identity-Preserving Face Reenactment Network
Jin LiuPeng ChenTao LiangZhaoxing LiCai YuShuqiao ZouJiao DaiJizhong Han
2021-04-07
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Zhicheng HuangZhaoyang ZengYupan HuangBei LiuDongmei FuJianlong Fu
2021-04-07
Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing Detection
Wanying GeMichele PanarielloJose PatinoMassimiliano TodiscoNicholas Evans
2021-04-07
Combining Pre-trained Word Embeddings and Linguistic Features for Sequential Metaphor Identification
Rui MaoChenghua LinFrank Guerin
2021-04-07
Interpreting A Pre-trained Model Is A Key For Model Architecture Optimization: A Case Study On Wav2Vec 2.0
Liu ChenMeysam Asgari
2021-04-07
Better Neural Machine Translation by Extracting Linguistic Information from BERT
| Hassan S. ShavaraniAnoop Sarkar
2021-04-07
Interpreting Verbal Metaphors by Paraphrasing
Rui MaoChenghua LinFrank Guerin
2021-04-07
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs
| Sujeong ChaWangrui HouHyun JungMy PhungMichael PichenyHong-Kwang KuoSamuel ThomasEdmilson Morais
2021-04-07
Attention Head Masking for Inference Time Content Selection in Abstractive Summarization
Shuyang CaoLu Wang
2021-04-06
Fourier Image Transformer
| Tim-Oliver BuchholzFlorian Jug
2021-04-06
Variational Transformer Networks for Layout Generation
Diego Martin ArroyoJanis PostelsFederico Tombari
2021-04-06
Content-Aware GAN Compression
Yuchen LiuZhixin ShuYijun LiZhe LinFederico PerazziS. Y. Kung
2021-04-06
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Anton MitrofanovMariya KorenevskayaIvan PodluzhnyYuri KhokhlovAleksandr LaptevAndrei AndrusenkoAleksei IlinMaxim KorenevskyIvan MedennikovAleksei Romanenko
2021-04-06
MuSLCAT: Multi-Scale Multi-Level Convolutional Attention Transformer for Discriminative Music Modeling on Raw Waveforms
Kai MiddlebrookShyam SudhakaranDavid Guy Brizan
2021-04-06
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Neural Machine Translation
Bei LiQuan DuTao ZhouShuhan ZhouXin ZengTong XiaoJingbo Zhu
2021-04-06
hBert + BiasCorp -- Fighting Racism on the Web
Olawale OnabolaZhuang MaYang XieBenjamin AkeraAbdulrahman IbraheemJia XueDianbo LiuYoshua Bengio
2021-04-06
Speaker embeddings by modeling channel-wise correlations
Themos StafylakisJohan RohdinLukas Burget
2021-04-06
Variable selection with missing data in both covariates and outcomes: Imputation and machine learning
| Liangyuan HuJung-Yi Joyce LinJiayi Ji
2021-04-06
CodeTrans: Towards Cracking the Language of Silicone's Code Through Self-Supervised Deep Learning and High Performance Computing
| Ahmed ElnaggarWei DingLlion JonesTom GibbsTamas FeherChristoph AngererSilvia SeveriniFlorian MatthesBurkhard Rost
2021-04-06
Integrating Frequency Translational Invariance in TDNNs and Frequency Positional Information in 2D ResNets to Enhance Speaker Verification
Jenthe ThienpondtBrecht DesplanquesKris Demuynck
2021-04-06
Efficient transfer learning for NLP with ELECTRA
| François Mercier
2021-04-06
A fully automated end-to-end process for fluorescence microscopy images of yeast cells: From segmentation to detection and classification
Asmaa HajaLambert R. B. Schomaker
2021-04-06
AST: Audio Spectrogram Transformer
Yuan GongYu-An ChungJames Glass
2021-04-05
Exploring Transformers in Emotion Recognition: a comparison of BERT, DistillBERT, RoBERTa, XLNet and ELECTRA
Diogo Cortiz
2021-04-05
What's the best place for an AI conference, Vancouver or ______: Why completing comparative questions is difficult
Avishai ZagouryEinat MinkovIdan SzpektorWilliam W. Cohen
2021-04-05
Insight about Detection, Prediction and Weather Impact of Coronavirus (Covid-19) using Neural Network
A K M Bahalul HaqueTahmid Hasan PrantoAbdulla All NomanAtik Mahmood
2021-04-05
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Suyoun KimAbhinav AroraDuc LeChing-Feng YehChristian FuegenOzlem KalinliMichael L. Seltzer
2021-04-05
ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction
| Abhishek MittalAshutosh Modi
2021-04-04
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning
| Hui LiuDanqing ZhangBing YinXiaodan Zhu
2021-04-04
TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling
Tze Yuang ChongXuyang WangLin YangJunjie Wang
2021-04-04
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers
Rohan GuptaJay MundraDeepak MahajanAshutosh Modi
2021-04-04
IITK@Detox at SemEval-2021 Task 5: Semi-Supervised Learning and Dice Loss for Toxic Spans Detection
| Archit BansalAbhay KaushikAshutosh Modi
2021-04-04
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances
| Chang ZengXin WangErica CooperJunichi Yamagishi
2021-04-04
IndT5: A Text-to-Text Transformer for 10 Indigenous Languages
El Moatez Billah NagoudiWei-Rui ChenMuhammad Abdul-MageedHasan Cavusogl
2021-04-04
Deepfake Detection Scheme Based on Vision Transformer and Distillation
Young-Jin HeoYoung-Ju ChoiYoung-Woon LeeByung-Gyu Kim
2021-04-03
Unsupervised Domain Adaptation with Global and Local Graph Neural Networks in Limited Labeled Data Scenario: Application to Disaster Management
Samujjwal GhoshSubhadeep MajiMaunendra Sankar Desarkar
2021-04-03
Exploring the Role of BERT Token Representations to Explain Sentence Probing Results
Hosein MohebbiAli ModarressiMohammad Taher Pilehvar
2021-04-03
Deep Feature CycleGANs: Speaker Identity Preserving Non-parallel Microphone-Telephone Domain Adaptation for Speaker Verification
Saurabh KatariaJesús VillalbaPiotr ŻelaskoLaureano Moro-VelázquezNajim Dehak
2021-04-03
Efficient DETR: Improving End-to-End Object Detector with Dense Prior
Zhuyu YaoJiangbo AiBoxun LiChi Zhang
2021-04-03
Deep ensembles based on Stochastic Activation Selection for Polyp Segmentation
Alessandra LuminiLoris NanniGianluca Maguolo
2021-04-02
Language-based Video Editing via Multi-Modal Multi-Level Transformer
Tsu-Jui FuXin Eric WangScott T. GraftonMiguel P. EcksteinWilliam Yang Wang
2021-04-02
AAformer: Auto-Aligned Transformer for Person Re-Identification
Kuan ZhuHaiyun GuoShiliang ZhangYaoWei WangGaopan HuangHonglin QiaoJing LiuJinqiao WangMing Tang
2021-04-02
Effect of depth order on iterative nested named entity recognition models
Perceval WajsburtYoann TailléXavier Tannier
2021-04-02
IITK@LCP at SemEval 2021 Task 1: Classification for Lexical Complexity Regression Task
| Neil Rajiv ShirudeSagnik MukherjeeTushar ShandhilyaAnanta MukherjeeAshutosh Modi
2021-04-02
The Coronavirus is a Bioweapon: Analysing Coronavirus Fact-Checked Stories
Lynnette Hui Xian NgKathleen M. Carley
2021-04-02
A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery
| Aatif JiwaniShubhrakanti GangulyChao DingNan ZhouDavid M. Chan
2021-04-02
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis
Ajay JainMatthew TancikPieter Abbeel
2021-04-01
WakaVT: A Sequential Variational Transformer for Waka Generation
Yuka TakeishiMingxuan NiuJing LuoZhong JinXinyu Yang
2021-04-01
Students are the Best Teacher: Exit-Ensemble Distillation with Multi-Exits
Hojung LeeJong-Seok Lee
2021-04-01
LoFTR: Detector-Free Local Feature Matching with Transformers
| Jiaming SunZehong ShenYuang WangHujun BaoXiaowei Zhou
2021-04-01
The surprising impact of mask-head architecture on novel class segmentation
| Vighnesh BirodkarZhichao LuSiyang LiVivek RathodJonathan Huang
2021-04-01
TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking
Peng ChuJiang WangQuanzeng YouHaibin LingZicheng Liu
2021-04-01
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio BarnabòGiovanni TrappoliniLorenzo LastillaCesare CampagnanoAngela FanFabio PetroniFabrizio Silvestri
2021-04-01
HLE-UPC at SemEval-2021 Task 5: Multi-Depth DistilBERT for Toxic Spans Detection
| Rafel Palliser-SansAlbert Rial-Farràs
2021-04-01
Keyword Transformer: A Self-Attention Model for Keyword Spotting
Axel BergMark O'ConnorMiguel Tairum Cruz
2021-04-01
Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning
| Juliano PintoGeorg HessWilliam LjungberghYuxuan XiaLennart SvenssonHenk Wymeersch
2021-04-01
Adversarial Attacks and Defenses for Speech Recognition Systems
Piotr ŻelaskoSonal JoshiYiwen ShaoJesus VillalbaJan TrmalNajim DehakSanjeev Khudanpur
2021-03-31
Scalable Visual Attribute Extraction through Hidden Layers of a Residual ConvNet
Andres BaloianNils Murrugarra-LlerenaJose M. Saavedra
2021-03-31
Convolutional Dynamic Alignment Networks for Interpretable Classifications
Moritz BöhleMario FritzBernt Schiele
2021-03-31
Drowsiness Detection Based On Driver Temporal Behavior Using a New Developed Dataset
Farnoosh FarajiFaraz LotfiJavad KhorramdelAli NajafiAli Ghaffari
2021-03-31
Learning Spatio-Temporal Transformer for Visual Tracking
| Bin YanHouwen PengJianlong FuDong WangHuchuan Lu
2021-03-31
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu ZhangLonghui WeiLingxi XieZijie ZhuangYongfei ZhangBo LiQi Tian
2021-03-30
Automatic Graph Partitioning for Very Large-scale Deep Learning
Masahiro TanakaKenjiro TauraToshihiro HanawaKentaro Torisawa
2021-03-30
Benchmarking Representation Learning for Natural World Image Collections
| Grant van HornElijah ColeSara BeeryKimberly WilberSerge BelongieOisin Mac Aodha
2021-03-30
Read and Attend: Temporal Localisation in Sign Language Videos
Gül VarolLiliane MomeniSamuel AlbanieTriantafyllos AfourasAndrew Zisserman
2021-03-30
Automated Cleanup of the ImageNet Dataset by Model Consensus, Explainability and Confident Learning
| Csaba Kertész
2021-03-30
Rethinking Spatial Dimensions of Vision Transformers
| Byeongho HeoSangdoo YunDongyoon HanSanghyuk ChunJunsuk ChoeSeong Joon Oh
2021-03-30
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
| Mingchen ZhugeDehong GaoDeng-Ping FanLinbo JinBen ChenHaoming ZhouMinghui QiuLing Shao
2021-03-30
Identity-Aware CycleGAN for Face Photo-Sketch Synthesis and Recognition
Yuke FangJiani HuWeihong Deng
2021-03-30
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
| Debanjan ChaudhuriMd Rashad Al Hasan RonyJens Lehmann
2021-03-30
An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking
Koustav RudraZeon Trevor FernandoAvishek Anand
2021-03-30
CvT: Introducing Convolutions to Vision Transformers
| Haiping WuBin XiaoNoel CodellaMengchen LiuXiyang DaiLu YuanLei Zhang
2021-03-29
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan ZhangXiyang DaiJianwei YangBin XiaoLu YuanLei ZhangJianfeng Gao
2021-03-29
Transformer Tracking
| Xin ChenBin YanJiawen ZhuDong WangXiaoyun YangHuchuan Lu
2021-03-29
Rethinking Neural Operations for Diverse Tasks
| Nicholas RobertsMikhail KhodakTri DaoLiam LiChristopher RéAmeet Talwalkar
2021-03-29
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal Dependencies
Pratik JayaraoArpit Sharma
2021-03-29
Classification of Seeds using Domain Randomization on Self-Supervised Learning Frameworks
Venkat MargapuriMitchell Neilsen
2021-03-29
Whitening Sentence Representations for Better Semantics and Faster Retrieval
| Jianlin SuJiarun CaoWeijie LiuYangyiwen Ou
2021-03-29
Contextual Text Embeddings for Twi
Paul AzunreSalomey OseiSalomey AddoLawrence Asamoah Adu-GyamfiStephen MooreBernard AdabankahBernard OpokuClara Asare-NyarkoSamuel NyarkoCynthia AmoabaEsther Dansoa AppiahFelix AkwerhRichard Nii Lante LawsonJoel BuduEmmanuel DebrahNana BoatengWisdom OforiEdwin Buabeng-MunkohFranklin AdjeiIsaac Kojo Essel AmpomahJoseph OtooReindorf BorkorStandylove Birago MensahLucien MensahMark Amoako MarcelAnokye Acheampong AmponsahJames Ben Hayfron-Acquah
2021-03-29
Rethinking ResNets: Improved Stacking Strategies With High Order Schemes
Zhengbo LuoZitang SunWeilian ZhouSei-ichiro Kamata
2021-03-28
PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation
| Dimitris PapadopoulosNikolaos PapadakisNikolaos Matsatsinis
2021-03-28
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye JiaHeiga ZenJonathan ShenYu ZhangYonghui Wu
2021-03-28
HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval
Song LiuHaoqi FanShengsheng QianYiru ChenWenkui DingZhongyuan Wang
2021-03-28
Face Transformer for Recognition
| Yaoyao ZhongWeihong Deng
2021-03-27
Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data
Akshat GuptaSargam MenghaniSai Krishna RallabandiAlan W Black
2021-03-27
COVID-19 personal protective equipment detection using real-time deep learning methods
| Shayan KhosravipourErfan TaghvaeiNasrollah Moghadam Charkari
2021-03-27
Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
| Nay SanMartijn BarteldsMitchell BrowneLily CliffordFiona GibsonJohn MansfieldDavid NashJane SimpsonMyfany TurpinMaria VollmerSasha WilmothDan Jurafsky
2021-03-26
A Practical Survey on Faster and Lighter Transformers
Quentin FournierGaétan Marceau CaronDaniel Aloise
2021-03-26
Understanding Robustness of Transformers for Image Classification
Srinadh BhojanapalliAyan ChakrabartiDaniel GlasnerDaliang LiThomas UnterthinerAndreas Veit
2021-03-26
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN
Congyi WangYu ChenBin WangYi Shi
2021-03-26
Gated Transformer Networks for Multivariate Time Series Classification
| Minghao LiuShengqi RenSiyuan MaJiahui JiaoYizhou ChenZhiguang WangWei Song
2021-03-26
Lifting Transformer for 3D Human Pose Estimation in Video
Wenhao LiHong LiuRunwei DingMengyuan LiuPichao Wang
2021-03-26
On Generating Transferable Targeted Perturbations
| Muzammal NaseerSalman KhanMunawar HayatFahad Shahbaz KhanFatih Porikli
2021-03-26
BART based semantic correction for Mandarin automatic speech recognition system
Yun ZhaoXuerui YangJinchao WangYongyu GaoChao YanYuanfu Zhou
2021-03-26
Predicting Directionality in Causal Relations in Text
| Pedram HosseiniDavid A. BroniatowskiMona Diab
2021-03-25
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
| Ze LiuYutong LinYue CaoHan HuYixuan WeiZheng ZhangStephen LinBaining Guo
2021-03-25
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting
| Ye YuanXinshuo WengYanglan OuKris Kitani
2021-03-25
Visual Grounding Strategies for Text-Only Natural Language Processing
Damien Sileo
2021-03-25
Bertinho: Galician BERT Representations
David VilaresMarcos GarciaCarlos Gómez-Rodríguez
2021-03-25
Mask Attention Networks: Rethinking and Strengthen Transformer
Zhihao FanYeyun GongDayiheng LiuZhongyu WeiSiyuan WangJian JiaoNan DuanRuofei ZhangXuanjing Huang
2021-03-25
BERT4SO: Neural Sentence Ordering by Fine-tuning BERT
Yutao ZhuJian-Yun NieKun ZhouShengchao LiuYabo LingPan Du
2021-03-25
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2
Gregor BetzKyle RichardsonChristian Voigt
2021-03-24
FastMoE: A Fast Mixture-of-Expert Training System
| Jiaao HeJiezhong QiuAohan ZengZhilin YangJidong ZhaiJie Tang
2021-03-24
MANAS: Multi-Scale and Multi-Level Neural Architecture Search for Low-Dose CT Denoising
Zexin LuWenjun XiaYongqiang HuangHongming ShanHu ChenJiliu ZhouYi Zhang
2021-03-24
Czert -- Czech BERT-like Model for Language Representation
| Jakub SidoOndřej PražákPavel PřibáňJan PašekMichal SejákMiloslav Konopík
2021-03-24
Multi-view 3D Reconstruction with Transformer
Dan WangXinrui CuiXun ChenZhengxia ZouTianyang ShiSeptimiu SalcudeanZ. Jane WangRabab Ward
2021-03-24
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
| Amaia SalvadorErhan GundogduLoris BazzaniMichael Donoser
2021-03-24
A Framework for 3D Tracking of Frontal Dynamic Objects in Autonomous Cars
Faraz LotfiHamid D. Taghirad
2021-03-24
Detecting Hate Speech with GPT-3
| Ke-Li ChiuRohan Alexander
2021-03-23
Global Correlation Network: End-to-End Joint Multi-Object Detection and Tracking
Xuewu LinYu-ang GuoJianqiang Wang
2021-03-23
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
Jan Philip WahleTerry RuasNorman MeuschkeBela Gipp
2021-03-23
TMR: Evaluating NER Recall on Tough Mentions
Jingxuan TuConstantine Lignos
2021-03-23
Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs
Ramneet KaurSusmit JhaAnirban RoyOleg SokolskyInsup Lee
2021-03-23
Repairing Pronouns in Translation with BERT-Based Post-Editing
Reid Pryzant
2021-03-23
Variable Name Recovery in Decompiled Binary Code using Constrained Masked Language Modeling
Pratyay BanerjeeKuntal Kumar PalFish WangChitta Baral
2021-03-23
Identifying Machine-Paraphrased Plagiarism
| Jan Philip WahleTerry RuasTomáš FoltýnekNorman MeuschkeBela Gipp
2021-03-22
Open Domain Question Answering over Tables via Dense Retrieval
| Jonathan HerzigThomas MüllerSyrine KricheneJulian Martin Eisenschlos
2021-03-22
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
2021-03-22
Hybrid Model for Patent Classification using Augmented SBERT and KNN
| Hamid BekamiriDaniel S. HainRoman Jurowetzki
2021-03-22
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Mikael BrunilaRosie ZhaoAndrei MirceaSam LumleyRenee Sieber
2021-03-22
Incorporating Convolution Designs into Visual Transformers
| Kun YuanShaopeng GuoZiwei LiuAojun ZhouFengwei YuWei Wu
2021-03-22
Predicting brain-age from raw T 1 -weighted Magnetic Resonance Imaging data using 3D Convolutional Neural Networks
Lukas FischJan ErnstingNils R. WinterVincent HolsteinRamona LeeningsMarie BeisemannKelvin SarinkDaniel EmdenNils OpelRonny RedlichJonathan ReppleDominik GrotegerdSusanne MeinertNiklas WulmsHeike MinnerupJochen G. HirschThoralf NiendorfBeate EndemannFabian BambergThomas KrönckeAnnette PetersRobin BülowHenry VölzkeOyunbileg von StackelbergRamona Felizitas SowadeLale UmutluBörge SchmidtSvenja CaspersGerman National Cohort Study Center ConsortiumHarald KugelBernhard T. BauneTilo KircherBenjamin RisseUdo DannlowskiKlaus BergerTim Hahn
2021-03-22
A Batch Normalization Classifier for Domain Adaptation
| Matthew R. BehrendSean M. Robinson
2021-03-22
Control Distance IoU and Control Distance IoU Loss Function for Better Bounding Box Regression
| Dong ChenDuoqian Miao
2021-03-22
Tiny Transformers for Environmental Sound Classification at the Edge
David ElliottCarlos E. OteroSteven WyattEvan Martino
2021-03-22
Prediction of lung and colon cancer through analysis of histopathological images by utilizing Pre-trained CNN models with visualization of class activation and saliency maps
Satvik GargSomya Garg
2021-03-22
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
Lucas StofflMaxime VidalAlexander Mathis
2021-03-22
Paying Attention to Activation Maps in Camera Pose Regression
Yoli ShavitRon FerensYosi Keller
2021-03-21
Non-Autoregressive Translation by Learning Target Categorical Codes
Yu BaoShuJian HuangTong XiaoDongqi WangXinyu DaiJiajun Chen
2021-03-21
MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation
Zachary SeymourKowshik ThopalliNiluthpol MithunHan-Pang ChiuSupun SamarasekeraRakesh Kumar
2021-03-21
ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques
| Yuanxin LiuZheng LinFengcheng Yuan
2021-03-21
NameRec*: Highly Accurate and Fine-grained Person Name Recognition
Rui ZhangYimeng DaiShijie Liu
2021-03-21
An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Zejun LiZhongyu WeiZhihao FanHaijun ShanXuanjing Huang
2021-03-21
Paying Attention to Multiscale Feature Maps in Multimodal Image Matching
Aviad MoreshetYosi Keller
2021-03-20
3M: Multi-style image caption generation using Multi-modality features under Multi-UPDOWN model
Chengxi LiBrent Harrison
2021-03-20
Efficient Subsampling for Generating High-Quality Images from Conditional Generative Adversarial Networks
| Xin DingYongwei WangZ. Jane WangWilliam J. Welch
2021-03-20
Play the Shannon Game With Language Models: A Human-Free Approach to Summary Evaluation
Nicholas EganOleg VasilyevJohn Bohannon
2021-03-19
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
| Honglu ZhouAsim KadavFarley LaiAlexandru Niculescu-MizilMartin Renqiang MinMubbasir KapadiaHans Peter Graf
2021-03-19
Transferable Model for Shape Optimization subject to Physical Constraints
Lukas HarschJohannes BurgbacherStefan Riedelbauch
2021-03-19
MuRIL: Multilingual Representations for Indian Languages
Simran KhanujaDiksha BansalSarvesh MehtaniSavya KhoslaAtreyee DeyBalaji GopalanDilip Kumar MargamPooja AggarwalRajiv Teja NagipoguShachi DaveShruti GuptaSubhash Chandra Bose GaliVish SubramanianPartha Talukdar
2021-03-19
Cost-effective Deployment of BERT Models in Serverless Environment
Katarína BenešováAndrej ŠvecMarek Šuppa
2021-03-19
API2Com: On the Improvement of Automatically Generated Code Comments Using API Documentations
Ramin ShahbaziRishab SharmaFatemeh H. Fard
2021-03-19
HW-NAS-Bench:Hardware-Aware Neural Architecture Search Benchmark
| Chaojian LiZhongzhi YuYonggan FuYongan ZhangYang ZhaoHaoran YouQixuan YuYue WangYingyan Lin
2021-03-19
Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of Cardiac Signals
| Dani KiyassehTingting ZhuDavid Clifton
2021-03-19
GPT Understands, Too
| Xiao LiuYanan ZhengZhengxiao DuMing DingYujie QianZhilin YangJie Tang
2021-03-18
All NLP Tasks Are Generation Tasks: A General Pretraining Framework
| Zhengxiao DuYujie QianXiao LiuMing DingJiezhong QiuZhilin YangJie Tang
2021-03-18
Contextual Biasing of Language Models for Speech Recognition in Goal-Oriented Conversational Agents
Ashish ShenoySravan BodapatiKatrin Kirchhoff
2021-03-18
OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation
| Bruno ArtachoAndreas Savakis
2021-03-18
Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training
Saurabh SahuPalash Goyal
2021-03-18
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!
Xuanli HeLingjuan LyuQiongkai XuLichao Sun
2021-03-18
Danish Fungi 2020 -- Not Just Another Image Recognition Dataset
| Lukáš PicekMilan ŠulcJiří MatasJacob Heilmann-ClausenThomas S. JeppesenThomas LæssøeTobias Frøslev
2021-03-18
On the Role of Images for Analyzing Claims in Social Media
| Gullal S. CheemaSherzod HakimovEric Müller-BudackRalph Ewerth
2021-03-17
Trans-SVNet: Accurate Phase Recognition from Surgical Videos via Hybrid Embedding Aggregation Transformer
Xiaojie GaoYueming JinYonghao LongQi DouPheng-Ann Heng
2021-03-17
UniParma at SemEval-2021 Task 5: Toxic Spans Detection Using CharacterBERT and Bag-of-Words Model
Akbar KarimiLeonardo RossiAndrea Prati
2021-03-17
Code Word Detection in Fraud Investigations using a Deep-Learning Approach
Youri van der ZeeJan C. ScholtesMarcel WesterhoudJulien Rossi
2021-03-17
You Only Look One-level Feature
| Qiang ChenYingming WangTong YangXiangyu ZhangJian ChengJian Sun
2021-03-17
Triplet-Watershed for Hyperspectral Image Classification
| Aditya ChallaSravan DandaB. S. Daya SagarLaurent Najman
2021-03-17
Contrastive Learning of Musical Representations
| Janne SpijkervetJohn Ashley Burgoyne
2021-03-17
ReconResNet: Regularised Residual Learning for MR Image Reconstruction of Undersampled Cartesian and Radial Data
| Soumick ChatterjeeMario BreitkopfChompunuch SarasaenHadya YassinGeorg RoseAndreas NürnbergerOliver Speck
2021-03-16
Dense Interaction Learning for Video-based Person Re-identification
Tianyu HeXin JinXu ShenJianqiang HuangZhibo ChenXian-Sheng Hua
2021-03-16
KGSynNet: A Novel Entity Synonyms Discovery Framework with Knowledge Graph
Yiying YangXi YinHaiqin YangXingjian FeiHao PengKaijie ZhouKunfeng LaiJianping Shen
2021-03-16
Robustly Optimized and Distilled Training for Natural Language Understanding
Haytham ElFadeelStan Peshterliev
2021-03-16
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
| Siqi SunYen-Chun ChenLinjie LiShuohang WangYuwei FangJingjing Liu
2021-03-16
Knowledge driven Description Synthesis for Floor Plan Interpretation
Shreya GoyalChiranjoy ChattopadhyayGaurav Bhatnagar
2021-03-15
Understanding invariance via feedforward inversion of discriminatively trained classifiers
Piotr TeterwakChiyuan ZhangDilip KrishnanMichael C. Mozer
2021-03-15
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels
Chenliang LiMing YanHaiyang XuFuli LuoWei WangBin BiSongfang Huang
2021-03-14
Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting
| Chen LinZhichao OuyangJunqing ZhuangJianqiang ChenHui LiRongxin Wu
2021-03-14
Embedding Calibration for Music Semantic Similarity using Auto-regressive Transformer
Xinran ZhangMaosong SunJiafeng LiuXiaobing Li
2021-03-13
Revisiting ResNets: Improved Training and Scaling Strategies
| Irwan BelloWilliam FedusXianzhi DuEkin D. CubukAravind SrinivasTsung-Yi LinJonathon ShlensBarret Zoph
2021-03-13
Text Mining of Stocktwits Data for Predicting Stock Prices
Mukul JaggiPriyanka MandalShreya NarangUsman NaseemMatloob Khushi
2021-03-13
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' Transferability
Wei-Tsung KaoHung-Yi Lee
2021-03-12
Explaining and Improving BERT Performance on Lexical Semantic Change Detection
Severin LaicherSinan KurtyigitDominik SchlechtwegJonas KuhnSabine Schulte im Walde
2021-03-12
Vision Transformer for COVID-19 CXR Diagnosis using Chest X-ray Feature Corpus
Sangjoon ParkGwanghyun KimYujin OhJoon Beom SeoSang Min LeeJin Hwan KimSungjun MoonJae-Kwang LimJong Chul Ye
2021-03-12
Bilingual Dictionary-based Language Model Pretraining for Neural Machine Translation
Yusen LinJiayong LinShuaicheng ZhangHaoying Dai
2021-03-12
Severity Quantification and Lesion Localization of COVID-19 on CXR using Vision Transformer
Gwanghyun KimSangjoon ParkYujin OhJoon Beom SeoSang Min LeeJin Hwan KimSungjun MoonJae-Kwang LimJong Chul Ye
2021-03-12
Sequential Random Network for Fine-grained Image Classification
Chaorong LiMalu ZhangWei HuangFengqing QinAnping ZengYuanyuan Huang
2021-03-12
Predicting the Behavior of Dealers in Over-The-Counter Corporate Bond Markets
Yusen LinJinming XueLouiqa Raschid
2021-03-12
Comparing the Performance of NLP Toolkits and Evaluation measures in Legal Tech
Muhammad Zohaib Khan
2021-03-12
Unknown Object Segmentation from Stereo Images
Maximilian DurnerWout BoerdijkMartin SundermeyerWerner FriedlZoltan-Csaba MartonRudolph Triebel
2021-03-11
Evaluation of Morphological Embeddings for the Russian Language
Vitaly RomanovAlbina Khusainova
2021-03-11
Preprint: Norm Loss: An efficient yet effective regularization method for deep neural networks
Theodoros GeorgiouSebastian SchmittThomas BäckWei ChenMichael Lew
2021-03-11
Improving Bi-encoder Document Ranking Models with Two Rankers and Multi-teacher Distillation
Jaekeol ChoiEuna JungJangwon SuhWonjong Rhee
2021-03-11
Composite Re-Ranking for Efficient Document Search with BERT
Yingrui YangYifan QiaoJinjin ShaoMayuresh AnandXifeng YanTao Yang
2021-03-11
Pavement Distress Detection and Segmentation using YOLOv4 and DeepLabv3 on Pavements in the Philippines
James-Andrew Sarmiento
2021-03-11
Towards Multi-Sense Cross-Lingual Alignment of Contextual Embeddings
Linlin LiuThien Hai NguyenShafiq JotyLidong BingLuo Si
2021-03-11
SAR-U-Net: squeeze-and-excitation block and atrous spatial pyramid pooling based residual U-Net for automatic liver CT segmentation
Jinke WangPeiqing LvHaiying WangChangfa Shi
2021-03-11
LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation
Xiaoqi JiaoYichun YinLifeng ShangXin JiangXiao ChenLinlin LiFang WangQun Liu
2021-03-11
FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders
Pengyu ChengWeituo HaoSiyang YuanShijing SiLawrence Carin
2021-03-11
Self-supervised Text-to-SQL Learning with Header Alignment Training
Donggyu KimSeanie Lee
2021-03-11
On Improving Deep Learning Trace Analysis with System Call Arguments
Quentin FournierDaniel AloiseSeyed Vahid AzhariFrançois Tetreault
2021-03-11
Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks
Ben SaundersNecati Cihan CamgozRichard Bowden
2021-03-11
CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
| Dan HendrycksCollin BurnsAnya ChenSpencer Ball
2021-03-10
RL-CSDia: Representation Learning of Computer Science Diagrams
Shaowei WangLingling ZhangXuan LuoYi YangXin HuJun Liu
2021-03-10
Majority Voting with Bidirectional Pre-translation For Bitext Retrieval
| Alex JonesDerry Tanti Wijaya
2021-03-10
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
| Maurice GerczukShahin AmiriparianSandra OttlBjörn Schuller
2021-03-10
Hurdles to Progress in Long-form Question Answering
Kalpesh KrishnaAurko RoyMohit Iyyer
2021-03-10
CEQE: Contextualized Embeddings for Query Expansion
Shahrzad NaseriJeffrey DaltonAndrew YatesJames Allan
2021-03-09
Pretrained Transformers as Universal Computation Engines
| Kevin LuAditya GroverPieter AbbeelIgor Mordatch
2021-03-09
Active Testing: Sample-Efficient Model Evaluation
| Jannik KossenSebastian FarquharYarin GalTom Rainforth
2021-03-09
Automatic code generation from sketches of mobile applications in end-user development using Deep Learning
| Daniel BauléChristiane Gresse von WangenheimAldo von WangenheimJean C. R. HauckEdson C. Vargas Júnior
2021-03-09
Language Models have a Moral Dimension
Patrick SchramowskiCigdem TuranNico AndersenConstantin RothkopfKristian Kersting
2021-03-08
Depth Evaluation for Metal Surface Defects by Eddy Current Testing using Deep Residual Convolutional Neural Networks
Tian MengYang TaoZiqi ChenJorge R. Salas AvilaQiaoye RanYuchun ShaoRuochen HuangYuedong XieQian ZhaoZhijie ZhangHujun YinAnthony J. PeytonWuliang Yin
2021-03-08
Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees
| Jiangang BaiYujing WangYiren ChenYaming YangJing BaiJing YuYunhai Tong
2021-03-07
TransBTS: Multimodal Brain Tumor Segmentation Using Transformer
| Wenxuan WangChen ChenMeng DingJiangyun LiHong YuSen Zha
2021-03-07
Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain
Jinyu TianJiantao ZhouYuanman LiJia Duan
2021-03-07
Orthogonal Attention: A Cloze-Style Approach to Negation Scope Resolution
Aditya KhandelwalVahida Attar
2021-03-07
MTLHealth: A Deep Learning System for Detecting Disturbing Content in Student Essays
Joseph ValenciaErin Yao
2021-03-07
WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition
Zheng ZhuGuan HuangJiankang DengYun YeJunJie HuangXinze ChenJiagang ZhuTian YangJiwen LuDalong DuJie zhou
2021-03-06
Morphological Operation Residual Blocks: Enhancing 3D Morphological Feature Representation in Convolutional Neural Networks for Semantic Segmentation of Medical Images
Chentian LiChi MaWilliam W. Lu
2021-03-06
Multitasking Deep Learning Model for Detection of Five Stages of Diabetic Retinopathy
Sharmin MajumderNasser Kehtarnavaz
2021-03-06
Perception Framework through Real-Time Semantic Segmentation and Scene Recognition on a Wearable System for the Visually Impaired
Yingzhi ZhangHaoye ChenKailun YangJiaming ZhangRainer Stiefelhagen
2021-03-06
A Real-time Low-cost Artificial Intelligence System for Autonomous Spraying in Palm Plantations
| Zhenwang QinWensheng WangKarl-Heinz DammerLeifeng GuoZhen Cao
2021-03-06
Teachers Do More Than Teach: Compressing Image-to-Image Models
Qing JinJian RenOliver J. WoodfordJiazhuo WangGeng YuanYanzhi WangSergey Tulyakov
2021-03-05
Measuring Mathematical Problem Solving With the MATH Dataset
| Dan HendrycksCollin BurnsSaurav KadavathAkul AroraSteven BasartEric TangDawn SongJacob Steinhardt
2021-03-05
MalBERT: Using Transformers for Cybersecurity and Malicious Software Detection
Abir RahaliMoulay A. Akhloufi
2021-03-05
Fine-tuning Pretrained Multilingual BERT Model for Indonesian Aspect-based Sentiment Analysis
Annisa Nurul AzharMasayu Leylia Khodra
2021-03-05
SpecTr: Spectral Transformer for Hyperspectral Pathology Image Segmentation
| Boxiang YunYan WangJieneng ChenHuiyu WangWei ShenQingli Li
2021-03-05
Hierarchical Transformer for Multilingual Machine Translation
Albina KhusainovaAdil KhanAdín Ramírez RiveraVitaly Romanov
2021-03-05
Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation
Chang LiuXiaoguang LiGuohao CaiZhenhua DongHong ZhuLifeng Shang
2021-03-05
IOT: Instance-wise Layer Reordering for Transformer Structures
| Jinhua ZhuLijun WuYingce XiaShufang XieTao QinWengang ZhouHouqiang LiTie-Yan Liu
2021-03-05
A Hybrid CNN-BiLSTM Voice Activity Detector
Nicholas WilkinsonThomas Niesler
2021-03-05
Hardware Acceleration of Fully Quantized BERT for Efficient Natural Language Processing
Zejian LiuGang LiJian Cheng
2021-03-04
CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
| Yutong XieJianpeng ZhangChunhua ShenYong Xia
2021-03-04
The Transformer Network for the Traveling Salesman Problem
| Xavier BressonThomas Laurent
2021-03-04
End-to-end acoustic modelling for phone recognition of young readers
Lucile GelinMorgane DanielJulien PinquierThomas Pellegrini
2021-03-04
Few-shot Learning for Slot Tagging with Attentive Relational Network
Cennet OguzNgoc Thang Vu
2021-03-03
University of Copenhagen Participation in TREC Health Misinformation Track 2020
Lucas Chaves LimaDustin Brandon WrightIsabelle AugensteinMaria Maistro
2021-03-03
Sensing population distribution from satellite imagery via deep learning: model selection, neighboring effect, and systematic biases
Xiao HuangDi ZhuFan ZhangTao LiuXiao LiLei Zou
2021-03-03
Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020 US Elections on the Basis of Offensive Speech and Stance Detection
Lara GrimmingerRoman Klinger
2021-03-02
Dual Reinforcement-Based Specification Generation for Image De-Rendering
Ramakanth PasunuruDavid RosenbergGideon MannMohit Bansal
2021-03-02
Using CNNs to Identify the Origin of Finger Vein Image
Babak MaserAndreas Uhl
2021-03-02
Probing Product Description Generation via Posterior Distillation
Haolan ZhanHainan ZhangHongshen ChenLei ShenZhuoye DingYongjun BaoWeipeng YanYanyan Lan
2021-03-02
A HINT from Arithmetic: On Systematic Generalization of Perception, Syntax, and Semantics
Qing LiSiyuan HuangYining HongYixin ZhuYing Nian WuSong-Chun Zhu
2021-03-02
Self-supervised Pretraining of Visual Features in the Wild
| Priya GoyalMathilde CaronBenjamin LefaudeuxMin XuPengchao WangVivek PaiMannat SinghVitaliy LiptchinskyIshan MisraArmand JoulinPiotr Bojanowski
2021-03-02
BERT-based knowledge extraction method of unstructured domain text
Wang ZijiaLi YeZhu Zhongkai
2021-03-01
Combat COVID-19 Infodemic Using Explainable Natural Language Processing Models
Jackie AyoubX. Jessie YangFeng Zhou
2021-03-01
Long Document Summarization in a Low Resource Setting using Pretrained Language Models
Ahsaas BajajPavitra DangatiKalpesh KrishnaPradhiksha Ashok KumarRheeya UppaalBradford WindsorEliot BrennerDominic DotterrerRajarshi DasAndrew McCallum
2021-03-01
BERT based patent novelty search by training claims to their own description
Michael FreunekAndré Bodmer
2021-03-01
Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training
Sheng LiuXiao LiYuexiang ZhaiChong YouZhihui ZhuCarlos Fernandez-GrandaQing Qu
2021-03-01
Over-sampling De-occlusion Attention Network for Prohibited Items Detection in Noisy X-ray Images
| Renshuai TaoYanlu WeiHainan LiAishan LiuYifu DingHaotong QinXianglong Liu
2021-03-01
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation
Aly MagassoubaKomei SugiuraHisashi Kawai
2021-03-01
DTW-Merge: A Novel Data Augmentation Technique for Time Series Classification
Mohammad AkyashHoda MohammadzadeHamid Behroozi
2021-03-01
Brain Programming is Immune to Adversarial Attacks: Towards Accurate and Robust Image Classification using Symbolic Learning
Gerardo Ibarra-VazquezGustavo OlagueMariana Chan-LeyCesar PuenteCarlos Soubervielle-Montalvo
2021-03-01
NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection using Cross-lingual Representation Learner
| Eftekhar HossainOmar SharifMohammed Moshiul Hoque
2021-02-28
NLP-CUET@DravidianLangTech-EACL2021: Investigating Visual and Textual Features to Identify Trolls from Multimodal Social Media Memes
Eftekhar HossainOmar SharifMohammed Moshiul Hoque
2021-02-28
Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly
| Tianlong ChenYu ChengZhe GanJingjing LiuZhangyang Wang
2021-02-28
NLP-CUET@DravidianLangTech-EACL2021: Offensive Language Detection from Multilingual Code-Mixed Text using Transformers
| Omar SharifEftekhar HossainMohammed Moshiul Hoque
2021-02-28
Transformers with Competitive Ensembles of Independent Mechanisms
Alex LambDi HeAnirudh GoyalGuolin KeChien-Feng LiaoMirco RavanelliYoshua Bengio
2021-02-27
COVID-19 Tweets Analysis through Transformer Language Models
| Abdul Hameed AzeemiAdeel Waheed
2021-02-27
Generative chemical transformer: attention makes neural machine learn molecular geometric structures via text
Hyunseung KimJonggeol NaWon Bo Lee
2021-02-27
Multi-task transfer learning for finding actionable information from crisis-related messages on social media
Congcong WangDavid Lillis
2021-02-26
MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
Linghui MengJin XuXu TanJindong WangTao QinBo Xu
2021-02-25
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching
| Boer LyuLu ChenSu ZhuKai Yu
2021-02-25
Sentiment Analysis of Persian-English Code-mixed Texts
| Nazanin SabriAli EdalatBehnam Bahrak
2021-02-25
LazyFormer: Self Attention with Lazy Update
Chengxuan YingGuolin KeDi HeTie-Yan Liu
2021-02-25
Visualizing MuZero Models
| Joery A. de VriesKen S. VoskuilThomas M. MoerlandAske Plaat
2021-02-25
Emotion-Aware, Emotion-Agnostic, or Automatic: Corpus Creation Strategies to Obtain Cognitive Event Appraisal Annotations
Jan HofmannEnrica TroianoRoman Klinger
2021-02-25
Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation
| Kenneth BorupLars N. Andersen
2021-02-25
PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning
Nasi JofcheKostadin MishevRiste StojanovMilos JovanovikDimitar Trajanov
2021-02-25
A Framework For Pruning Deep Neural Networks Using Energy-Based Models
Hojjat SalehinejadShahrokh Valaee
2021-02-25
BERT-based Acronym Disambiguation with Multiple Training Strategies
Chunguang PanBingyan SongShengguang WangZhipeng Luo
2021-02-25
Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks
| Christoph RaabPhilipp VäthPeter MeierFrank-Michael Schleif
2021-02-25
Highly Efficient Representation and Active Learning Framework for Imbalanced Data and its Application to COVID-19 X-Ray Classification
Heng HaoSima DidariJae Oh WooHankyu MoonPatrick Bangert
2021-02-25
Task-Specific Pre-Training and Cross Lingual Transfer for Code-Switched Data
Akshat GuptaSai Krishna RallabandiAlan Black
2021-02-24
LRG at SemEval-2021 Task 4: Improving Reading Comprehension with Abstract Words using Augmentation, Linguistic Features and Voting
| Abheesht SharmaHarshit PandeyGunjan ChhablaniYash BhartiaTirtharaj Dash
2021-02-24
NLRG at SemEval-2021 Task 5: Toxic Spans Detection Leveraging BERT-based Token Classification and Span Prediction Techniques
| Gunjan ChhablaniYash BhartiaAbheesht SharmaHarshit PandeyShan Suthaharan
2021-02-24
PADA: A Prompt-based Autoregressive Approach for Adaptation to Unseen Domains
| Eyal Ben-DavidNadav OvedRoi Reichart
2021-02-24
From Universal Language Model to Downstream Task: Improving RoBERTa-Based Vietnamese Hate Speech Detection
Quang Huu PhamViet Anh NguyenLinh Bao DoanNgoc N. TranTa Minh Thanh
2021-02-24
Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers
Ishan Sanjeev UpadhyayNikhil EAnshul WadhawanRadhika Mamidi
2021-02-24
When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute
| Tao Lei
2021-02-24
Combining Off and On-Policy Training in Model-Based Reinforcement Learning
Alexandre BorgesArlindo Oliveira
2021-02-24
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
| Wenhai WangEnze XieXiang LiDeng-Ping FanKaitao SongDing LiangTong LuPing LuoLing Shao
2021-02-24
Railway Anomaly detection model using synthetic defect images generated by CycleGAN
Takuro HoshiYohei BabaGaurang Gavai
2021-02-24
Histo-fetch -- On-the-fly processing of gigapixel whole slide images simplifies and speeds neural network training
| Brendon LutnickLeema Krishna MuraliBrandon GinleyAvi Z. RosenbergPinaki Sarder
2021-02-23
Accurate Learning of Graph Representations with Graph Multiset Pooling
| Jinheon BaekMinki KangSung Ju Hwang
2021-02-23
Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models
Harold OttJasmin BogatinovskiAlexander AckerSasho NedelkoskiOdej Kao
2021-02-23
SISE-PC: Semi-supervised Image Subsampling for Explainable Pathology
| Sohini RoychowdhuryKwok Sun TangMohith AshokAnoop Sanka
2021-02-23
Minimally-Supervised Structure-Rich Text Categorization via Learning on Text-Rich Networks
Xinyang ZhangChenwei ZhangLuna Xin DongJingbo ShangJiawei Han
2021-02-23
VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels
| Saahil JainAkshay SmitSteven QH TruongChanh DT NguyenMinh-Thanh HuynhMudit JainVictoria A. YoungAndrew Y. NgMatthew P. LungrenPranav Rajpurkar
2021-02-23
Deep Deformation Detail Synthesis for Thin Shell Models
Lan ChenLin GaoJie YangShibiao XuJuntao YeXiaopeng ZhangYu-Kun Lai
2021-02-23
Do Transformer Modifications Transfer Across Implementations and Applications?
| Sharan NarangHyung Won ChungYi TayWilliam FedusThibault FevryMichael MatenaKarishma MalkanNoah FiedelNoam ShazeerZhenzhong LanYanqi ZhouWei LiNan DingJake MarcusAdam RobertsColin Raffel
2021-02-23
Wavelet Transform Analytics for RF-Based UAV Detection and Identification System Using Machine Learning
Olusiji MedaiyeseMartins EzumaAdrian P. LaufIsmail Guvenc
2021-02-23
Revisiting Classification Perspective on Scene Text Recognition
| Hongxiang CaiJun SunYichao Xiong
2021-02-22
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks
| Tingyu XiaYue WangYuan TianYi Chang
2021-02-22
Evaluating Contextualized Language Models for Hungarian
| Judit ÁcsDániel LévaiDávid Márk NemeskeyAndrás Kornai
2021-02-22
Deepfake Video Detection Using Convolutional Vision Transformer
| Deressa WodajoSolomon Atnafu
2021-02-22
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Junwei LiaoYu ShiMing GongLinjun ShouSefik EskimezLiyang LuHong QuMichael Zeng
2021-02-22
Position Information in Transformers: An Overview
Philipp DufterMartin SchmittHinrich Schütze
2021-02-22
Determination of Fault Location in Transmission Lines with Image Processing and Artificial Neural Networks
Serkan BudakBahadir Akbal
2021-02-22
Few Shot Learning for Information Verification
Usama KhalidMirza Omer Beg
2021-02-22
Conditional Positional Encodings for Vision Transformers
| Xiangxiang ChuZhi TianBo ZhangXinlong WangXiaolin WeiHuaxia XiaChunhua Shen
2021-02-22
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang HuAmanpreet Singh
2021-02-22
MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Wancong ZhangIeshan Vaidya
2021-02-22
Lightweight Combinational Machine Learning Algorithm for Sorting Canine Torso Radiographs
Masuda Akter TonimaFatemeh EsfahaniAustin DehartYoumin Zhang
2021-02-22
RUBERT: A Bilingual Roman Urdu BERT Using Cross Lingual Transfer Learning
Usama KhalidMirza Omer BegMuhammad Umair Arshad
2021-02-22
Parallelizing Legendre Memory Unit Training
| Narsimha ChilkuriChris Eliasmith
2021-02-22
Pre-Training BERT on Arabic Tweets: Practical Considerations
Ahmed AbdelaliSabit HassanHamdy MubarakKareem DarwishYounes Samih
2021-02-21
Web-based Application for Detecting Indonesian Clickbait Headlines using IndoBERT
Muhammad Noor FakhruzzamanSie Wildan Gunawan
2021-02-21
Medical Transformer: Gated Axial-Attention for Medical Image Segmentation
| Jeya Maria Jose ValanarasuPoojan OzaIlker HacihalilogluVishal M. Patel
2021-02-21
Towards Accurate and Compact Architectures via Neural Architecture Transformer
| Yong GuoYin ZhengMingkui TanQi ChenZhipeng LiJian ChenPeilin ZhaoJunzhou Huang
2021-02-20
Multilingual Answer Sentence Reranking via Automatically Translated Data
Thuy VuAlessandro Moschitti
2021-02-20
Learning Dynamic BERT via Trainable Gate Variables and a Bi-modal Regularizer
Seohyeong JeongNojun Kwak
2021-02-19
Towards Emotion Recognition in Hindi-English Code-Mixed Data: A Transformer Based Approach
Anshul WadhawanAkshita Aggarwal
2021-02-19
A Deep Graph Wavelet Convolutional Neural Network for Semi-supervised Node Classification
Jingyi WangZhidong Deng
2021-02-19
Training cascaded networks for speeded decisions using a temporal-difference loss
Michael L. IuzzolinoMichael C. MozerSamy Bengio
2021-02-19
Using Transformer based Ensemble Learning to classify Scientific Articles
| Sohom GhoshAnkush Chopra
2021-02-19
Calibrate Before Use: Improving Few-Shot Performance of Language Models
| Tony Z. ZhaoEric WallaceShi FengDan KleinSameer Singh
2021-02-19
Dialect Identification in Nuanced Arabic Tweets Using Farasa Segmentation and AraBERT
Anshul Wadhawan
2021-02-19
Lottery Ticket Implies Accuracy Degradation, Is It a Desirable Phenomenon?
Ning LiuGeng YuanZhengping CheXuan ShenXiaolong MaQing JinJian RenJian TangSijia LiuYanzhi Wang
2021-02-19
Latent Variable Nested Set Transformers & AutoBots
Roger GirgisFlorian GolemoFelipe CodevillaJim Aldon D'SouzaSamira Ebrahimi KahouFelix HeideChristopher Pal
2021-02-19
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafał PowalskiŁukasz BorchmannDawid JurkiewiczTomasz DwojakMichał PietruszkaGabriela Pałka
2021-02-18
UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning
Mihaela GamanSebastian CojocariuRadu Tudor Ionescu
2021-02-18
Training Large-Scale News Recommenders with Pretrained Language Models in the Loop
Shitao XiaoZheng LiuYingxia ShaoTao DiXing Xie
2021-02-18
Quiz-Style Question Generation for News Stories
| Adam D. LelkesVinh Q. TranCong Yu
2021-02-18
A Mathematical Principle of Deep Learning: Learn the Geodesic Curve in the Wasserstein Space
Kuo GaiShihua Zhang
2021-02-18
Recurrent Rational Networks
| Quentin DelfossePatrick SchramowskiAlejandro MolinaKristian Kersting
2021-02-18
SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday Acronyms for Scientific Domain
| Aadarsh SinghPriyanshu Kumar
2021-02-17
Leveraging Query Resolution and Reading Comprehension for Conversational Passage Retrieval
Svitlana VakulenkoNikos VoskaridesZhucheng TuShayne Longpre
2021-02-17
THEaiTRE 1.0: Interactive generation of theatre play scripts
Rudolf RosaTomáš MusilOndřej DušekDominik JurkoPatrícia SchmidtováDavid MarečekOndřej BojarTom KocmiDaniel HrbekDavid KošťákMartina KinskáMarie NovákováJosef DoležalKlára VoseckáTomáš StudeníkPetr Žabka
2021-02-17
A Dataset and Benchmark for Malaria Life-Cycle Classification in Thin Blood Smear Images
Qazi Ammar ArshadMohsen AliSaeed-Ul HassanChen ChenAyisha ImranGhulam RasulWaqas Sultani
2021-02-17
Ensemble Transfer Learning of Elastography and B-mode Breast Ultrasound Images
Sampa MisraSeungwan JeonRavi ManaguliSeiyon LeeGyuwon KimSeungchul LeeRichard G BarrChulhong Kim
2021-02-17
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
Aston ZhangYi TayShuai ZhangAlvin ChanAnh Tuan LuuSiu Cheung HuiJie Fu
2021-02-17
TCN: Table Convolutional Network for Web Table Interpretation
Daheng WangPrashant ShiralkarColin LockardBinxuan HuangXin Luna DongMeng Jiang
2021-02-17
LambdaNetworks: Modeling Long-Range Interactions Without Attention
| Irwan Bello
2021-02-17
Non-Autoregressive Text Generation with Pre-trained Language Models
Yixuan SuDeng CaiYan WangDavid VandykeSimon BakerPiji LiNigel Collier
2021-02-16
Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet
M. Onat TopalAnil BasImke van Heerden
2021-02-16
TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Zhuohan LiSiyuan ZhuangShiyuan GuoDanyang ZhuoHao ZhangDawn SongIon Stoica
2021-02-16
Have Attention Heads in BERT Learned Constituency Grammar?
Ziyang Luo
2021-02-16
Revisiting Language Encoding in Learning Multilingual Representations
| Shengjie LuoKaiyuan GaoShuxin ZhengGuolin KeDi HeLiWei WangTie-Yan Liu
2021-02-16
Improving Deep-learning-based Semi-supervised Audio Tagging with Mixup
Léo CancesEtienne LabbéThomas Pellegrini
2021-02-16
GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
| Chen ZhuRenkun NiZheng XuKezhi KongW. Ronny HuangTom Goldstein
2021-02-16
Axial Residual Networks for CycleGAN-based Voice Conversion
Jaeseong YouGyuhyeon NamDalhyun KimGyeongsu Chae
2021-02-16
Complex Momentum for Learning in Games
Jonathan LorraineDavid AcunaPaul VicolDavid Duvenaud
2021-02-16
An AutoML-based Approach to Multimodal Image Sentiment Analysis
Vasco LopesAntónio GasparLuís A. AlexandreJoão Cordeiro
2021-02-16
The corruptive force of AI-generated advice
Margarita LeibNils C. KöbisRainer Michael RilkeMarloes HagensBernd Irlenbusch
2021-02-15
Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm
Laria ReynoldsKyle McDonell
2021-02-15
Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation
Rohan Sukumaran
2021-02-15
Detection and severity classification of COVID-19 in CT images using deep learning
Yazan QiblaweyAnas TahirMuhammad E. H. ChowdhuryAmith KhandakarSerkan KiranyazTawsifur RahmanNabil IbtehazSakib MahmudSomaya Al-MadeedFarayi Musharavati
2021-02-15
Fast End-to-End Speech Recognition via a Non-Autoregressive Model and Cross-Modal Knowledge Transferring from BERT
Ye BaiJiangyan YiJianHua TaoZhengkun TianZhengqi WenShuai Zhang
2021-02-15
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages
Baptiste RoziereMarie-Anne LachauxMarc SzafraniecGuillaume Lample
2021-02-15
Translational Equivariance in Kernelizable Attention
| Max HornKumar ShridharElrich GroenewaldPhilipp F. M. Baumann
2021-02-15
Momentum Residual Neural Networks
| Michael E. SanderPierre AblinMathieu BlondelGabriel Peyré
2021-02-15
Within-Document Event Coreference with BERT-Based Contextualized Representations
Shafiuddin Rehan AhmedJames H. Martin
2021-02-15
indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages
| Kushal KediaAbhilash Nandy
2021-02-14
indicnlp@ kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages
| Kushal KediaAbhilash Nandy
2021-02-14
Fast, Accurate Barcode Detection in Ultra High-Resolution Images
Jerome QuenumKehan WangAvideh Zakhor
2021-02-13
Multiversal views on language models
Laria ReynoldsKyle McDonell
2021-02-12
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention
Peng LiuYuewen CaoSongxiang LiuNa HuGuangzhi LiChao WengDan Su
2021-02-12
Optimizing Inference Performance of Transformers on CPUs
Dave DiceAlex Kogan
2021-02-12
Improving Zero-shot Neural Machine Translation on Language-specific Encoders-Decoders
Junwei LiaoYu ShiMing GongLinjun ShouHong QuMichael Zeng
2021-02-12
Dancing along Battery: Enabling Transformer with Run-time Reconfigurability on Mobile Devices
Yuhong SongWeiwen JiangBingbing LiPanjie QiQingfeng ZhugeEdwin Hsing-Mean ShaSakyasingha DasguptaYiyu ShiCaiwen Ding
2021-02-12
Dynamic Precision Analog Computing for Neural Networks
| Sahaj GargJoe LouAnirudh JainMitchell Nahmias
2021-02-12
Transformer Language Models with LSTM-based Cross-utterance Information Representation
| G. SunC. ZhangP. C. Woodland
2021-02-12
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits
| Leonid BoytsovZico Kolter
2021-02-12
Characterizing English Variation across Social Media Communities with BERT
| Li LucyDavid Bamman
2021-02-12
Towards DeepSentinel: An extensible corpus of labelled Sentinel-1 and -2 imagery and a general-purpose sensor-fusion semantic embedding model
Lucas Kruitwagen
2021-02-11
Proof Artifact Co-training for Theorem Proving with Language Models
| Jesse Michael HanJason RuteYuhuai WuEdward W. AyersStanislas Polu
2021-02-11
Text Compression-aided Transformer Encoding
Zuchao LiZhuosheng ZhangHai ZhaoRui WangKehai ChenMasao UtiyamaEiichiro Sumita
2021-02-11
NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting
| Kai ChenGuang ChenDan XuLijun ZhangYuyao HuangAlois Knoll
2021-02-10
Searching for Fast Model Families on Datacenter Accelerators
Sheng LiMingxing TanRuoming PangAndrew LiLiqun ChengQuoc LeNorman P. Jouppi
2021-02-10
Improving Model-Based Reinforcement Learning with Internal State Representations through Self-Supervision
| Julien ScholzCornelius WeberMuhammad Burhan HafezStefan Wermter
2021-02-10
Pruning of Convolutional Neural Networks Using Ising Energy Model
| Hojjat SalehinejadShahrokh Valaee
2021-02-10
BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction
| Yuhang LiRuihao GongXu TanYang YangPeng HuQi ZhangFengwei YuWei WangShi Gu
2021-02-10
Application of Yolo on Mask Detection Task
Ren LiuZiang Ren
2021-02-10
Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
| Benjia ZhouYunan LiJun Wan
2021-02-10
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
Yue MengRameswar PandaChung-Ching LinPrasanna SattigeriLeonid KarlinskyKate SaenkoAude OlivaRogerio Feris
2021-02-10
Joint Intent Detection and Slot Filling with Wheel-Graph Attention Networks
Pengfei WeiBi ZengWenxiong Liao
2021-02-09
Distribution Adaptive INT8 Quantization for Training CNNs
Kang ZhaoSida HuangPan PanYinghan LiYingya ZhangZhenyu GuYinghui Xu
2021-02-09
Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning
Yu LiuLianghua HuangPan PanBin WangYinghui XuRong Jin
2021-02-09
MALI: A memory efficient and reverse accurate integrator for Neural ODEs
| Juntang ZhuangNicha C. DvornekSekhar TatikondaJames S. Duncan
2021-02-09
Conversational Query Rewriting with Self-supervised Learning
Hang LiuMeng ChenYouzheng WuXiaodong HeBoWen Zhou
2021-02-09
Bayesian Transformer Language Models for Speech Recognition
Boyang XueJianwei YuJunhao XuShansong LiuShoukang HuZi YeMengzhe GengXunying LiuHelen Meng
2021-02-09
NewsBERT: Distilling Pre-trained Language Model for Intelligent News Application
Chuhan WuFangzhao WuYang YuTao QiYongfeng HuangQi Liu
2021-02-09
AuGPT: Dialogue with Pre-trained Language Models and Data Augmentation
| Jonáš KulhánekVojtěch HudečekTomáš NekvindaOndřej Dušek
2021-02-09
Point Cloud Transformers applied to Collider Physics
| Vinicius MikuniFlorencia Canelli
2021-02-09
Transfer Learning Approach for Arabic Offensive Language Detection System -- BERT-Based Model
Fatemah HusainOzlem Uzuner
2021-02-09
Colorization Transformer
| Manoj KumarDirk WeissenbornNal Kalchbrenner
2021-02-08
TransReID: Transformer-based Object Re-Identification
| Shuting HeHao LuoPichao WangFan WangHao LiWei Jiang
2021-02-08
Generating Fake Cyber Threat Intelligence Using Transformer-Based Models
Priyanka RanadeAritran PiplaiSudip MittalAnupam JoshiTim Finin
2021-02-08
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
| Jieneng ChenYongyi LuQihang YuXiangde LuoEhsan AdeliYan WangLe LuAlan L. YuilleYuyin Zhou
2021-02-08
Spike-based Residual Blocks
Wei FangZhaofei YuTimothée MasquelierYanqi ChenTiejun HuangYonghong Tian
2021-02-08
How True is GPT-2? An Empirical Analysis of Intersectional Occupational Biases
| Hannah KirkYennie JunHaider IqbalElias BenussiFilippo VolpinFrederic A. DreyerAleksandar ShtedritskiYuki M. Asano
2021-02-08
A Hybrid Task-Oriented Dialog System with Domain and Task Adaptive Pretraining
| Boliang ZhangYing LyuNing DingTianhao ShenZhaoyang JiaKun HanKevin Knight
2021-02-08
Wake Word Detection with Streaming Transformers
Yiming WangHang LvDaniel PoveyLei XieSanjeev Khudanpur
2021-02-08
Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention
| Yunyang XiongZhanpeng ZengRudrasis ChakrabortyMingxing TanGlenn FungYin LiVikas Singh
2021-02-07
Spoiler Alert: Using Natural Language Processing to Detect Spoilers in Book Reviews