Contrastive BERT (CoBERL) is a reinforcement learning agent that combines a new contrastive loss with a hybrid LSTM-transformer architecture to tackle the challenge of improving data efficiency in RL. It uses bidirectional masked prediction together with a generalization of recent contrastive methods to learn better representations for transformers in RL, without the need for hand-engineered data augmentations.
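The bidirectional masked prediction is BERT-style: some timesteps of the encoded sequence are hidden, and the model must reconstruct them from the surrounding (unmasked) context. A minimal NumPy sketch of the masking step, with an assumed 15% masking rate and a zero vector standing in for the mask token:

```python
import numpy as np

rng = np.random.default_rng(0)
T, D = 10, 4
embeddings = rng.normal(size=(T, D))   # encoder embeddings Y_t, one row per timestep

mask_prob = 0.15                       # masking rate (illustrative assumption)
mask = rng.random(T) < mask_prob       # boolean mask over timesteps
masked = embeddings.copy()
masked[mask] = 0.0                     # replace masked timesteps with a mask token

# Training would ask the transformer to reconstruct embeddings[mask]
# from the unmasked context in `masked`.
```

Unmasked timesteps pass through unchanged; only the masked positions carry a prediction target.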
For the architecture, a residual network encodes observations into embeddings $Y_{t}$. $Y_{t}$ is fed through a causally masked GTrXL transformer, which computes the predicted masked inputs $X_{t}$ and passes them, together with $Y_{t}$, to a learnt gate. The output of the gate is passed through a single LSTM layer to produce the values used for computing the RL loss. A contrastive loss is computed using the predicted masked inputs $X_{t}$, with $Y_{t}$ as targets; for this, the causal mask of the transformer is not used.
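The contrastive loss aligns each predicted masked input $X_{t}$ with its own target $Y_{t}$ while pushing it away from the embeddings of other timesteps. The following is a simplified stand-in (an InfoNCE-style objective over cosine similarities, not the exact ReLIC formulation used in the paper):

```python
import numpy as np

def info_nce(x, y, temperature=0.1):
    """Each prediction x[t] should match its own target y[t]
    against all other timesteps acting as negatives."""
    # L2-normalise so dot products are cosine similarities.
    x = x / np.linalg.norm(x, axis=-1, keepdims=True)
    y = y / np.linalg.norm(y, axis=-1, keepdims=True)
    logits = x @ y.T / temperature               # (T, T) similarity matrix
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positive pairs lie on the diagonal (x[t] paired with y[t]).
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
T, D = 8, 16
y = rng.normal(size=(T, D))                   # encoder targets Y_t
x_good = y + 0.01 * rng.normal(size=(T, D))   # predictions close to targets
x_bad = rng.normal(size=(T, D))               # unrelated predictions
assert info_nce(x_good, y) < info_nce(x_bad, y)
```

Because the negatives come from other timesteps of the same trajectory, no hand-engineered data augmentations are needed to form contrastive pairs.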
Source: CoBERL: Contrastive BERT for Reinforcement Learning
| Component | Type |
|---|---|
| GTrXL | RL Transformers |
| LSTM | Recurrent Neural Networks |
| ReLIC | Self-Supervised Learning |
| Residual Connection | Skip Connections |