TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Supervised Text Retrieval	20 Newsgroups	PSH	Precision@100	0.866	# 2
Supervised Text Retrieval	20 Newsgroups	SSB-VAE	Precision@100	0.878	# 1
Supervised Image Retrieval	CIFAR-10	SSB-VAE	Precision@100	0.910	# 1
Supervised Image Retrieval	CIFAR-10	VDSH	Precision@100	0.867	# 3
Supervised Image Retrieval	CIFAR-10	PSH	Precision@100	0.906	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/self-supervised-bernoulli-autoencoders-for/supervised-text-retrieval-on-20-newsgroups-1)](https://paperswithcode.com/sota/supervised-text-retrieval-on-20-newsgroups-1?p=self-supervised-bernoulli-autoencoders-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/self-supervised-bernoulli-autoencoders-for/supervised-image-retrieval-on-cifar-10)](https://paperswithcode.com/sota/supervised-image-retrieval-on-cifar-10?p=self-supervised-bernoulli-autoencoders-for)`

Self-Supervised Bernoulli Autoencoders for Semi-Supervised Hashing

17 Jul 2020 · Ricardo Ñanculef, Francisco Mena, Antonio Macaluso, Stefano Lodi, Claudio Sartori ·

Semantic hashing is an emerging technique for large-scale similarity search based on representing high-dimensional data using similarity-preserving binary codes used for efficient indexing and search. It has recently been shown that variational autoencoders, with Bernoulli latent representations parametrized by neural nets, can be successfully trained to learn such codes in supervised and unsupervised scenarios, improving on more traditional methods thanks to their ability to handle the binary constraints architecturally. However, the scenario where labels are scarce has not been studied yet. This paper investigates the robustness of hashing methods based on variational autoencoders to the lack of supervision, focusing on two semi-supervised approaches currently in use. The first augments the variational autoencoder's training objective to jointly model the distribution over the data and the class labels. The second approach exploits the annotations to define an additional pairwise loss that enforces consistency between the similarity in the code (Hamming) space and the similarity in the label space. Our experiments show that both methods can significantly increase the hash codes' quality. The pairwise approach can exhibit an advantage when the number of labelled points is large. However, we found that this method degrades quickly and loses its advantage when labelled samples decrease. To circumvent this problem, we propose a novel supervision method in which the model uses its label distribution predictions to implement the pairwise objective. Compared to the best baseline, this procedure yields similar performance in fully supervised settings but improves the results significantly when labelled data is scarce. Our code is made publicly available at https://github.com/amacaluso/SSB-VAE.

PDF Abstract

Code

Add Remove Mark official

amacaluso/SSB-VAE official

Tasks

Add Remove

Supervised Image Retrieval

Supervised Text Retrieval

Datasets

CIFAR-10 20 Newsgroups COVID-19 Twitter Chatter Dataset

Results from the Paper

Add Remove

Ranked #1 on Supervised Image Retrieval on CIFAR-10

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Supervised Text Retrieval	20 Newsgroups	PSH	Precision@100	0.866	# 2	Compare
Supervised Text Retrieval	20 Newsgroups	SSB-VAE	Precision@100	0.878	# 1	Compare
Supervised Image Retrieval	CIFAR-10	SSB-VAE	Precision@100	0.910	# 1	Compare
Supervised Image Retrieval	CIFAR-10	VDSH	Precision@100	0.867	# 3	Compare
Supervised Image Retrieval	CIFAR-10	PSH	Precision@100	0.906	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Self-Supervised Bernoulli Autoencoders for Semi-Supervised Hashing

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove