Parallel Training of Knowledge Graph Embedding Models: A Comparison of Techniques

Knowledge graph embedding (KGE) models represent the entities and relations of a knowledge graph (KG) using dense continuous representations called embeddings. KGE methods have recently gained traction for tasks such as knowledge graph completion and reasoning, as well as for providing suitable entity representations for downstream learning tasks. While a large part of the available literature focuses on small KGs, a number of frameworks that can train KGE models for large-scale KGs by parallelizing across multiple GPUs or machines have recently been proposed. So far, the benefits and drawbacks of the various parallelization techniques have not been studied comprehensively. In this paper, we report on an experimental study in which we presented the available techniques, re-implemented them in a common computational framework, investigated them, and improved them. We found that the evaluation methodologies used in prior work are often not comparable and can be misleading, and that most of the currently implemented training methods tend to have a negative impact on embedding quality. To mitigate this, we propose a simple but effective variation of the stratification technique used by PyTorch BigGraph. Moreover, basic random partitioning can be an effective or even the best-performing choice when combined with suitable sampling techniques. Ultimately, we found that efficient and effective parallel training of large-scale KGE models is indeed achievable but requires a careful choice of techniques.
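For readers unfamiliar with stratification, the sketch below illustrates the general idea behind a PyTorch BigGraph-style scheme: entities are split into partitions, triples are grouped into blocks by the partitions of their head and tail, and blocks that share no partition are scheduled together so they can be trained in parallel without conflicting embedding updates. This is a minimal illustrative sketch under assumed function names and a greedy scheduler, not the paper's exact variant or PyTorch BigGraph's implementation.

```python
import random
from collections import defaultdict
from itertools import product


def partition_entities(entities, num_parts, seed=0):
    """Randomly assign each entity to one of num_parts partitions."""
    rng = random.Random(seed)
    return {e: rng.randrange(num_parts) for e in entities}


def stratify_triples(triples, entity_part):
    """Group (head, relation, tail) triples into (head-partition, tail-partition) blocks."""
    blocks = defaultdict(list)
    for h, r, t in triples:
        blocks[(entity_part[h], entity_part[t])].append((h, r, t))
    return blocks


def schedule_rounds(num_parts):
    """Greedily build rounds of blocks such that no two blocks in a round share
    an entity partition; the blocks of one round can then be trained in parallel
    without conflicting embedding updates."""
    remaining = set(product(range(num_parts), repeat=2))
    while remaining:
        used, round_blocks = set(), []
        for i, j in sorted(remaining):
            if i not in used and j not in used:
                round_blocks.append((i, j))
                used.update({i, j})
        remaining.difference_update(round_blocks)
        yield round_blocks


# Example usage with toy data (identifiers are illustrative):
# part = partition_entities(range(num_entities), num_parts=4)
# blocks = stratify_triples(triples, part)
# for round_blocks in schedule_rounds(4):
#     ...  # train each block of this round on a separate worker/GPU
```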

Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Link Prediction | Wikidata5M | ComplEx | MRR | 0.308 | # 6 |
| Link Prediction | Wikidata5M | ComplEx | Hits@1 | 0.255 | # 7 |
| Link Prediction | Wikidata5M | ComplEx | Hits@10 | 0.398 | # 6 |
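For reference, MRR and Hits@k are standard ranking metrics for link prediction, computed from the 1-based rank assigned to each test triple's correct entity. The sketch below is generic and illustrative (function names are assumptions), not code from the paper.

```python
def mrr(ranks):
    """Mean reciprocal rank over the 1-based ranks of the correct entities."""
    return sum(1.0 / r for r in ranks) / len(ranks)


def hits_at_k(ranks, k):
    """Fraction of test triples whose correct entity is ranked within the top k."""
    return sum(r <= k for r in ranks) / len(ranks)


# Example: ranks = [1, 3, 12] gives mrr(ranks) ≈ 0.472 and hits_at_k(ranks, 10) ≈ 0.667.
```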
