Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification

8 Sep 2020  ·  Yunsheng Shi, Zhengjie Huang, Shikun Feng, Hui Zhong, Wenjin Wang, Yu Sun ·

Graph neural network (GNN) and label propagation algorithm (LPA) are both message passing algorithms, which have achieved superior performance in semi-supervised classification. GNN performs feature propagation by a neural network to make predictions, while LPA uses label propagation across graph adjacency matrix to get results. However, there is still no effective way to directly combine these two kinds of algorithms. To address this issue, we propose a novel Unified Message Passaging Model (UniMP) that can incorporate feature and label propagation at both training and inference time. First, UniMP adopts a Graph Transformer network, taking feature embedding and label embedding as input information for propagation. Second, to train the network without overfitting in self-loop input label information, UniMP introduces a masked label prediction strategy, in which some percentage of input label information are masked at random, and then predicted. UniMP conceptually unifies feature propagation and label propagation and is empirically powerful. It obtains new state-of-the-art semi-supervised classification results in Open Graph Benchmark (OGB).

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Node Property Prediction ogbn-arxiv UniMP Test Accuracy 0.7311 ± 0.0020 # 46
Validation Accuracy 0.7450 ± 0.0005 # 45
Number of params 473489 # 51
Ext. data No # 1
Node Property Prediction ogbn-arxiv UniMP_large Test Accuracy 0.7379 ± 0.0014 # 35
Validation Accuracy 0.7475 ± 0.0008 # 43
Number of params 1162515 # 43
Ext. data No # 1
Node Property Prediction ogbn-arxiv UniMP_v2 Test Accuracy 0.7397 ± 0.0015 # 31
Validation Accuracy 0.7506 ± 0.0009 # 31
Number of params 687377 # 46
Ext. data No # 1
Node Property Prediction ogbn-papers100M TransformerConv Test Accuracy 0.6736 ± 0.0010 # 11
Validation Accuracy 0.7172 ± 0.0005 # 7
Number of params 883378 # 17
Ext. data No # 1
Node Property Prediction ogbn-products UniMP Test Accuracy 0.8256 ± 0.0031 # 31
Validation Accuracy 0.9308 ± 0.0017 # 21
Number of params 1475605 # 26
Ext. data No # 1
Node Property Prediction ogbn-proteins UniMP+CrossEdgeFeat Test ROC-AUC 0.8691 ± 0.0018 # 10
Validation ROC-AUC 0.9258 ± 0.0009 # 8
Number of params 1959984 # 15
Ext. data No # 1
Node Property Prediction ogbn-proteins UniMP Test ROC-AUC 0.8642 ± 0.0008 # 12
Validation ROC-AUC 0.9175 ± 0.0006 # 12
Number of params 1909104 # 16
Ext. data No # 1

Methods