TREE-G: Decision Trees Contesting Graph Neural Networks

6 Jul 2022  Â·  Maya Bechler-Speicher, Amir Globerson, Ran Gilad-Bachrach ·

When dealing with tabular data, models based on decision trees are a popular choice due to their high accuracy on these data types, their ease of application, and explainability properties. However, when it comes to graph-structured data, it is not clear how to apply them effectively, in a way that incorporates the topological information with the tabular data available on the vertices of the graph. To address this challenge, we introduce TREE-G. TREE-G modifies standard decision trees, by introducing a novel split function that is specialized for graph data. Not only does this split function incorporate the node features and the topological information, but it also uses a novel pointer mechanism that allows split nodes to use information computed in previous splits. Therefore, the split function adapts to the predictive task and the graph at hand. We analyze the theoretical properties of TREE-G and demonstrate its benefits empirically on multiple graph and vertex prediction benchmarks. In these experiments, TREE-G consistently outperforms other tree-based models and often outperforms other graph-learning algorithms such as Graph Neural Networks (GNNs) and Graph Kernels, sometimes by large margins. Moreover, TREE-Gs models and their predictions can be explained and visualized

PDF Abstract

Results from the Paper


 Ranked #1 on Graph Classification on HIV dataset (Accuracy metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Node Classification Citeseer TREE-G Accuracy 74.5 # 28
Node Classification Cora: fixed 20 node per class TREE-G Accuracy 83.5 # 4
Graph Classification D&D TREE-G Accuracy 76.2% # 33
Graph Classification ENZYMES TREE-G Accuracy 59.6 # 21
Graph Classification HIV dataset TREE-G Accuracy 83.5 # 1
Graph Classification IMDb-B TREE-G Accuracy 73% # 25
Graph Classification IMDb-M TREE-G Accuracy 56.4% # 3
Graph Classification MUTAG TREE-G Accuracy 91.1% # 16
Graph Classification Mutagenicity TREE-G Accuracy 83 # 1
Graph Classification NCI1 TREE-G Accuracy 75.9% # 38
Graph Classification PROTEINS TREE-G Accuracy 75.6 # 52
Graph Classification PTC TREE-G Accuracy 59.1% # 36
Node Classification Pubmed TREE-G Accuracy 78.0 # 56

Methods


No methods listed for this paper. Add relevant methods here