论文标题
走向更深的图形神经网络
Towards Deeper Graph Neural Networks
论文作者
论文摘要
图神经网络在图表学习领域显示出显着成功。图卷积执行邻域聚集,并代表最重要的图形操作之一。然而,这些邻居聚集方法中的一层仅考虑直接邻居,并且在更深的过程中,性能会降低以实现更大的接受场。最近的一些研究将这种绩效恶化归因于过度平滑的问题,该问题指出,重复传播使不同类别的节点表示无法区分。在这项工作中,我们系统地研究了这一观察结果,并为更深的图形神经网络发展了新的见解。首先,我们在此问题上提供了系统的分析,并认为损害性能的关键因素显着损害了当前图卷积操作中的表示转换和传播的纠缠。解耦这两个操作后,更深的图神经网络可用于从较大的接受场中学习图形表示。当构建非常深入的模型时,我们进一步提供了上述观察结果的理论分析,这可以作为对过度平滑问题的严格而温和的描述。基于我们的理论和经验分析,我们提出了深层自适应图神经网络(DAGNN),以适应性地整合了来自大型接受场的信息。一组有关引文,共同创作和共同购买数据集的实验证实了我们的分析和见解,并证明了我们提出的方法的优势。
Graph neural networks have shown significant success in the field of graph representation learning. Graph convolutions perform neighborhood aggregation and represent one of the most important graph operations. Nevertheless, one layer of these neighborhood aggregation methods only consider immediate neighbors, and the performance decreases when going deeper to enable larger receptive fields. Several recent studies attribute this performance deterioration to the over-smoothing issue, which states that repeated propagation makes node representations of different classes indistinguishable. In this work, we study this observation systematically and develop new insights towards deeper graph neural networks. First, we provide a systematical analysis on this issue and argue that the key factor compromising the performance significantly is the entanglement of representation transformation and propagation in current graph convolution operations. After decoupling these two operations, deeper graph neural networks can be used to learn graph node representations from larger receptive fields. We further provide a theoretical analysis of the above observation when building very deep models, which can serve as a rigorous and gentle description of the over-smoothing issue. Based on our theoretical and empirical analysis, we propose Deep Adaptive Graph Neural Network (DAGNN) to adaptively incorporate information from large receptive fields. A set of experiments on citation, co-authorship, and co-purchase datasets have confirmed our analysis and insights and demonstrated the superiority of our proposed methods.