论文标题

标量耦合使用图形嵌入本地注意编码器的恒定预测

Scalar Coupling Constant Prediction Using Graph Embedding Local Attention Encoder

论文作者

Jian, Caiqing, Cheng, Xinyu, Zhang, Jian, Wang, Lihui

论文摘要

标量耦合常数(SCC)在分析有机物的三维结构中起关键作用,但是,使用量子力学计算的传统SCC预测非常耗时。 To calculate SCC efficiently and accurately, we proposed a graph embedding local self-attention encoder (GELAE) model, in which, a novel invariant structure representation of the coupling system in terms of bond length, bond angle and dihedral angle was presented firstly, and then a local self-attention module embedded with the adjacent matrix of a graph was designed to extract effectively the features of coupling systems, finally, with a modified分类损失函数,预测SCC。为了验证所提出的方法的优越性,我们使用不同的结构表示,不同的注意模块和不同的损失进行了一系列比较实验。实验结果表明,与传统的化学键结构表示相比,这项工作中提出的旋转和翻译不变结构表示可以提高SCC预测准确性。随着嵌入局部自我注意的图,验证集中预测模型的平均绝对误差(MAE)从0.1603 Hz降低到0.1067 Hz;使用基于分类的损耗函数而不是缩放回归损失,预测的SCC的MAE可以降低到0.0963 Hz,该MAE接近Champs数据集中的量子化学标准。

Scalar coupling constant (SCC) plays a key role in the analysis of three-dimensional structure of organic matter, however, the traditional SCC prediction using quantum mechanical calculations is very time-consuming. To calculate SCC efficiently and accurately, we proposed a graph embedding local self-attention encoder (GELAE) model, in which, a novel invariant structure representation of the coupling system in terms of bond length, bond angle and dihedral angle was presented firstly, and then a local self-attention module embedded with the adjacent matrix of a graph was designed to extract effectively the features of coupling systems, finally, with a modified classification loss function, the SCC was predicted. To validate the superiority of the proposed method, we conducted a series of comparison experiments using different structure representations, different attention modules, and different losses. The experimental results demonstrate that, compared to the traditional chemical bond structure representations, the rotation and translation invariant structure representations proposed in this work can improve the SCC prediction accuracy; with the graph embedded local self-attention, the mean absolute error (MAE) of the prediction model in the validation set decreases from 0.1603 Hz to 0.1067 Hz; using the classification based loss function instead of the scaled regression loss, the MAE of the predicted SCC can be decreased to 0.0963 HZ, which is close to the quantum chemistry standard on CHAMPS dataset.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源