论文标题
路径插定策略不规则时间序列的签名模型
Path Imputation Strategies for Signature Models of Irregular Time Series
论文作者
论文摘要
签名变换是连续矢量值路径空间上的“通用非线性”,并且已在计时序列中引起了机器学习的关注。但是,通常在离散时间点观察到现实世界的时间数据,必须首先将其转换为连续路径,然后才能应用签名技术。我们通过将其描述为插补问题来明确说明这一步骤,并在将基于签名的神经网应用于不规则的时间序列数据时,从经验上评估各种插补策略的影响。对于这些策略之一,高斯流程(GP)适配器,我们提出了一个扩展〜(GP-POM),该扩展名(GP-POM)将不确定性信息直接提供给后续分类器,同时又可以防止昂贵的蒙特卡洛(MC)采样。在我们的实验中,我们发现插补的选择会极大地影响浅层签名模型,而更深层次的体系结构更加可靠。接下来,我们观察到,即使与常规GP适配器的不确定性感知训练相比,即使是不确定性感知的预测(基于GP-POM或指标归纳)也对预测性能有益。总之,我们已经证明,路径构建对于签名模型确实至关重要,并且我们提出的策略总体上可以提高竞争性能,同时尤其是提高了签名模型的鲁棒性。
The signature transform is a 'universal nonlinearity' on the space of continuous vector-valued paths, and has received attention for use in machine learning on time series. However, real-world temporal data is typically observed at discrete points in time, and must first be transformed into a continuous path before signature techniques can be applied. We make this step explicit by characterising it as an imputation problem, and empirically assess the impact of various imputation strategies when applying signature-based neural nets to irregular time series data. For one of these strategies, Gaussian process (GP) adapters, we propose an extension~(GP-PoM) that makes uncertainty information directly available to the subsequent classifier while at the same time preventing costly Monte-Carlo (MC) sampling. In our experiments, we find that the choice of imputation drastically affects shallow signature models, whereas deeper architectures are more robust. Next, we observe that uncertainty-aware predictions (based on GP-PoM or indicator imputations) are beneficial for predictive performance, even compared to the uncertainty-aware training of conventional GP adapters. In conclusion, we have demonstrated that the path construction is indeed crucial for signature models and that our proposed strategy leads to competitive performance in general, while improving robustness of signature models in particular.