论文标题
迈向深度监督的异常检测:从部分标记为异常数据的加固学习
Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly Data
论文作者
论文摘要
我们考虑了一组部分标记为异常示例和一个大规模的未标记数据集的异常检测问题。这是许多重要应用中的常见情况。现有的相关方法要么仅适合通常不会跨越整个异常集的有限异常示例,要么从未标记的数据中进行无监督的学习。相反,我们在这里提出了一种基于强化的学习方法,该方法能够对标记和未标记异常的检测进行端到端优化。这种方法通过自动与异常偏见的模拟环境进行自动相互作用来了解已知的异常,同时不断将学习的异常扩展到新的异常类(即未知异常),通过积极探索未标记数据中可能的异常情况。这是通过共同优化对小标记的异常数据的剥削和罕见未标记异常的探索来实现的。对48个现实世界数据集进行的广泛实验表明,我们的模型大大优于五种最先进的竞争方法。
We consider the problem of anomaly detection with a small set of partially labeled anomaly examples and a large-scale unlabeled dataset. This is a common scenario in many important applications. Existing related methods either exclusively fit the limited anomaly examples that typically do not span the entire set of anomalies, or proceed with unsupervised learning from the unlabeled data. We propose here instead a deep reinforcement learning-based approach that enables an end-to-end optimization of the detection of both labeled and unlabeled anomalies. This approach learns the known abnormality by automatically interacting with an anomaly-biased simulation environment, while continuously extending the learned abnormality to novel classes of anomaly (i.e., unknown anomalies) by actively exploring possible anomalies in the unlabeled data. This is achieved by jointly optimizing the exploitation of the small labeled anomaly data and the exploration of the rare unlabeled anomalies. Extensive experiments on 48 real-world datasets show that our model significantly outperforms five state-of-the-art competing methods.