Paper Title

MAB-Malware: A Reinforcement Learning Framework for Attacking Static Malware Classifiers

Paper Authors

Wei Song, Xuezixiang Li, Sadia Afroz, Deepali Garg, Dmitry Kuznetsov, Heng Yin

Paper Abstract

Modern commercial antivirus systems increasingly rely on machine learning to keep up with the rampant growth of new malware. However, it is well known that machine learning models are vulnerable to adversarial examples (AEs). Previous works have shown that ML malware classifiers are fragile to white-box adversarial attacks. However, the ML models used in commercial antivirus products are usually not available to attackers and return only hard classification labels. Therefore, it is more practical to evaluate the robustness of ML models and real-world AVs in a pure black-box manner. We propose a black-box Reinforcement Learning (RL) based framework to generate AEs for PE malware classifiers and AV engines. It regards the adversarial attack problem as a multi-armed bandit problem, which finds an optimal balance between exploiting successful patterns and exploring more varieties. Compared to other frameworks, our improvements lie in three points: 1) limiting the exploration space by modeling the generation process as a stateless process to avoid combinatorial explosion; 2) given the critical role of payloads in AE generation, designing the framework to reuse successful payloads; and 3) minimizing the changes to AE samples so that rewards are correctly assigned during RL training, which also helps identify the root cause of each evasion. As a result, our framework achieves much higher black-box evasion rates than other off-the-shelf frameworks. Results show evasion rates of 74%–97% against two state-of-the-art ML detectors and 32%–48% against commercial AVs in a pure black-box setting. We also demonstrate that the transferability of adversarial attacks among ML-based classifiers is higher than the attack transferability between purely ML-based classifiers and commercial AVs.
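
To make the multi-armed bandit framing concrete, below is a minimal, hypothetical Python sketch of the idea the abstract describes: Thompson sampling over a set of PE manipulation actions against a hard-label black-box detector. The action names and the helpers `apply_action` and `classifier_detects` are illustrative assumptions, not the paper's actual implementation or action set.

```python
import random

# Hypothetical set of PE manipulation actions; names are illustrative,
# not the paper's exact action set.
ACTIONS = ["append_overlay_bytes", "add_section", "rename_section",
           "pad_section", "break_checksum"]

class Arm:
    """One Thompson-sampling arm per manipulation action (Beta posterior)."""
    def __init__(self):
        self.alpha = 1.0  # prior + observed evasions (successes)
        self.beta = 1.0   # prior + observed failures

    def sample(self):
        # Draw an estimate of this action's evasion probability.
        return random.betavariate(self.alpha, self.beta)

def mab_attack(malware_bytes, classifier_detects, apply_action, max_pulls=60):
    """Sketch of a stateless MAB attack: each pull independently picks the
    action with the highest sampled reward, applies it, and queries the
    hard-label black-box detector. Reward goes only to the arm whose pull
    produced an evasion, mirroring the abstract's stateless framing."""
    arms = {name: Arm() for name in ACTIONS}
    sample = malware_bytes
    for _ in range(max_pulls):
        action = max(ACTIONS, key=lambda a: arms[a].sample())
        candidate = apply_action(sample, action)  # hypothetical mutation helper
        if not classifier_detects(candidate):     # hard label only: detected or not
            arms[action].alpha += 1               # credit the successful arm
            return candidate                      # evasive adversarial example
        arms[action].beta += 1                    # penalize the failed arm
        sample = candidate                        # keep accumulating mutations
    return None  # query budget exhausted without an evasion
```

In the paper's full framework, a successful sample is additionally minimized so that the reward is assigned to the action (and payload) that actually caused the evasion, and the successful payload is reused in subsequent attacks; the sketch above omits both steps for brevity.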
