论文标题

基于元加强学习的边缘计算中的快速自适应任务卸载

Fast Adaptive Task Offloading in Edge Computing based on Meta Reinforcement Learning

论文作者

Wang, Jin, Hu, Jia, Min, Geyong, Zomaya, Albert Y., Georgalas, Nektarios

论文摘要

多访问边缘计算(MEC)旨在将云服务扩展到网络边缘,以减少网络流量和服务延迟。 MEC中的一个基本问题是如何有效地从用户设备(UE)到MEC主机有效地卸载移动应用程序的异质任务。最近,已经提出了许多基于深入的强化学习(DRL)方法,以通过与由UE,无线渠道和MEC宿主组成的MEC环境进行交互来学习卸载策略。但是,这些方法对新环境的适应性较弱,因为它们的样本效率较低,并且需要全面的再培训才能学习新环境的更新政策。为了克服这一弱点,我们提出了一种基于元强化学习的任务卸载方法,该方法可以快速适应具有少量梯度更新和样本的新环境。我们通过定制的序列到序列(SEQ2SEQ)神经网络将移动应用程序建模为指示无环图(DAG)和卸载策略。为了有效地训练SEQ2SEQ网络,我们提出了一种协同一阶近似和剪辑替代目标的方法。实验结果表明,与三个基线相比,这种新的卸载方法可以将潜伏期降低25%,同时能够快速适应新环境。

Multi-access edge computing (MEC) aims to extend cloud service to the network edge to reduce network traffic and service latency. A fundamental problem in MEC is how to efficiently offload heterogeneous tasks of mobile applications from user equipment (UE) to MEC hosts. Recently, many deep reinforcement learning (DRL) based methods have been proposed to learn offloading policies through interacting with the MEC environment that consists of UE, wireless channels, and MEC hosts. However, these methods have weak adaptability to new environments because they have low sample efficiency and need full retraining to learn updated policies for new environments. To overcome this weakness, we propose a task offloading method based on meta reinforcement learning, which can adapt fast to new environments with a small number of gradient updates and samples. We model mobile applications as Directed Acyclic Graphs (DAGs) and the offloading policy by a custom sequence-to-sequence (seq2seq) neural network. To efficiently train the seq2seq network, we propose a method that synergizes the first order approximation and clipped surrogate objective. The experimental results demonstrate that this new offloading method can reduce the latency by up to 25% compared to three baselines while being able to adapt fast to new environments.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源