论文标题
联合设备调度和资源分配,用于延迟限制的无线联合学习
Joint Device Scheduling and Resource Allocation for Latency Constrained Wireless Federated Learning
论文作者
论文摘要
在联合学习(FL)中,设备通过通过无线渠道上传其本地模型更新来为全球培训做出了贡献。由于计算和通信资源有限,设备调度对于FL的收敛速度至关重要。在本文中,我们提出了一个联合设备调度和资源分配策略,以最大程度地提高给定的总培训时间预算的模型准确性,以确保延迟约束无线FL。训练绩效损失的下限,就训练回合的数量和每轮预定设备的数量而言。基于界限,通过将其分解为两个子问题来解决精度最大化问题。首先,考虑到计划的设备,最佳带宽分配建议将更多的带宽分配给具有较差的通道条件或较弱的计算功能的设备。然后,引入了一种贪婪的设备调度算法,在每个步骤中,该算法选择了使用最佳带宽分配获得的最小更新时间的设备,直到下限开始增加,这意味着计划更多设备将降低模型的精度。实验表明,在广泛的数据分布和单元半径的广泛设置下,提出的策略优于最先进的调度策略。
In federated learning (FL), devices contribute to the global training by uploading their local model updates via wireless channels. Due to limited computation and communication resources, device scheduling is crucial to the convergence rate of FL. In this paper, we propose a joint device scheduling and resource allocation policy to maximize the model accuracy within a given total training time budget for latency constrained wireless FL. A lower bound on the reciprocal of the training performance loss, in terms of the number of training rounds and the number of scheduled devices per round, is derived. Based on the bound, the accuracy maximization problem is solved by decoupling it into two sub-problems. First, given the scheduled devices, the optimal bandwidth allocation suggests allocating more bandwidth to the devices with worse channel conditions or weaker computation capabilities. Then, a greedy device scheduling algorithm is introduced, which in each step selects the device consuming the least updating time obtained by the optimal bandwidth allocation, until the lower bound begins to increase, meaning that scheduling more devices will degrade the model accuracy. Experiments show that the proposed policy outperforms state-of-the-art scheduling policies under extensive settings of data distributions and cell radius.