论文标题
近端策略优化基于IRS辅助ISAC系统的THZ频段的传输束形和相移设计
Proximal Policy Optimization-based Transmit Beamforming and Phase-shift Design in an IRS-aided ISAC System for the THz Band
论文作者
论文摘要
在本文中,提出了在Terahertz(THZ)频段运行的IRS辅助综合传感和通信(ISAC)系统,以最大程度地提高系统容量。传输横梁成形和相移设计被转变为带有Ergodic约束的通用优化问题。然后,通过基于梯度的,基于梯度的二次近端策略优化(PPO)在多用户多输入单输出(MISO)方案中实现了传输波束形成和相移设计的联合优化。具体而言,演员部分会产生连续的传输光束形成,评论家部分负责离散相移设计。基于MISO方案,我们研究了一个分布式PPO(DPPO)框架,该框架具有多用户多输入多输入(MIMO)方案中多线程学习的概念。仿真结果证明了原始偶二PPO算法及其多线程版本的有效性,该版本就传输光束成型和相移设计而言。
In this paper, an IRS-aided integrated sensing and communications (ISAC) system operating in the terahertz (THz) band is proposed to maximize the system capacity. Transmit beamforming and phase-shift design are transformed into a universal optimization problem with ergodic constraints. Then the joint optimization of transmit beamforming and phase-shift design is achieved by gradient-based, primal-dual proximal policy optimization (PPO) in the multi-user multiple-input single-output (MISO) scenario. Specifically, the actor part generates continuous transmit beamforming and the critic part takes charge of discrete phase shift design. Based on the MISO scenario, we investigate a distributed PPO (DPPO) framework with the concept of multi-threading learning in the multi-user multiple-input multiple-output (MIMO) scenario. Simulation results demonstrate the effectiveness of the primal-dual PPO algorithm and its multi-threading version in terms of transmit beamforming and phase-shift design.