论文标题

Formic:通过具有隐式通信的多重RL觅食

ForMIC: Foraging via Multiagent RL with Implicit Communication

论文作者

Shaw, Samuel, Wenzel, Emerson, Walker, Alexis, Sartoretti, Guillaume

论文摘要

多机构觅食(MAF)涉及分发一组代理以搜索环境并从中提取资源。大自然提供了高效觅食者的几个例子,其中觅食集体使用生物学标记(例如,信息素)中的个人通过环境将关键信息传达给他人。在这项工作中,我们提出了一种分布式加强学习MAF方法的FOMIC,该方法通过共同的环境赋予代理人具有隐性的沟通能力。但是,通过污名相互作用的学习有效政策是高度不平凡的,因为代理需要表现良好才能互相发送有用的信号,但也需要感知他人的信号以表现良好。在这项工作中,我们开发了几种关键的学习技术,用于具有污名相互作用的培训政策,在这种循环依赖中存在。通过依靠巧妙的课程学习设计,操作过滤以及引入非学习代理以以低计算成本提高训练时间的代理密度,我们开发了一个最小的学习框架,从而导致对有效污名策略进行稳定的培训。我们提出了仿真结果,这些结果表明,我们所学会的政策在一组实验中优于现有的最新MAF算法,这些实验会改变团队规模,资源的数量和位置以及在培训时间看不到的关键环境动态。

Multi-agent foraging (MAF) involves distributing a team of agents to search an environment and extract resources from it. Nature provides several examples of highly effective foragers, where individuals within the foraging collective use biological markers (e.g., pheromones) to communicate critical information to others via the environment. In this work, we propose ForMIC, a distributed reinforcement learning MAF approach that endows agents with implicit communication abilities via their shared environment. However, learning efficient policies with stigmergic interactions is highly nontrivial, since agents need to perform well to send each other useful signals, but also need to sense others' signals to perform well. In this work, we develop several key learning techniques for training policies with stigmergic interactions, where such a circular dependency is present. By relying on clever curriculum learning design, action filtering, and the introduction of non-learning agents to increase the agent density at training time at low computational cost, we develop a minimal learning framework that leads to the stable training of efficient stigmergic policies. We present simulation results which demonstrate that our learned policy outperforms existing state-of-the-art MAF algorithms in a set of experiments that vary team size, number and placement of resources, and key environmental dynamics not seen at training time.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源