Strategy Design of Maritime Unmanned Cluster Cooperative Defense for Joint All-Domain Operations

Abstract: For joint all-domain operations, this paper proposes a scenario in which the action units of a maritime unmanned cluster defense system operate cooperatively. Building on deep reinforcement learning, it proposes a multi-agent deep deterministic policy gradient (MADDPG) algorithm applied to heterogeneous clusters. The state space, action space, and reward function of the algorithm model are designed, and a framework of centralized training and decentralized execution helps the agents learn cooperative defense behavior quickly. The scenario is implemented in simulation, which verifies that the trained action units acquire cooperative combat capability, making the combat process more intelligent.
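As a rough illustration of the centralized-training, decentralized-execution structure named in the abstract, the sketch below shows how each unit's actor can depend only on its own observation while each critic receives the joint observations and actions of all units. It is a minimal sketch under stated assumptions, not the paper's model: the observation and action dimensions, the single linear layers, and the names `OBS_DIMS`, `ACT_DIMS`, `act`, and `q_value` are all hypothetical, and no training step is included.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical heterogeneous units: each has its own observation and action size.
OBS_DIMS = [6, 4, 4]
ACT_DIMS = [2, 3, 3]


def linear(in_dim, out_dim):
    """Random weights for one linear layer (a stand-in for a trained network)."""
    return rng.normal(scale=0.1, size=(in_dim, out_dim))


# Decentralized actors: unit i maps only its own observation to its action.
actors = [linear(o, a) for o, a in zip(OBS_DIMS, ACT_DIMS)]

# Centralized critics: each critic sees every unit's observation and action,
# concatenated into one joint vector. They are needed only during training.
joint_dim = sum(OBS_DIMS) + sum(ACT_DIMS)
critics = [linear(joint_dim, 1) for _ in OBS_DIMS]


def act(i, obs_i):
    """Deterministic policy of unit i, bounded to [-1, 1] by tanh."""
    return np.tanh(obs_i @ actors[i])


def q_value(i, all_obs, all_acts):
    """Centralized action value for unit i, computed from joint information."""
    joint = np.concatenate(list(all_obs) + list(all_acts))
    return float((joint @ critics[i])[0])


observations = [rng.normal(size=d) for d in OBS_DIMS]
actions = [act(i, o) for i, o in enumerate(observations)]
q_values = [q_value(i, observations, actions) for i in range(len(OBS_DIMS))]
```

At execution time only `act` would be called, so each unit needs nothing beyond its local observation. The critics, which require global information, would be discarded after training.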