2.118

影响因子

    高级检索

    基于强化学习和微分对策的无人机集群动态自组织杀伤网

    Dynamic Self-organizing Kill Web for UAV Swarms Based on Reinforcement Learning and Differential Game Theory

    • 摘要: 针对高强度对抗环境下无人机集群杀伤网动态构建的难题,提出一种融合深度强化学习与微分对策的协同决策框架。该方法突破传统静态优化局限,实现杀伤网自主演化:瞬时决策引擎将战场态势转化为组合优化问题,生成周期性贪心策略样本,为强化学习提供价值基准;动态分布式杀伤网使每架无人机基于局部态势自主决策通信节点与打击目标,支撑全局任务导向的拓扑自组织与功能自修复;对抗性轨迹闭环设计为对抗双方提供差异化运动规划——进攻方采用混合虚拟势场法实现高速突防规避,防守方基于微分对策求解协同拦截最优轨迹,通过任务反馈机制联通决策层与规划层。该框架显著提升集群在强对抗、高动态场景下的体系韧性,为智能化战争提供可扩展的自主协同解决方案。

       

      Abstract: Addressing the challenge of dynamically constructing kill-webs for UAV swarms in high-intensity adversarial environments, a collaborative decision-making framework integrating deep reinforcement learning and differential game theory is proposed. This method breaks through the limitations of traditional static optimization, enabling autonomous evolution of the kill-web: an instantaneous decision-making engine transforms battlefield situational awareness into a combinatorial optimization problem, generating periodic greedy policy samples to provide a value baseline for reinforcement learning. The dynamic distributed kill-web allows each UAV to autonomously determine communication nodes and to strike targets based on local situational awareness, supporting global task-oriented topology self-organization and dynamic function self-recovery. An adversarial trajectory closed-loop design provides differentiated motion planning for opposing forces—the offensive side employs a hybrid virtual potential field method for high-speed penetration and evasion, while the defensive side solves optimal cooperative interception trajectories based on differential game theory. This is augmented by a task feedback mechanism connecting the decision-making layer with the motion planning layer. This framework significantly enhances swarm system resilience in highly adversarial and dynamic scenarios, providing a scalable autonomous collaborative solution for intelligent warfare.

       

    /

    返回文章
    返回