高速飞行器追逃博弈决策技术

崔雅萌; 王会霞; 郑春胜; 胡瑞光

doi:10.3969/j.issn.2096-0204.2021.04.0403

高速飞行器追逃博弈决策技术

Pursuit-Evasion Game Decision Technology of High Speed Vehicles

摘要

摘要: 红方单个飞行器面对蓝方多个拦截器时, 受到飞行器个体能力以及突防手段的限制, 难以突防成功. 针对此问题, 基于强化学习设计了一种可躲避蓝方多个拦截器的智能博弈策略. 建立红方飞行器的数学模型, 包含抛撒诱饵、机动调整、姿态调整等行为; 建立蓝方飞行器数学模型, 蓝方采用比例导引法拦截红方; 设计了基于深度确定性策略梯度的飞行器逃逸算法, 为了加快智能体的学习速度和算法收敛速度, 用先验知识进行预训练和优先经验回放机制相结合的方式进行算法训练. 仿真结果表明该算法可使红方飞行器面对蓝方多个拦截器时成功逃逸.

Abstract: Due to the limitation of individual capability and penetration means, a single red aircraft is difficult to penetrate successfully from multiple blue interceptors. To solve this problem, an intelligent game strategy based on reinforcement learning is designed. Firstly, the mathematical model of the red aircraft is established, including throwing bait, maneuvering adjustment, attitude adjustment, etc. Secondly, the mathematical model of the blue aircraft is established, and the blue intercepts the red with the proportional guidance method. Next, a flight escape algorithm based on deep deterministic policy gradient is designed. Then, by combining the prior knowledge pre-training and the preferred experience playback mechanism, the learning speed and the convergence speed of the algorithm are accelerated. The simulation results show that the proposed algorithm can make the red aircraft penetrate successfully when facing the blue interceptors.

HTML全文

参考文献(0)

施引文献

资源附件(0)