2.118

影响因子

    高级检索

    基于强化学习的作战指挥智能决策方法

    An Intelligent Decision-making Method for Operational Command Based on Reinforcement Learning

    • 摘要: 为提高作战指挥决策效能,提出一种基于强化学习的作战指挥智能决策方法。在基于Q-learning算法构造作战辅助决策模型的基础上,对模型结构中的状态空间、动作空间和综合环境进行设计,并给出兵力和部队整体士气状态变化规则,以及不同动作发生时的状态修正原则。从歼敌数量、兵力损失和整体士气变化情况出发,设计了决策模型奖励函数。基于典型作战场景和俄乌冲突经典战役对所提决策模型进行了仿真测试,结果表明:该模型可输入双方部队基本情况和战场综合环境训练得到决策Q表,依据不同动作Q值大小所提决策建议与实际作战决策具有较强的一致性,可有效辅助指挥员进行作战决策。

       

      Abstract: To improve the effectiveness of operational command decision-making, an intelligent decision-making method for operational command based on reinforcement learning is proposed. On the basis of constructing the combat aided decision-making model based on Q-learning algorithm, the state space, action space and integrated environment in the model structure are designed, and the rules for the changes of the states of the forces and the overall morale of the troops are given, and the principles of the state correction when different actions occur are also defined. Then, the reward function of the decision-making model is designed from the number of annihilated enemies, the loss of forces and the change of overall morale. Finally, the proposed decision-making model is simulated and tested in a classic battle scenario and the classic battle of the Russo-Ukrainian conflict. The experimental results show that the model can input the basic situation of both sides' troops and the comprehensive environment of the battlefield training to obtain the decision Q table, and the decision suggestions based on the Q values of different actions have a strong consistency with the actual combat decision-making, which can effectively assist commanders in making combat decisions.

       

    /

    返回文章
    返回