2.118

影响因子

    高级检索

    基于星上实时数据的卫星任务在线决策技术

    On-line Decision-making Technology for Satellite Missions Based on On-board Real-time Data

    • 摘要: 针对未来遥感卫星自主能力强、在轨协同任务多的需求,分析了在轨自主决策问题产生的机制和智能化模型的能力,建立了星地协同在轨任务决策的流程机制,设计了星上任务在线决策学习方法,通过设计卫星决策任务强化学习回报函数,明确强化学习中收益值,通过设计实现基于深度Q-网络的卫星智能任务需求决策过程,获得可以用于星上自主决策的决策列表。实验结果表明,在12种规则的情况下,任务属性配置准确率均达到80%以上。将深度强化学习应用于任务规划与资源分配,从而实现遥感卫星协同观测、任务协同、信息引导等卫星综合利用能力,也对进一步拓展强化学习理论和方法的应用领域具有重要的理论价值。

       

      Abstract: To meet the needs of future remote sensing satellites for strong autonomy and too many on-orbit collaboration missions. The mechanisms caused by on-orbit autonomous decision-making and intelligent model capability are analyzed, then flow mechanism of a satellite-ground coordination on-orbit decision-making process is established and an on-board decision-making learning method for satellite mission is designed. The learning return function is reinforced by designing the decision-making mission of satellites. The payoff value in reinforcement learning is clarified. The intelligent mission requirement decision-making process of satellites based on a deep Qnetwork-based approach is designed and implemented, a decision-making list is obtained for the autonomous decision-making of satellites. The experiment results show that under the circumstance of 12 kinds of rules, the accuracy of mission attribute configuration reaches over 80%. The reinforcement learning is applied in the mission planning and resource allocation so as to realize such comprehension utilization capabilities as coordination observation, mission coordination, information guidance and others of remote sensing satellites. This research has important theory value for further expanding reinforcement learning theory and the application field of the method.

       

    /

    返回文章
    返回