2.118

影响因子

    高级检索

    大模型驱动的迭代具身学习框架

    Iterative Embodied Learning Framework Driven by Large Language Model

    • 摘要: 针对现有具身学习方法存在跨场景泛化能力弱、探索效率低、策略规划易陷入局部最优等问题,提出了大模型驱动的迭代具身学习新范式,以大模型为决策核心构建闭环反馈系统,将具身学习问题解耦为高层规划与低级路径执行的协同。所提框架依托大模型,基于感知提示生成优化训练环境选择的环境选择策略和指导智能体探索环境的探索策略,迭代采集数据逐步提升感知性能。实验表明,该框架能显著提高感知模型对环境的理解能力。

       

      Abstract: The existing embodied learning approaches suffer from limited cross-scene generalization, low exploration efficiency, and local optimal policy planning and other problems. To solve and alleviate these issues, a novel large language model (LLM) based iterative embodied learning paradigm is proposed, which establishes a closed-loop feedback system with the LLM as the core decision-maker, decoupling the complex embodied exploration problem into a synergy of high-level planning and low-level path execution. The proposed framework employs the LLM to generate an environment selection policy for optimizing training environment choice and an exploration policy to guide the agent in exploring environments based on the perception prompt. Through iterative data collection, the perception performance is gradually improved. The experiments demonstrate that the proposed framework significantly enhances the environmental understanding ability of the model.

       

    /

    返回文章
    返回