2.118

影响因子

    高级检索

    面向具身智能机器人的快速精准抓取位姿估计

    Fast and Accurate Grasping Pose Estimation for Embodied Intelligent Robots

    • 摘要: 物体抓取位姿估计是具身机器人的一项重要视觉任务。大多数6自由度抓取位姿估计方法主要以点云进行抓取位姿估计,忽略了物体的纹理信息,并且计算过程较为耗时。针对这一问题,提出一种快速高效的抓取位姿预测方法。通过构建抓取区域预测模块,预测RGB−D 图像中的可抓取区域和6自由度抓取的部分参数,并构建非均匀抓取位姿搜索算法,提升抓取位姿的准确性与多样性。为实现机器人的具身性,结合视觉语言大模型,构建具身机器人抓取位姿预测系统,使得机器人能够准确理解人类指令并完成物体抓取任务。实验结果表明,所提方法同时兼顾实时性与准确性,能够有效应用至人机交互场景下的机器人抓取任务。

       

      Abstract: Object grasping pose estimation is an important visual task for embodied robots. Most 6-degree-of-freedom (6-DoF) grasping pose estimation methods mainly rely on point clouds for grasping pose estimation, neglecting the texture information of the object, and the computation process is time-consuming. To address this issue, a fast and efficient grasping pose prediction method is proposed. A grasping region prediction module is built to predict the grasping regions and some of the 6-DoF grasp parameters in the RGB-D images. Then, a non-uniform grasping pose search algorithm is established to improve the accuracy and diversity of the grasping pose. To achieve robot embodiment, a visual-language large model is integrated to build an embodied robot grasping pose prediction system, enabling the robot to accurately understand human instructions and perform object grasping tasks. The experimental results show that the proposed method balances both real-time performance and accuracy, and can be effectively applied to robot grasping tasks in human-robot interaction scenarios.

       

    /

    返回文章
    返回