2.118

影响因子

    高级检索

    基于图卷积神经网络的爱恩斯坦棋博弈策略建模

    EinStein W ü rfelt Nicht! Gaming Policy Modeling Based on Graph Convolutional Neural Networks

    • 摘要: 在智能博弈领域,爱恩斯坦棋移动策略需综合考虑随机掷骰、棋子位置及潜在交互关系,使得建模和预测难度较大。提出了一种基于图卷积神经网络构建策略网络,在蒙特卡洛树搜索选择和模拟阶段引导节点选择的方法。通过多个图卷积残差块提取局面特征,输出棋子移动的概率;结合启发式上限置信界树算法,形成基于图卷积策略网络引导的GP-UCT算法。实验表明,该算法对抗传统UCT和BetaPrune等基线算法,取得了至少60%的博弈胜率,验证了其在爱恩斯坦棋随机博弈中的有效性。

       

      Abstract: In the intelligent gaming field, EinStein würfelt nicht! gaming mobile policy should consider random dice rolls, chess piece positions, and potential interactions comprehensively, making the modeling and prediction more difficult. A strategy-building network based on the graph convolutional network (GCN) is proposed. A node selection method for the selection and simulation phases of Monte Carlo tree search (MCTS) is developed, and the situation features are extracted by multiple graph convolutional residual blocks and the probability of chess piece movement is output. By integrating the heuristic upper confidence bound for trees (heuristic UCT), the GP-UCT algorithm is formed based on the guidance of graph convolutional policy network. The experiment shows that the algorithm achieves at least a 60% win rate against the traditional UCT, BetaPrune and other baseline algorithms. The effectiveness of the proposed method in EinStein würfelt nicht! random gaming is verified.

       

    /

    返回文章
    返回