Article,

A game strategy model in the digital curling system based on NFSP

, , and .
Complex & Intelligent Systems, 8 (3): 1857--1863 (Jun 1, 2022)
DOI: 10.1007/s40747-021-00345-6

Abstract

The digital curling game is a two-player zero-sum extensive game in a continuous action space. There are some challenging problems that are still not solved well, such as the uncertainty of strategy, the large game tree searching, and the use of large amounts of supervised data, etc. In this work, we combine NFSP and KR-UCT for digital curling games, where NFSP uses two adversary learning networks and can automatically produce supervised data, and KR-UCT can be used for large game tree searching in continuous action space. We propose two reward mechanisms to make reinforcement learning converge quickly. Experimental results validate the proposed method, and show the strategy model can reach the Nash equilibrium.

Tags

Users

  • @cckonstanz

Comments and Reviews