A game strategy model in the digital curling system based on NFSP

Abstract

The digital curling game is a two-player zero-sum extensive game in a continuous action space. There are some challenging problems that are still not solved well, such as the uncertainty of strategy, the large game tree searching, and the use of large amounts of supervised data, etc. In this work, we combine NFSP and KR-UCT for digital curling games, where NFSP uses two adversary learning networks and can automatically produce supervised data, and KR-UCT can be used for large game tree searching in continuous action space. We propose two reward mechanisms to make reinforcement learning converge quickly. Experimental results validate the proposed method, and show the strategy model can reach the Nash equilibrium.

BibTeX key: Han2022
entry type: article
year: 2022
month: jun
day: 01
journal: Complex & Intelligent Systems
number: 3
pages: 1857--1863
volume: 8
issn: 2198-6053
DOI: 10.1007/s40747-021-00345-6
url: https://doi.org/10.1007/s40747-021-00345-6

BibSonomy

A game strategy model in the digital curling system based on NFSP

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on