General Game Playing with Stochastic CSP

Koriche, Frédéric; Lagrue, Sylvain; Piette, Eric; Tabary, Sébastien

DIAL.pr - BOREAL

Accès à distance ? S'identifier sur le proxy UCLouvain

General Game Playing with Stochastic CSP

Primary tabs

download

cp15.pdf

Open access
PDF
68.42 K

Koriche, Frédéric Lagrue, Sylvain Piette, Eric [UCL] Tabary, Sébastien

The aim of General Game Playing (GGP) is to devise game playing algorithms which are not dedicated to a particular strategic game, but are general enough to effectively play a wide variety of games. A tournament is held every year by AAAI, in which artificial game players are supplied the rules of arbitrary new games and, without human intervention, have to play these game optimally. Games rules are described in a declarative representation language, called GDL for Game Description Language. The latest version of this language is expressive enough to describe finite multi-player games with uncertain and incomplete information. GGP algorithms include, among others, answer set programming methods, automated construction of evaluation functions, and Monte-Carlo methods such as Upper Confidence bounds for Trees (UCT). Beyond its play value, GGP offers a rigorous setting for modelling and analysing sequential decision-making algorithms in multi-agent environments. By providing a declarative approach for representing and solving combinatorial problems, Constraint Programming appears as a promising technology to address the GGP challenge. Currently, several constraint-based formalisms have already been proposed to model and solve games; they include notably Quantified CSP, Strategic CSP and Constraint Games. Most of these formalisms are, however, restricted to deterministic, perfect information games: during each round of the game, players have full access to the current state and their actions have deterministic effects. This paper focuses on stochastic games, with chance events, using the framework of stochastic constraint networks. More precisely, we study a fragment of the Stochastic Constraint Satisfaction Problem (SCSP), that captures GDL games with uncertain (but complete) information. Interestingly, the SCSP for this class of games can be decomposed into a sequence of mu-SCSPs (a.k.a one-stage stochastic constraint satisfaction problems). Based on this decomposition, we propose a sequential decision-making algorithm, MAC-UCB, that combines the MAC algorithm (Maintaining Arc Consistency) for solving each mu-SCSP, and the multi-armed bandits Upper Confidence Bound (UCB) method for approximating the expected utility of strategies. We show that in practice MAC-UCB significantly outperforms (the multi-player version of) UCT, which is the state-of-the-art GGP algorithm for stochastic games. MAC-UCB also dominates FC-UCB, a variant where MAC is replaced with the Forward Checking (FC) algorithm. Such conclusions are drawn from comparing the performance of these algorithms, using extensive experiments (about 1,350,000 face-offs) over a wide range of GDL games.

metadata

Document type	Communication à un colloque (Conference Paper) – Conférence invitée / Keynote, Abstract
Access type	Accès libre
Publication date	2015
Language	Anglais
Conference	"Principles and Practice of Constraint Programming", Cork, Ireland (du 31/08/2015 au 04/09/2015)
Peer reviewed	yes
Host document	"Principles and Practice of Constraint Programming: 21st International Conference"- p. 727 (ISBN : 978-3319232188)
Publication status	Publié
Affiliation	UCL - SST/ICTM/INGI - Pôle en ingénierie informatique
Links	http://piette.info/eric/papers/cp15.pdf http://hdl.handle.net/2078/276812[Handle]

Bibliographic reference	Koriche, Frédéric ; Lagrue, Sylvain ; Piette, Eric ; Tabary, Sébastien. General Game Playing with Stochastic CSP.Principles and Practice of Constraint Programming (Cork, Ireland, du 31/08/2015 au 04/09/2015). In: Principles and Practice of Constraint Programming: 21st International Conference, 2015, p. 727
Permanent URL	http://hdl.handle.net/2078/276812

User menu

General Game Playing with Stochastic CSP

Primary tabs

Footer Help

Languages

Footer menu

User menu

Search form

You are here

General Game Playing with Stochastic CSP

Primary tabs

Footer Help

Languages

Footer menu