Dennis J.N.J. Soemers
Vegard Mella
Eric Piette (UCL)
Matthew Stephenson
Cameron Browne
Olivier Teytaud
Transferring trained policies and value functions from one task to another, such as from one game to another with a different board size, board shape, or more substantial rule changes, is a challenging problem. Popular benchmarks for reinforcement learning (RL), such as Atari games and ProcGen, have limited variety, especially in terms of action spaces. Due to a focus on such benchmarks, the development of transfer methods that can also handle changes in action spaces has received relatively little attention. Furthermore, we argue that progress towards more general methods should include benchmarks where new problem instances can be described by domain experts, rather than machine learning experts, using convenient, high-level domain-specific languages (DSLs). In addition to enabling end users to more easily describe their problems, user-friendly DSLs also contain relevant task information which can be leveraged to make effective zero-shot transfer plausibly achievable. As an example, we use the Ludii general game system, which includes a highly varied set of over 1000 distinct games described in such a language. We propose a simple baseline approach for transferring fully convolutional policy-value networks, which are used to guide search agents similar to AlphaZero, between any pair of games modelled in this system. Extensive results, including various cases of highly successful zero-shot transfer, are provided for a wide variety of source and target games.
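The abstract's key architectural point is that fully convolutional policy-value networks have no parameters tied to the board dimensions, which is what makes transfer between games with different board sizes possible at all. A minimal pure-Python sketch (the kernel values and board encoding are illustrative assumptions, not the paper's actual network) shows why: the same fixed-size kernel yields one policy logit per cell for any H×W board.

```python
# Minimal sketch of the fully-convolutional idea: one shared 3x3 kernel,
# applied with zero padding, produces an output of the same spatial size
# as its input. Because the kernel's parameter count is independent of
# the board dimensions, the identical weights can be reused on a 5x5
# board and a 9x9 board alike. Weights and inputs here are arbitrary
# illustrative values, not the paper's trained network.

def conv2d_same(board, kernel):
    """Apply a single 3x3 kernel with zero padding ('same' output size)."""
    h, w = len(board), len(board[0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            s = 0.0
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < h and 0 <= cc < w:
                        s += board[rr][cc] * kernel[dr + 1][dc + 1]
            out[r][c] = s
    return out

# One shared kernel (arbitrary values standing in for trained weights).
kernel = [[0.1, 0.2, 0.1],
          [0.2, 1.0, 0.2],
          [0.1, 0.2, 0.1]]

for size in (5, 9):  # e.g. the "same" game played on two board sizes
    board = [[1.0 if (r + c) % 2 == 0 else 0.0 for c in range(size)]
             for r in range(size)]
    logits = conv2d_same(board, kernel)
    # One logit per board cell, whatever the size: no reshaping or
    # retraining of the convolutional weights is required.
    assert len(logits) == size and len(logits[0]) == size
```

The same reasoning extends to the value head if it is built from spatially global operations (e.g. global average pooling) rather than a flattening step whose weight matrix would hard-code a particular board size.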
Bibliographic reference
Dennis J.N.J. Soemers; Vegard Mella; Eric Piette; Matthew Stephenson; Cameron Browne; et al. Towards a General Transfer Approach for Policy-Value Networks. In: Transactions on Machine Learning Research (2023)
Permanent URL
http://hdl.handle.net/2078.1/281298