Learning to share and hide intentions using information regularization
Author(s)
Kleiman-Weiner, Max; Tenenbaum, Joshua B
Download: Published version (4.595 MB)
Abstract
Learning to cooperate with friends and compete with foes is a key component of multi-agent reinforcement learning. Typically, doing so requires access to either a model of, or interaction with, the other agent(s). Here we show how to learn effective strategies for cooperation and competition in an asymmetric-information game with no such model or interaction. Our approach is to encourage an agent to reveal or hide its intentions using an information-theoretic regularizer. We consider both the mutual information between goal and action given state and the mutual information between goal and state. We show how to optimize these regularizers in a way that integrates easily with policy-gradient reinforcement learning. Finally, we demonstrate that cooperative (competitive) policies learned with our approach lead to more (less) reward for a second agent in two simple asymmetric-information games.
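The two regularizers named in the abstract are the "action information" I(G; A | S), how much an agent's actions reveal its goal given the current state, and the "state information" I(G; S), how much the states it visits reveal the goal. As a rough illustration of how the action term could enter policy-gradient training, here is a minimal PyTorch sketch assuming a discrete action space and a known goal distribution. The class and function names, the sign convention for the coefficient beta, and the treatment of the information term as a detached reward bonus (rather than the paper's exact gradient derivation) are all illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GoalConditionedPolicy(nn.Module):
    """Categorical policy pi(a | s, g) over a discrete action set (illustrative)."""

    def __init__(self, state_dim: int, goal_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + goal_dim, hidden),
            nn.Tanh(),
            nn.Linear(hidden, n_actions),
        )

    def log_probs(self, state: torch.Tensor, goal: torch.Tensor) -> torch.Tensor:
        return F.log_softmax(self.net(torch.cat([state, goal], dim=-1)), dim=-1)


def action_info_term(policy, state, goals, goal_log_probs, goal_idx, action):
    """One-sample estimate of the integrand of I(G; A | S):
    log pi(a|s,g) - log sum_g' p(g') pi(a|s,g')."""
    log_pi_g = policy.log_probs(state, goals[goal_idx])[action]
    # Marginalize the policy over the (assumed known) goal distribution p(g).
    per_goal = torch.stack([policy.log_probs(state, g)[action] for g in goals])
    log_pi_marginal = torch.logsumexp(goal_log_probs + per_goal, dim=0)
    return log_pi_g - log_pi_marginal


def reinforce_loss(traj_log_probs, returns, info_terms, beta):
    """REINFORCE loss with the information term folded in as a reward bonus.
    Under this (assumed) sign convention, beta > 0 rewards revealing the goal
    through actions (cooperation) and beta < 0 rewards hiding it (competition)."""
    shaped = returns + beta * info_terms
    return -(traj_log_probs * shaped.detach()).mean()
```

The state-information term I(G; S) could be estimated in an analogous way, for example with a learned decoder from states to goals standing in for the exact posterior; that choice, too, would be an implementation assumption rather than something specified by the abstract.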
Date issued
2018-12
Department
Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
Journal
32nd Conference on Neural Information Processing Systems (NeurIPS 2018)
Publisher
Curran Associates
Citation
Strouse, D. J. et al. "Learning to share and hide intentions using information regularization." Paper presented at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, December 3-8, 2018. Curran Associates. © 2018 The Author(s)
Version: Final published version