Modeling Humans as Reinforcement Learners: How to Predict Human Behavior in Multi-Stage Games

Lee, Ritchie; Wolpert, David H.; Backhaus, Scott; Bent, Russell; Bono, James; Tracey, Brendan

This paper introduces a novel framework for modeling interacting humans in a multi-stage game environment by combining concepts from game theory and reinforcement learning. The proposed model has the following desirable characteristics: (1) Bounded rational players, (2) strategic (i.e., players account for one anothers reward functions), and (3) is computationally feasible even on moderately large real-world systems. To do this we extend level-K reasoning to policy space to, for the first time, be able to handle multiple time steps. This allows us to decompose the problem into a series of smaller ones where we can apply standard reinforcement learning algorithms. We investigate these ideas in a cyber-battle scenario over a smart power grid and discuss the relationship between the behavior predicted by our model and what one might expect of real human defenders and attackers.

Document ID

20120004027

Acquisition Source

Ames Research Center

Document Type

Conference Paper

Authors

Date Acquired

August 25, 2013

Publication Date

December 16, 2011

Subject Category

Report/Patent Number

Meeting Information

Meeting: Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS2011)

Location: Granada

Country: Spain

Start Date: December 11, 2011

End Date: December 17, 2011

Sponsors: Neural Information Processing Systems Foundation

Funding Number(s)

Distribution Limits

Public

Public Use Permitted.

Available Downloads

Name

Type

20120004027.pdf

STI

No Preview Available

NTRS

NTRS - NASA Technical Reports Server

Available Downloads

Related Records