Title:
Learning to Recognize Daily Actions using Gaze
Author(s)
Fathi, Alireza
Li, Yin
Rehg, James M.
Abstract
We present a probabilistic generative model for simultaneously recognizing daily actions and predicting gaze locations in videos recorded from an egocentric camera. We focus on activities requiring eye-hand coordination and model the spatio-temporal relationship between the gaze point, the scene objects, and the action label. Our model captures the fact that the distribution of both visual features and object occurrences in the vicinity of the gaze point is correlated with the verb-object pair describing the action. It explicitly incorporates known properties of gaze behavior from the psychology literature, such as the temporal delay between fixation and manipulation events. We present an inference method that can predict the best sequence of gaze locations and the associated action label from an input sequence of images. We demonstrate improvements in action recognition rates and gaze prediction accuracy relative to state-of-the-art methods, on two new datasets that contain egocentric videos of daily activities and gaze.
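The abstract's inference step, predicting a best gaze-location sequence jointly with an action label, can be illustrated with a simple dynamic-programming sketch. This is not the authors' model: it assumes a discretized 1-D gaze grid, precomputed per-frame log-likelihoods `emission[a, t, g]` for each action `a`, frame `t`, and gaze cell `g`, and a Gaussian smoothness penalty between consecutive gaze cells, all of which are illustrative simplifications.

```python
import numpy as np

def infer_action_and_gaze(emission, transition_sigma=1.0):
    """Viterbi-style joint inference (illustrative sketch, not the paper's model).

    emission: array of shape (A, T, G) with log-likelihoods for action a,
              frame t, gaze cell g (gaze cells form a 1-D grid here).
    Returns (best_action, gaze_path) maximizing emission score plus a
    Gaussian smoothness term between consecutive gaze cells.
    """
    A, T, G = emission.shape
    cells = np.arange(G)
    # log of an (unnormalized) Gaussian transition between gaze cells
    trans = -((cells[:, None] - cells[None, :]) ** 2) / (2 * transition_sigma ** 2)

    best_score, best_action, best_path = -np.inf, None, None
    for a in range(A):
        dp = emission[a, 0].copy()          # best score ending at each cell
        back = np.zeros((T, G), dtype=int)  # backpointers for path recovery
        for t in range(1, T):
            scores = dp[:, None] + trans    # scores[prev, cur]
            back[t] = scores.argmax(axis=0)
            dp = scores.max(axis=0) + emission[a, t]
        g = int(dp.argmax())
        if dp[g] > best_score:
            path = [g]
            for t in range(T - 1, 0, -1):   # trace backpointers
                g = int(back[t, g])
                path.append(g)
            best_score, best_action, best_path = dp[int(dp.argmax())], a, path[::-1]
    return best_action, best_path
```

In the paper, the gaze point and action label are coupled through scene objects and known gaze behavior (e.g., the fixation-manipulation delay); the sketch above only captures the bare structure of maximizing over a label and a smooth location sequence at once.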
Date Issued
2012-10
Resource Type
Text
Resource Subtype
Article
Proceedings