Robot Perception and Learning: MIT CSAIL report: Latent-Dynamic Discriminative Models for Continuous Gesture Recognition

Authors: Morency, Louis-Philippe; Quattoni, Ariadna; Darrell, Trevor
Issue Date: 7-Jan-2007

Abstract: Many problems in vision involve the prediction of a class label for each frame in an unsegmented sequence. In this paper we develop a discriminative framework for simultaneous sequence segmentation and labeling which can capture both intrinsic and extrinsic class dynamics. Our approach incorporates hidden state variables which model the sub-structure of a class sequence and learn the dynamics between class labels. Each class label has a disjoint set of associated hidden states, which enables efficient training and inference in our model. We evaluated our method on the task of recognizing human gestures from unsegmented video streams and performed experiments on three different datasets of head and eye gestures. Our results demonstrate that our model for visual gesture recognition outperforms models based on Support Vector Machines, Hidden Markov Models, and Conditional Random Fields.
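To make the disjoint hidden-state idea a little more concrete, here is a minimal sketch of how per-frame labels could be read off such a model. It assumes the per-frame marginals over hidden states have already been computed (for instance by forward-backward over the hidden-state chain), which the abstract does not spell out. The function names, the two labels, the state indices, and all probabilities are made up for illustration and are not taken from the report.

```python
# Hypothetical illustration: names, labels, and numbers are assumptions,
# not values or code from the report.

def label_marginals(hidden_marginals, states_of_label):
    """Sum per-frame hidden-state marginals over each label's state set."""
    # The sets must be disjoint, so each hidden state belongs to one label.
    seen = set()
    for states in states_of_label.values():
        if seen & set(states):
            raise ValueError("hidden-state sets must be disjoint")
        seen |= set(states)
    return [
        {label: sum(frame[s] for s in states)
         for label, states in states_of_label.items()}
        for frame in hidden_marginals
    ]


def label_frames(hidden_marginals, states_of_label):
    """Pick the most probable label for every frame."""
    return [max(m, key=m.get)
            for m in label_marginals(hidden_marginals, states_of_label)]


# Two labels, each owning two hidden states (assumed layout).
STATES_OF_LABEL = {"nod": [0, 1], "other": [2, 3]}

# One row per frame, one column per hidden state (made-up marginals).
HIDDEN_MARGINALS = [
    [0.50, 0.30, 0.10, 0.10],
    [0.40, 0.20, 0.20, 0.20],
    [0.10, 0.10, 0.50, 0.30],
]

PREDICTED = label_frames(HIDDEN_MARGINALS, STATES_OF_LABEL)
print(PREDICTED)
```

Because every hidden state belongs to exactly one label, a label's probability at a frame is just the sum over its own states, and a change of label between consecutive frames marks a segment boundary. That is how labeling and segmentation can come out of the same pass in this sketch.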


Thursday, January 11, 2007
