Conferences >2008 IEEE Workshop on Motion ...

Spatial-Temporal correlatons for unsupervised action classification

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Spatial-temporal local motion features have shown promising results in complex human action classification. Most of the previous works [6],[16],[21] treat these spatial-t...Show More

Metadata

Abstract:

Spatial-temporal local motion features have shown promising results in complex human action classification. Most of the previous works [6],[16],[21] treat these spatial-temporal features as a bag of video words, omitting any long range, global information in either the spatial or temporal domain. Other ways of learning temporal signature of motion tend to impose a fixed trajectory of the features or parts of human body returned by tracking algorithms. This leaves little flexibility for the algorithm to learn the optimal temporal pattern describing these motions. In this paper, we propose the usage of spatial-temporal correlograms to encode flexible long range temporal information into the spatial-temporal motion features. This results into a much richer description of human actions. We then apply an unsupervised generative model to learn different classes of human actions from these ST-correlograms. KTH dataset, one of the most challenging and popular human action dataset, is used for experimental evaluation. Our algorithm achieves the highest classification accuracy reported for this dataset under an unsupervised learning scheme.

Published in: 2008 IEEE Workshop on Motion and video Computing

Date of Conference: 08-09 January 2008

Date Added to IEEE Xplore: 17 June 2008

ISBN Information:

DOI: 10.1109/WMVC.2008.4544068

Conference Location: Copper Mountain, CO, USA

Contents

1. Introduction

Accurate human action classification is a fundamental problem in computer vision as well as an active field of research in recent years. However, it still remains a challenging task for computers to achieve robust action recognition due to cluttered background, camera motion, occlusion and geometric and photometric variances of the foreground per-sonts). A good example is shown in Fig. I. In this dataset, many different subjects perform the same action (e.g. walking, or hand waving) against different background (e.g. in-door, outdoor), recorded by a moving camera (e.g. zoom in and out).

References is not available for this document.

MIT Libraries

MIT Libraries

Spatial-Temporal correlatons for unsupervised action classification

Abstract:

Metadata

Abstract:

1. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Spatial-Temporal correlatons for unsupervised action classification

Alerts

Abstract:

Metadata

Abstract:

1. Introduction

References