Home/Efficient Visual Event Detection using Volumetric Features

Efficient Visual Event Detection using Volumetric Features

Yan Ke, Rahul Sukthankar and Martial Hebert
Conference Paper, Carnegie Mellon University, International Conference on Computer Vision, Vol. 1, pp. 166 - 173, October, 2005

Download Publication (PDF)

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.


This paper studies the use of volumetric features as an alternative to popular local descriptor approaches for event detection in video sequences. Motivated by the recent success of similar ideas in object detection on static images, we generalize the notion of 2D box features to 3D spatio-temporal volumetric features. This general framework enables us to do real-time video analysis. We construct a real-time event detector for each action of interest by learning a cascade of filters based on volumetric features that efficiently scans video sequences in space and time. This event detector recognizes actions that are traditionally problematic for interest point methods — such as smooth motions where insufficient space-time interest points are available. Our experiments demonstrate that the technique accurately detects actions on real-world sequences and is robust to changes in viewpoint, scale and action speed. We also adapt our technique to the related task of human action classification and confirm that it achieves performance comparable to a current interest point based human activity recognizer on a standard database of human activities.

BibTeX Reference
title = {Efficient Visual Event Detection using Volumetric Features},
author = {Yan Ke and Rahul Sukthankar and Martial Hebert},
booktitle = {International Conference on Computer Vision},
school = {Robotics Institute , Carnegie Mellon University},
month = {October},
year = {2005},
volume = {1},
pages = {166 - 173},
address = {Pittsburgh, PA},