/Motion Words for Videos

Motion Words for Videos

Ekaterina H. Taralova, Fernando De la Torre Frade and Martial Hebert
Conference Paper, European Conference on Computer Vision, January, 2014

Download Publication (PDF)

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.


In the task of activity recognition in videos, computing the video representation often involves pooling feature vectors over spatially local neighborhoods. The pooling is done over the entire video, over coarse spatio-temporal pyramids, or over pre-determined rigid cuboids. Similarly to pooling image features over superpixels in images, it is nat- ural to consider pooling spatio-temporal features over video segments, e.g., supervoxels. However, since the number of segments is variable, this produces a video representation of variable size. We propose Motion Words – a new, fixed size video representation, where we pool features over supervoxels. To segment the video into supervoxels, we explore two recent video segmentation algorithms. The proposed representation en- ables localization of common regions across videos in both space and time. Importantly, since the video segments are meaningful regions, we can interpret the proposed features and obtain a better understanding of why two videos are similar. Evaluation on classification and retrieval tasks on two datasets further shows that Motion Words achieves state- of-the-art performance.

BibTeX Reference
author = {Ekaterina H. Taralova and Fernando De la Torre Frade and Martial Hebert},
title = {Motion Words for Videos},
booktitle = {European Conference on Computer Vision},
year = {2014},
month = {January},
keywords = {Video representations, action classification},