The Robotics Institute
Search the site
RI | Publications | Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database

Text only version of this site

Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database
F. De la Torre Frade, J.K. Hodgins, A.W. Bargteil, X. Martin Artal, J.C. Macey, A. Collado I Castells, and J. Beltran
tech. report CMU-RI-TR-08-22, Robotics Institute, Carnegie Mellon University, April, 2008.

Jump to: Download | Abstract | Text Reference | BibTeX Reference

Download [Help]

Adobe portable document format (pdf) [7543 KB]

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

This document summarizes the technology, procedures, and database organization of the CMU Multi-Modal Activity Database (CMU-MMAC). The CMU-MMAC database contains multimodal measures of the human activity of subjects performing the tasks involved in cooking and food preparation. The CMU-MMAC database was collected in Carnegie Mellon University’s Motion Capture Lab. A kitchen was built and to date five subjects have been recorded cooking five different recipes: brownies,pizza, sandwich, salad and scrambled eggs. The following modalities were recorded: • Video: (1) Three high spatial resolution (1024 × 768) color video cameras at low temporal resolution (30 Hertz). (2) Two low spatial resolution (640 × 480) color video cameras at high temporal resolution (60 Hertz). (3) One wearable low spatial resolution (640×480) camera at low temporal resolution (12 Hertz). • Audio: (1) Five balanced microphones. (2) Wearable watch. • Motion capture: A Vicon motion capture system with 12 infrared MX-40 cameras. Each camera records images of 4 megapixel resolution at 120 Hertz. • Five 3-axis accelerometers and gyroscopes. Several computers were used for recording the various modalities. The computers were synchronized using the Network Time Protocol (NTP).

Text Reference

F. De la Torre Frade, J.K. Hodgins, A.W. Bargteil, X. Martin Artal, J.C. Macey, A. Collado I Castells, and J. Beltran, Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database, tech. report CMU-RI-TR-08-22, Robotics Institute, Carnegie Mellon University, April, 2008.

BibTeX Reference

@techreport{De la Torre Frade_2008_6085,
   author = "Fernando De la Torre Frade and Jessica K Hodgins and Adam W Bargteil and Xavier Martin Artal and Justin C Macey and Alexandre Collado I Castells and Josep Beltran",
   title = "Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database",
   institution = "Robotics Institute, Carnegie Mellon University",
   month = "April",
   year = "2008",
   number = "CMU-RI-TR-08-22",
   address = "Pittsburgh, PA"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.
For updates and comments, please see these instructions.
This page maintained by robotwebmaster@ri.cmu.edu