Robust Full-Motion Recovery of Head by Dynamic Templates and Re-registration Techniques
J. Xiao, T. Moriyama, T. Kanade, and J. Cohn
International Journal of Imaging Systems and Technology, Vol. 13, September, 2003, pp. 85 - 94.

Abstract

This paper presents a method to recover the full motion of the head (three rotations and three translations) from an input video using a cylindrical head model. Given an initial reference template of the head image and the corresponding head pose, the head model is created and full head motion is recovered automatically. The robustness of the approach is achieved by a combination of three techniques. First, we use the iteratively re-weighted least squares (IRLS) technique in conjunction with the image gradient to accommodate non-rigid motion and occlusion. Second, while tracking, the templates are dynamically updated to diminish the effects of self-occlusion and gradual lighting changes and to maintain accurate tracking even when the face moves out of view of the camera. Third, to minimize the error accumulation inherent in the use of dynamic templates, we re-register images to a reference template whenever the head pose is close to that in the template. The performance of the method, which runs in real time, was evaluated in three separate experiments using image sequences (both synthetic and real) for which ground-truth head motion was known. The real sequences included pitch and yaw as large as 40° and 75°, respectively. The average recovery accuracy of the 3D rotations was about 3°. In a further test, the method was used as part of a facial expression analysis system intended for use with spontaneous facial behavior, in which moderate head motion is common. Image data consisted of 1 minute of video from each of 10 subjects engaged in a two-person interview. The method successfully stabilized face and eye images, allowing for 98% accuracy in automatic blink recognition.
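
The IRLS idea named in the abstract can be illustrated on a much simpler problem than head tracking. The sketch below is not the paper's implementation: it fits a straight line rather than six motion parameters, and the Huber weight function, the MAD-based scale estimate, and all function names (`huber_weight`, `weighted_line_fit`, `irls_line_fit`) are assumptions chosen for illustration. What it shows is the general mechanism: solve a weighted least-squares problem, down-weight samples with large residuals (the analogue of occluded or non-rigidly moving pixels), and repeat.

```python
def huber_weight(residual, scale, k=1.345):
    """Huber weight: 1 inside the cutoff, cutoff/|r| outside it (assumed choice)."""
    cutoff = k * scale
    magnitude = abs(residual)
    return 1.0 if magnitude <= cutoff else cutoff / magnitude


def median(values):
    """Median of a non-empty sequence."""
    ordered = sorted(values)
    mid = len(ordered) // 2
    if len(ordered) % 2:
        return ordered[mid]
    return 0.5 * (ordered[mid - 1] + ordered[mid])


def weighted_line_fit(xs, ys, ws):
    """Closed-form weighted least squares for y = slope * x + intercept."""
    sw = sum(ws)
    sx = sum(w * x for w, x in zip(ws, xs))
    sy = sum(w * y for w, y in zip(ws, ys))
    sxx = sum(w * x * x for w, x in zip(ws, xs))
    sxy = sum(w * x * y for w, x, y in zip(ws, xs, ys))
    det = sw * sxx - sx * sx
    slope = (sw * sxy - sx * sy) / det
    intercept = (sy - slope * sx) / sw
    return slope, intercept


def irls_line_fit(xs, ys, iterations=20):
    """Iteratively re-weighted least squares with Huber weights."""
    ws = [1.0] * len(xs)  # first pass is ordinary least squares
    slope, intercept = weighted_line_fit(xs, ys, ws)
    for _ in range(iterations):
        residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
        # Robust scale from the median absolute residual (assumed choice).
        scale = 1.4826 * median([abs(r) for r in residuals])
        if scale < 1e-12:
            break  # residuals are essentially zero; nothing left to re-weight
        ws = [huber_weight(r, scale) for r in residuals]
        slope, intercept = weighted_line_fit(xs, ys, ws)
    return slope, intercept


# Points on y = 2x + 1 with one gross outlier standing in for an occluded sample.
xs = [float(i) for i in range(10)]
ys = [2.0 * x + 1.0 for x in xs]
ys[3] = 40.0
print(irls_line_fit(xs, ys))
```

In this toy setting the outlier's weight shrinks at every pass, so the estimate moves from the ordinary least-squares fit toward the line defined by the remaining points. In a tracker, the residuals would instead be per-pixel differences between the template and the warped image.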

Notes

Associated center: VASC
Associated labs/groups: Face Group and Human Sensing
Associated project: Facial Expression Analysis

Number of pages: 18

Text Reference

J. Xiao, T. Moriyama, T. Kanade, and J. Cohn, "Robust Full-Motion Recovery of Head by Dynamic Templates and Re-registration Techniques," International Journal of Imaging Systems and Technology, Vol. 13, September, 2003, pp. 85 - 94.

BibTeX Reference

@article{Xiao_2003_4504,
   author = "Jing Xiao and Tsuyoshi Moriyama and Takeo Kanade and Jeffrey Cohn",
   title = "Robust Full-Motion Recovery of Head by Dynamic Templates and Re-registration Techniques",
   journal = "International Journal of Imaging Systems and Technology",
   month = "September",
   year = "2003",
   volume = "13",
   pages = "85 - 94"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.