Robust Full-Motion Recovery of Head by Dynamic Templates and Re-registration Techniques

Jing Xiao, Tsuyoshi Moriyama, Takeo Kanade, and Jeffrey Cohn
International Journal of Imaging Systems and Technology, Vol. 13, September, 2003, pp. 85 - 94.



Abstract
This paper presents a method to recover the full motion (3 rotations and 3 translations) of the head from an input video using a cylindrical head model. Given an initial reference template of the head image and the corresponding head pose, the head model is created and full head motion is recovered automatically. The robustness of the approach is achieved by a combination of three techniques. First, we use the iteratively re-weighted least squares (IRLS) technique in conjunction with the image gradient to accommodate non-rigid motion and occlusion. Second, while tracking, the templates are dynamically updated to diminish the effects of self-occlusion and gradual lighting changes and to maintain accurate tracking even when the face moves out of view of the camera. Third, to minimize the error accumulation inherent in the use of dynamic templates, we re-register images to a reference template whenever the head pose is close to that in the template. The performance of the method, which runs in real time, was evaluated in three separate experiments using image sequences (both synthetic and real) for which ground-truth head motion was known. The real sequences included pitch and yaw as large as 40° and 75°, respectively. The average recovery accuracy of the 3D rotations was about 3°. In a further test, the method was used as part of a facial expression analysis system intended for use with spontaneous facial behavior, in which moderate head motion is common. Image data consisted of 1 minute of video from each of 10 subjects engaged in a 2-person interview. The method successfully stabilized face and eye images, allowing for 98% accuracy in automatic blink recognition.
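
The general idea behind iteratively re-weighted least squares can be illustrated with a minimal sketch. This is not the authors' implementation: the paper applies IRLS to head-motion estimation with image gradients, whereas the toy example below fits a line to points that include one outlier. The function names, the Huber weight function, and the median-based scale estimate are all assumptions chosen for illustration and do not come from the abstract.

```python
def weighted_line_fit(xs, ys, ws):
    """Closed-form weighted least squares for the line y = a*x + b."""
    sw = sum(ws)
    sx = sum(w * x for w, x in zip(ws, xs))
    sy = sum(w * y for w, y in zip(ws, ys))
    sxx = sum(w * x * x for w, x in zip(ws, xs))
    sxy = sum(w * x * y for w, x, y in zip(ws, xs, ys))
    det = sw * sxx - sx * sx
    a = (sw * sxy - sx * sy) / det
    b = (sxx * sy - sx * sxy) / det
    return a, b


def huber_weight(residual, scale, k=1.345):
    """Huber weight (an assumed choice): 1 for small residuals, k/r beyond k."""
    r = abs(residual) / scale
    return 1.0 if r <= k else k / r


def irls_line_fit(xs, ys, iterations=25):
    """Alternate between a weighted fit and re-weighting by residual size."""
    ws = [1.0] * len(xs)  # first pass is ordinary least squares
    a, b = 0.0, 0.0
    for _ in range(iterations):
        a, b = weighted_line_fit(xs, ys, ws)
        residuals = [y - (a * x + b) for x, y in zip(xs, ys)]
        # Robust scale from the (upper) median absolute residual.
        abs_sorted = sorted(abs(r) for r in residuals)
        scale = max(1.4826 * abs_sorted[len(abs_sorted) // 2], 1e-9)
        # Points that disagree with the current fit get smaller weights.
        ws = [huber_weight(r, scale) for r in residuals]
    return a, b, ws


# Toy data: nine points on y = 2x + 1 plus one gross outlier at x = 9.
xs = [float(i) for i in range(10)]
ys = [2.0 * x + 1.0 for x in xs]
ys[9] += 30.0

ols_a, ols_b = weighted_line_fit(xs, ys, [1.0] * len(xs))
irls_a, irls_b, weights = irls_line_fit(xs, ys)
print("ordinary least squares:", ols_a, ols_b)
print("IRLS:", irls_a, irls_b)
```

In this toy setting the outlier's weight shrinks on each pass, so it has progressively less influence on the fit. In a tracking context the same mechanism would, under these assumptions, reduce the influence of pixels affected by occlusion or non-rigid motion.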

Notes
Associated Center(s) / Consortia: Vision and Autonomous Systems Center
Associated Lab(s) / Group(s): Face Group and Component Analysis
Associated Project(s): Facial Expression Analysis
Number of pages: 18

Text Reference
Jing Xiao, Tsuyoshi Moriyama, Takeo Kanade, and Jeffrey Cohn, "Robust Full-Motion Recovery of Head by Dynamic Templates and Re-registration Techniques," International Journal of Imaging Systems and Technology, Vol. 13, September, 2003, pp. 85 - 94.

BibTeX Reference
@article{Xiao_2003_4504,
   author = "Jing Xiao and Tsuyoshi Moriyama and Takeo Kanade and Jeffrey Cohn",
   title = "Robust Full-Motion Recovery of Head by Dynamic Templates and Re-registration Techniques",
   journal = "International Journal of Imaging Systems and Technology",
   pages = "85--94",
   month = "September",
   year = "2003",
   volume = "13",
}