Carnegie Mellon University
Face View Synthesis Using A Single Image

Jiang Ni
doctoral dissertation, tech. report CMU-RI-TR-07-38, Robotics Institute, Carnegie Mellon University, November, 2007

  • Adobe portable document format (pdf) (3MB)
Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Face view synthesis involves using one view of a face to artificially render another view. It is an interesting problem in computer vision and computer graphics, and can be applied in the entertainment industry for animated movies and video games. The fact that the input is only a single image, makes the problem very difficult. Previous approaches learn a linear model on pair of poses from 2D training data and then predict the unknown pose in the test example. Such 2D approaches are much more practical than approaches requiring 3D data and more computationally efficient. However they perform inadequately when dealing with large angles between poses. In this thesis, we seek to improve performance through better choices in probabilistic modeling. As a first step, we have implemented a statistical model combining distance in feature space (DIFS) and distance from feature space (DFFS) [27] for such pair of poses. Such a representation leads to better performance. As a second step, we model the relationship between the poses using a Bayesian network. This representation takes advantage of the sparse statistical structure of faces. In particular, we have observed that a given pixel is often statistically correlated with only a small number of other pixel variables. The Bayesian network provides a concise representation for this behavior reducing the susceptibility to over-fitting. Compared with the linear method, the Bayesian network more accurately predicts small and localized features. i

Number of pages: 90

Text Reference
Jiang Ni, "Face View Synthesis Using A Single Image," doctoral dissertation, tech. report CMU-RI-TR-07-38, Robotics Institute, Carnegie Mellon University, November, 2007

BibTeX Reference
   author = "Jiang Ni",
   title = "Face View Synthesis Using A Single Image",
   booktitle = "",
   school = "Robotics Institute, Carnegie Mellon University",
   month = "November",
   year = "2007",
   number= "CMU-RI-TR-07-38",
   address= "Pittsburgh, PA",