Designing Deep Networks for Surface Normal Estimation

Xiaolong Wang, David F. Fouhey, and Abhinav Gupta

Conference Paper, Proceedings of (CVPR) Computer Vision and Pattern Recognition, pp. 539 - 547, June, 2015

Abstract

In the past few years, convolutional neural nets (CNN) have shown incredible promise for learning visual representations. In this paper, we use CNNs for the task of predicting surface normals from a single image. But what is the right architecture we should use? We propose to build upon the decades of hard work in 3D scene understanding, to design new CNN architecture for the task of surface normal estimation. We show by incorporating several constraints (man-made, manhattan world) and meaningful intermediate representations (room layout, edge labels) in the architecture leads to state of the art performance on surface normal estimation. We also show that our network is quite robust and show state of the art results on other datasets as well without any fine-tuning.

BibTeX

@conference{Wang-2015-113349,
author = {Xiaolong Wang and David F. Fouhey and Abhinav Gupta},
title = {Designing Deep Networks for Surface Normal Estimation},
booktitle = {Proceedings of (CVPR) Computer Vision and Pattern Recognition},
year = {2015},
month = {June},
pages = {539 - 547},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.