GSIR: Generalizable 3D Shape Interpretation and Reconstruction - Robotics Institute Carnegie Mellon University

GSIR: Generalizable 3D Shape Interpretation and Reconstruction

Conference Paper, Proceedings of (ECCV) European Conference on Computer Vision, pp. 498 - 514, August, 2020

Abstract

3D shape interpretation and reconstruction are closely related to each other but have long been studied separately and often end up with priors that are highly biased towards the training classes. In this paper, we present an algorithm, Generalizable 3D Shape Interpretation and Reconstruction (GSIR), designed to jointly learn these two tasks to capture generic, class-agnostic shape priors for a better understanding of 3D geometry. We propose to recover 3D shape structures as cuboids from partial reconstruction and use the predicted structures to further guide full 3D reconstruction. The unified framework is trained simultaneously offline to learn a generic notion and can be fine-tuned online for specific objects without any annotations. Extensive experiments on both synthetic and real data demonstrate that introducing 3D shape interpretation improves the performance of single image 3D reconstruction and vice versa, achieving the state-of-the-art performance on both tasks for objects in both seen and unseen categories.

BibTeX

@conference{Wang-2020-126837,
author = {Jianren Wang and Zhaoyuan Fang},
title = {GSIR: Generalizable 3D Shape Interpretation and Reconstruction},
booktitle = {Proceedings of (ECCV) European Conference on Computer Vision},
year = {2020},
month = {August},
pages = {498 - 514},
}