Transforming and Projecting Images into Class-conditional Generative Networks - Robotics Institute Carnegie Mellon University

Transforming and Projecting Images into Class-conditional Generative Networks

Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, and Aaron Hertzmann
Conference Paper, Proceedings of (ECCV) European Conference on Computer Vision, pp. 17 - 34, August, 2020

Abstract

We present a method for projecting an input image into the space of a class-conditional generative neural network. We propose a method that optimizes for transformation to counteract the model biases in generative neural networks. Specifically, we demonstrate that one can solve for image translation, scale, and global color transformation, during the projection optimization to address the object-center bias and color bias of a Generative Adversarial Network. This projection process poses a difficult optimization problem, and purely gradient-based optimizations fail to find good solutions. We describe a hybrid optimization strategy that finds good projections by estimating transformations and class parameters. We show the effectiveness of our method on real images and further demonstrate how the corresponding projections lead to better editability of these images. The project page and the code is available at https://minyoungg.github.io/GAN-Transform-and-Project/.

Notes
We thank David Bau, Phillip Isola, Lucy Chai, and Erik Härkönen for discussions, and David Bau for encoder training code.

BibTeX

@conference{Huh-2020-125671,
author = {Minyoung Huh and Richard Zhang and Jun-Yan Zhu and Sylvain Paris and Aaron Hertzmann},
title = {Transforming and Projecting Images into Class-conditional Generative Networks},
booktitle = {Proceedings of (ECCV) European Conference on Computer Vision},
year = {2020},
month = {August},
pages = {17 - 34},
}