/Cut Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Cut Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Debidatta Dwibedi, Ishan Misra and Martial Hebert
Conference Paper, International Conference on Computer Vision, October, 2017

Download Publication (PDF)

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.


A major impediment in rapidly deploying object detection models for instance detection is the lack of large annotated datasets. For example, finding a large labeled dataset containing instances in a particular kitchen is unlikely. Each new environment with new instances requires expensive data collection and annotation. In this paper, we propose a simple approach to generate large annotated instance datasets with minimal effort. Our key insight is that ensuring only patch-level realism provides enough training signal for current object detector models. We automatically `cut’ object instances and `paste’ them on random backgrounds. A naive way to do this results in pixel artifacts which result in poor performance for trained models. We show how to make detectors ignore these artifacts during training and generate data that gives competitive performance on real data. Our method outperforms existing synthesis approaches and when combined with real images improves relative performance by more than 21%on benchmark datasets. In a cross-domain setting, our synthetic data combined with just 10% real data outperforms models trained on all real data.

Associated Lab - Vision and Mobile Robotics Lab, Associated Center - Vision and Autonomous Systems Center (VASC)

BibTeX Reference
author = {Debidatta Dwibedi and Ishan Misra and Martial Hebert},
title = {Cut Paste and Learn: Surprisingly Easy Synthesis for Instance Detection},
booktitle = {International Conference on Computer Vision},
year = {2017},
month = {October},
keywords = {computer vision, object detection, image synthesis, instance detection},