Enriching Visual Knowledge Bases via Object Discovery and Segmentation

Xinlei Chen, Abhinav Shrivastava, and Abhinav Gupta
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), March, 2014.


Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract
There have been some recent efforts to build visual knowledge bases from Internet images. But most of these approaches have focused on bounding box representation of objects. In this paper, we propose to enrich these knowledge bases by automatically discovering objects and their segmentations from noisy Internet images. Specifically, our approach combines the power of generative modeling for segmentation with the effectiveness of discriminative models for detection. The key idea behind our approach is to learn and exploit top-down segmentation priors based on visual subcategories. The strong priors learned from these visual subcategories are then combined with discriminatively trained detectors and bottom-up cues to produce clean object segmentations. Our experimental results indicate state-of-the-art performance on the difficult dataset introduced by Rubinstein et al. We have integrated our algorithm in NEIL for enriching its knowledge base. As of 14th April 2014, NEIL has automatically generated approximately 500K segmentations using web data.

Text Reference
Xinlei Chen, Abhinav Shrivastava, and Abhinav Gupta, "Enriching Visual Knowledge Bases via Object Discovery and Segmentation," IEEE Conference on Computer Vision and Pattern Recognition (CVPR), March, 2014.

BibTeX Reference
@inproceedings{Shrivastava_2014_7581,
   author = "Xinlei Chen and Abhinav Shrivastava and Abhinav Gupta",
   title = "Enriching Visual Knowledge Bases via Object Discovery and Segmentation",
   booktitle = "IEEE Conference on Computer Vision and Pattern Recognition (CVPR)",
   month = "March",
   year = "2014",
}