Which Edges Matter?

Aayush Bansal, Adarsh Kowdle, Devi Parikh, Andrew Gallagher, and Charles Zitnick
2013 IEEE International Conference on Computer Vision Workshop on 3D Representation and Recognition, December 2013.



Abstract
In this paper, we investigate the ability of humans to recognize objects using different types of edges. Edges arise in images because of several different physical phenomena, such as shadow boundaries, changes in material albedo or reflectance, changes in surface normals, and occlusion boundaries. By constructing synthetic photorealistic scenes, we control which edges are visible in a rendered image and study how each edge type affects human visual recognition. We evaluate the information conveyed by each edge type through human studies on object recognition tasks. We find that edges related to surface normals and depth are the most informative, while texture and shadow edges can confuse recognition. This work corroborates recent advances in practical vision systems, where active sensors capture depth edges (e.g., Microsoft Kinect), as well as in edge detection, where progress is being made towards finding object boundaries instead of just pixel gradients. Further, we evaluate seven standard and state-of-the-art edge detectors based on the types of edges they find, by comparing the detected edges with known informative edges in the synthetic scene. We suggest that this evaluation method could lead to more informed metrics for gauging developments in edge detection, without requiring any human labeling. In summary, this work shows that human proficiency at object recognition is due to surface normal and depth edges, and suggests that future research should focus on explicitly modeling edge types to increase the likelihood of finding informative edges.
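The detector evaluation mentioned in the abstract compares detected edges with edges of known type in the synthetic scene. The following is only a minimal sketch of that idea, not the paper's actual metric: it assumes edges are given as sets of (row, col) pixels, that a ground-truth pixel counts as found when a detection lies within a small tolerance, and that per-type recall is the quantity of interest. The names `dilate`, `recall_by_edge_type`, and `tol` are hypothetical and do not come from the paper.

```python
from typing import Dict, Set, Tuple

Pixel = Tuple[int, int]  # (row, col)


def dilate(pixels: Set[Pixel], tol: int) -> Set[Pixel]:
    """Return every pixel within Chebyshev distance `tol` of the input set."""
    return {
        (r + dr, c + dc)
        for (r, c) in pixels
        for dr in range(-tol, tol + 1)
        for dc in range(-tol, tol + 1)
    }


def recall_by_edge_type(
    detected: Set[Pixel],
    truth_by_type: Dict[str, Set[Pixel]],
    tol: int = 1,
) -> Dict[str, float]:
    """Fraction of each type's ground-truth edge pixels that lie near a detection.

    Assumption: an empty ground-truth set scores 0.0 rather than raising.
    """
    near_detection = dilate(detected, tol)
    return {
        name: (len(truth & near_detection) / len(truth)) if truth else 0.0
        for name, truth in truth_by_type.items()
    }


# Toy scene (made up for illustration): a vertical depth edge at column 2
# and a horizontal shadow edge at row 5.
truth = {
    "depth": {(r, 2) for r in range(4)},
    "shadow": {(5, c) for c in range(4)},
}
# Hypothetical detector output: the depth edge is found one pixel off,
# and only the first pixel of the shadow edge is detected.
detected = {(r, 3) for r in range(4)} | {(5, 0)}

scores = recall_by_edge_type(detected, truth, tol=1)
print(scores)
```

Because the edge types are known from the renderer, a score like this needs no human labeling; a detector with high recall on depth and surface normal edges and low recall on texture and shadow edges would, under the abstract's findings, be favoring the informative edges.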

Text Reference
Aayush Bansal, Adarsh Kowdle, Devi Parikh, Andrew Gallagher, and Charles Zitnick, "Which Edges Matter?," 2013 IEEE International Conference on Computer Vision Workshop on 3D Representation and Recognition, December 2013.

BibTeX Reference
@inproceedings{Bansal_2013_7504,
   author = "Aayush Bansal and Adarsh Kowdle and Devi Parikh and Andrew Gallagher and Charles Zitnick",
   title = "Which Edges Matter?",
   booktitle = "2013 IEEE International Conference on Computer Vision Workshop on 3D Representation and Recognition",
   month = "December",
   year = "2013",
}