Synthetic Data for Object Detection: Improving Robustness and Revealing Vulnerabilities - Robotics Institute Carnegie Mellon University
Loading Events

MSR Thesis Presentation

April

23
Thu
Xiao Fang MSR Student Robotics Institute,
Carnegie Mellon University
Thursday, April 23
3:00 pm to 4:00 pm
Newell-Simon Hall 3305
Synthetic Data for Object Detection: Improving Robustness and Revealing Vulnerabilities
Abstract:
Synthetic data has emerged as a promising solution to the growing challenges of data acquisition in object detection. Modern detectors rely heavily on large-scale annotated datasets, yet collecting real-world data with high-quality labels is often costly, labor-intensive, and impractical in diverse or rare scenarios. By enabling controllable generation with automatic annotations, synthetic data provides a scalable alternative. In this thesis, we investigate its two complementary roles: improving robustness under domain shifts and revealing fundamental vulnerabilities of object detection models.

First, we propose a synthetic data generation framework based on diffusion models to bridge distribution gaps between source and target domains in aerial imagery. By synthesizing high-quality images and corresponding annotations through cross-attention-guided labeling and multi-stage knowledge transfer, our approach significantly improves detection robustness in unseen environments, outperforming supervised learning on source domain data, weakly supervised and unsupervised domain adaptation methods, open-set object detectors, and vision large language models.
Second, we explore the adversarial potential of synthetic data via a controllable image-editing framework for realistic camouflage attacks. By formulating camouflaged adversarial example generation as a conditional image-editing problem, we design image-level and scene-level strategies that produce stealthy, physically plausible camouflages while effectively degrading detector performance. Extensive experiments demonstrate strong attack effectiveness, improved human-perceived stealthiness, and transferability to black-box models and to the physical world.
Together, these complementary perspectives highlight the dual utility of synthetic data for object detection: as a powerful tool for improving robustness under domain shifts and as a principled lens for uncovering model vulnerabilities.
Committee:
Prof. Fernando De la Torre (Advisor)
Prof. Deva Kannan Ramanan
Jianjin Xu