Name: Synthetic Data for Object Detection: Improving Robustness and Revealing Vulnerabilities
Start: 2026-04-23T15:00:00-04:00
End: 2026-04-23T16:00:00-04:00
Location: Newell-Simon Hall 3305

Xiao Fang MSR Student Robotics Institute,
Carnegie Mellon University

Thursday, April 23
3:00 pm to 4:00 pm
Newell-Simon Hall 3305

Synthetic Data for Object Detection: Improving Robustness and Revealing Vulnerabilities

Abstract:

Synthetic data has emerged as a promising solution to the growing challenges of data acquisition in object detection. Modern detectors rely heavily on large-scale annotated datasets, yet collecting real-world data with high-quality labels is often costly, labor-intensive, and impractical in diverse or rare scenarios. By enabling controllable generation with automatic annotations, synthetic data provides a scalable alternative. In this thesis, we investigate its two complementary roles: improving robustness under domain shifts and revealing fundamental vulnerabilities of object detection models.

First, we propose a synthetic data generation framework based on diffusion models to bridge distribution gaps between source and target domains in aerial imagery. By synthesizing high-quality images and corresponding annotations through cross-attention-guided labeling and multi-stage knowledge transfer, our approach significantly improves detection robustness in unseen environments, outperforming supervised learning on source domain data, weakly supervised and unsupervised domain adaptation methods, open-set object detectors, and vision large language models.

Second, we explore the adversarial potential of synthetic data via a controllable image-editing framework for realistic camouflage attacks. By formulating camouflaged adversarial example generation as a conditional image-editing problem, we design image-level and scene-level strategies that produce stealthy, physically plausible camouflages while effectively degrading detector performance. Extensive experiments demonstrate strong attack effectiveness, improved human-perceived stealthiness, and transferability to black-box models and to the physical world.

Together, these complementary perspectives highlight the dual utility of synthetic data for object detection: as a powerful tool for improving robustness under domain shifts and as a principled lens for uncovering model vulnerabilities.

Committee:

Prof. Fernando De la Torre (Advisor)

Prof. Deva Kannan Ramanan

Jianjin Xu

MSR Thesis Presentation

April

Event Navigation

MSR Thesis Presentation

April

Share This Event!

Event Navigation