RI PhD Thesis Defense - Brian Yang - Robotics Institute Carnegie Mellon University
Loading Events

PhD Thesis Defense

March

30
Mon
Brian Yang PhD Student Robotics Institute,
Carnegie Mellon University
Monday, March 30
10:00 am to 12:00 pm
Newell-Simon Hall 3305
RI PhD Thesis Defense – Brian Yang
Date: 30th March 2026
Time: 10:00 a.m. (ET)
Location: NSH 3305
Zoom
Link
Type: Ph.D. Thesis Defense
Who: Brian Yang
Title
A Language-Guided Generative Data Engine for Autonomous Driving
 
Abstract:
Autonomous driving systems struggle with the long tail of rare and safety-critical scenarios that are sparsely represented in real-world data. While recent learning-based planners perform well on common situations, their reliability degrades in edge cases, and existing approaches based on simulation or world modeling remain costly and difficult to scale.

This thesis proposes a complementary paradigm for scaling driving policies by leveraging language supervision and generative models as a unified data engine. Rather than relying solely on logged demonstrations, we use vision-language models to infer structured constraints and reward functions from high-level specifications of desired behavior. These programmatic rewards are then used to guide trajectory diffusion models, enabling the generation of diverse, physically consistent, and behaviorally targeted driving trajectories.

We first show that combining pretrained diffusion policies with reward guidance enables zero-shot instruction following, without requiring language-annotated driving data. We then demonstrate how guided diffusion can be used as a scalable offline data generator, producing counterfactual trajectories that expand coverage of rare and safety-critical scenarios. Policies distilled from this synthetic data outperform those trained purely on logged datasets and no longer require test-time guidance. Finally, we close the loop by learning reward programs automatically via vision-language feedback, enabling iterative improvement of both the data generator and the policy.

Together, this work presents a scalable framework in which language specifies what should happen, and generative models produce how it happens, yielding a practical path toward robust driving policies without exhaustive real-world data collection or high-fidelity simulation.


Thesis Committee Members:
Katerina Fragkiadaki (co-chair)
Jeff Schneider (co-chair)
Maxim Likhachev
Philipp Krähenbühl (UT Austin)