Object-Centric Goal Prediction Towards Precise, Generalizable Placement - Robotics Institute Carnegie Mellon University
Loading Events

MSR Thesis Defense

July

14
Mon
Eric Cai MSR Student Robotics Institute,
Carnegie Mellon University
Monday, July 14
10:30 am to 11:30 am
Newell-Simon Hall 3305
Object-Centric Goal Prediction Towards Precise, Generalizable Placement
Abstract:
Recent advances in robotic manipulation have established the effectiveness of learning from demonstration. However, while end-to-end policies excel in expressivity and flexibility for complex skill learning, they struggle to efficiently adapt to task variations in geometry and configuration. Conversely, alternative approaches relying on object-centric goal prediction offer improved generalizability, but rely on restrictive assumptions regarding object rigidity, scene complexity, and task execution.

In this thesis, we present two works with the aim of bridging this gap. In the first work – TAX3D (Non-rigid Relative Placement through 3D Dense Diffusion) – we extend the principles of object-centric reasoning methods to deformable transformations by modeling geometric relationships through dense diffusion. In the second – TAX3Dv2 (Object-Centric Point Diffusion for 3D Goal Prediction) – we further generalize our method into a hierarchical goal-prediction framework for object placement. Specifically, we model global scene-level placements through a novel Dense Gaussian Mixture Model (GMM), and local object-level configurations through a disentangled diffusion objective. By decoupling placement prediction into these two stages, our method generalizes effectively to scene multi-modality while supporting high-precision and non-rigid placements. Importantly, we also show that our method can easily be incorporated into policy learning with dramatic implications for performance and sample efficiency, circumventing the need for task-specific primitives typically present in object-centric goal prediction methods. We validate our approach across a suite of challenging tasks in simulation and the real world, demonstrating strong performance in both rigid and non-rigid settings.

Committee:
Prof. David Held (Chair)
Prof. Shubham Tulsiani
Bardienus Duisterhof