Name: Grounded Task Axes: Zero-Shot Semantic Skill Generalization via Task-Axis Controllers and Visual Foundation Models
Start: 2025-11-20T14:00:00-05:00
End: 2025-11-20T15:00:00-05:00
Location: Newell-Simon Hall 3305

Muhammet Yunus Seker PhD Student Robotics Institute,
Carnegie Mellon University

Thursday, November 20
2:00 pm to 3:00 pm
Newell-Simon Hall 3305

Grounded Task Axes: Zero-Shot Semantic Skill Generalization via Task-Axis Controllers and Visual Foundation Models

Abstract:
Transferring skills between different objects remains one of the core challenges of open-world robot manipulation. Generalization needs to take into account the high-level structural differences between distinct objects while still maintaining similar low-level interaction control. In this paper, we propose an example-based zero-shot approach to skill transfer. Rather than treating skills as atomic, we decompose skills into a prioritized list of grounded task-axis (GTA) controllers. Each GTAC defines an adaptable controller, such as a position or force controller, along an axis. Importantly, the GTACs are grounded in object key points and axes, e.g., the relative position of a screw head or the axis of its shaft. Zero-shot transfer is thus achieved by finding semantically similar grounding features on novel target objects. We achieve this example-based grounding of the skills through the use of foundation models, such as SD-DINO, that can detect semantically similar keypoints of objects. We evaluate our framework on real-robot experiments, including screwing, pouring, and spatula scraping tasks, and demonstrate robust and versatile controller transfer for each.

Committee:
Prof. Oliver Kroemer

Prof. Katerina Fragkiadaki

Prof. Zeynep Temel
Mark Lee

PhD Speaking Qualifier

November

Event Navigation

PhD Speaking Qualifier

November

Share This Event!

Event Navigation