Events from January 20, 2017 – May 19, 2026 – Robotics Institute Carnegie Mellon University
2026-05-19T00:00:00-04:00
  • PhD Thesis Defense
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    3D Video Models through Point Tracking, Reconstructing, and Forecasting

    NSH 4305

    Abstract: This thesis advances 3D video understanding by bridging reconstruction and dynamics forecasting from monocular video, with applications in robotics, autonomy, and immersive environments. We introduce a novel pipeline that translates 2D video into 4D scenes by combining object-centric tracking, learned 2D view synthesis priors, and Gaussian splatting, enabling accurate geometry and motion recovery even [...]

    PhD Thesis Defense
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    Unified 3D Perception and Generative Control for Generalist Robots

    Newell-Simon Hall 4305

    Abstract: To build robot generalists, we need models that can operate across diverse tasks, scenes, and embodiments. While recent efforts scale data and model capacity and incorporate expressive generative objectives, most still rely on 2D inputs to predict inherently 3D actions—introducing a mismatch between perception and control. In my thesis, I explore how unifying 3D [...]

    Faculty Events
    Assistant Professor
    Robotics Institute,
    Carnegie Mellon University

    Towards Open World Robot Safety

    Newell-Simon Hall 4305

    Abstract:  Robot safety is a nuanced concept. We commonly equate safety with collision-avoidance, but in complex, real-world environments (i.e., the "open world'') it can be much more: for example, a mobile manipulator should understand when it is not confident about a requested task, that areas roped off by caution tape should never be breached, and that [...]

    PhD Thesis Defense
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    Reachable Sets for Control and Planning: from Reactive Safety to Contact-Rich Manipulation

    GHC 4215

    Abstract: Robots are increasingly deployed in settings where safety, performance, or both are mission-critical—from agile aerial vehicles avoiding collisions at high speed to manipulators executing intricate, contact-rich tasks. In my thesis, I present a unifying approach to these seemingly disparate challenges through the lens of reachable sets, a versatile but underutilized computational primitive in robotics. In [...]

    MSR Thesis Presentation
    MSR Student
    Robotics Institute,
    Carnegie Mellon University

    Toward Personalized Assistive Systems: Leveraging Large Language Models for Prediction and Intervention

    GHC 6501

    Abstract: Many older adults, particularly those with Mild Cognitive Impairments (MCI) struggle with complex, sequential tasks such as meal preparation. In this thesis, we present a framework for personalized sequence prediction and assistance detection during meal preparation to support older adults, particularly those with Mild Cognitive Impairments (MCI). By leveraging the reasoning capabilities of large language [...]

    MSR Thesis Presentation
    MSR Student / Research Assistant
    Robotics Institute,
    Carnegie Mellon University

    Towards Generalizable Robotic Policies via Data-Driven Learning at Scale

    Newell-Simon Hall 4305

    Abstract: To enable robots to operate seamlessly in complex, real-world environments, they must master fine-grained manipulation skills and exhibit robust, adaptive behavior across diverse environments. This thesis explores a data-driven approach to learning generalizable and reactive manipulation policies by leveraging efficient data generation pipelines and expressive neural models. We first introduce BiDex, a low-cost teleoperation [...]

    PhD Thesis Defense
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    Dataset-Driven and Generative Approaches to Domain Generalization in Human-Centric Vision

    GHC 6121

    Abstract: Human-centered computer vision technology relies heavily on large, diverse datasets, yet even the largest collections cannot fully capture the variability of human appearance, motion, and viewpoint. At the same time, collecting data from human subjects is time-consuming, labor-intensive, and raises privacy concerns. To overcome these challenges while maintaining efficiency, researchers increasingly turn to two [...]

    PhD Thesis Defense
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    Influence-Aware Safety for Human-Robot Interaction

    Newell-Simon Hall 3305

    Abstract: In recent years, we have seen how influential (and potentially harmful) algorithms can be in our lives through recommender systems and language models; sometimes creating polarization and conspiracies that lead to unsafe behavior. Now that robots are also growing more common in the real world, we must be very careful to ensure that AI-driven [...]

    PhD Thesis Proposal
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    Scalable Sim-to-Real Learning for General-Purpose Humanoid Skills

    GHC 4405

    Abstract: Humanoids represent the most versatile robotic platform, capable of walking, manipulating, and collaborating with people in human-centered environments. Yet, despite recent advances, building humanoids that can operate reliably in the real world remains a fundamental challenge. Progress has been hindered by difficulties in whole-body control, robust perceptive reasoning, and bridging the sim-to-real gap. This [...]

  • MSR Thesis Presentation
    PhD Student
    Robotics Institute,
    Carnegie Mellon University

    Object-Centric Grounding for Deployable and Interactive Vision-Language Navigation Agents

    Newell-Simon Hall 4305

    Abstract: Robots that operate in human-centric environments must integrate perception, reasoning, and action across multiple modalities to complete tasks according to user instructions. For these robots, being able to navigate according to a natural language instruction about the environment is an important capability, which requires 3D spatial reasoning, semantic scene understanding, and the ability to [...]