PhD Thesis Defense
Robust Inverse Rendering with Physics-Based Light Transport and Active Sensors
Abstract: Inverse rendering is the process of recovering the shape, materials, and lighting conditions of an environment from a set of images. Both this process as a whole and its individual components are fundamental to applications ranging from medical imaging to astronomy, and from AR/VR to embodied intelligence. In the thesis work discussed in this [...]
Communication Efficient and Differentially Private Optimization
Abstract: In modern machine learning, the abundance of data generated across diverse and distributed sources has made distributed training a central paradigm, particularly in large-scale applications such as Federated Learning. However, two key challenges arise in distributed training: ensuring communication efficiency and preserving the privacy of sensitive data used during training. This thesis addresses these [...]
Learning Generalizable Robot Skills for Dynamic and Interactive Tasks
Abstract: Recent years have seen growing interest in developing robots capable of lifelong reliable operation in human-centric environments. Despite impressive recent progress towards long-horizon tasks such as laundry folding, current efforts are predominantly focused on quasi-static tasks in structured settings. General-purpose assistive robots should be capable of performing a wider range of dynamic and dexterous [...]
Towards Robotic Convoying in Unstructured Environments
Abstract: Multi-agent robotic teaming is the only realistic solution to many large-scale autonomous operations. Conventionally, operations are modeled as a set of tasks that are largely decoupled from each other and the environment at execution time. However, this operational model fails when the successful execution of a task requires multiple agents to synchronize their actions [...]
Learning to Create 3D Content
Abstract: With the popularity of Virtual Reality (VR), Augmented Reality (AR), and other 3D applications, developing methods that let everyday users capture and create their own 3D content has become increasingly essential. However, current 3D creation pipelines often require either tedious manual effort or specialized capture setups. Additionally, resulting assets often suffer from baked-in lighting, [...]
Learning From People: Assistive Robotics and Optimization from Preferences
Abstract: Robotic algorithms rarely come perfectly pre-configured, and when choosing parameters, tradeoffs must often be made: between performance and robustness; efficiency and safety; the comfort of the user and the comfort of bystanders. While engineers can tune parameters by hand or carefully design reward functions to optimize over, this is not always a straightforward task. [...]
Advancing Multimodal Sensing and Robotic Interfaces for Chronic Care
Abstract: The healthcare system prioritizes reactive care for acute illnesses, often overlooking the ongoing needs of individuals with chronic conditions that require long-term management and personalized care. Addressing this gap through technology can empower patients to better manage their conditions, greatly enhancing quality of life and independence. Multimodal sensing, incorporating inertial, acoustic, and vision-based sensors, [...]
Vision-based Human Motion Modeling and Analysis
Abstract: Modern computer vision has achieved remarkable success in tasks such as detecting, segmenting, and estimating human pose in images and videos—often reaching or even surpassing human-level performance. However, significant challenges remain in predicting and analyzing future human motion. This thesis explores how vision-based methods can improve the fidelity and accuracy of human motion modeling [...]
Building richer 3D maps: utilizing a hybrid geometry representation and auxiliary inputs in neural surface reconstruction
Abstract: As robots are increasingly deployed in real-world environments, their perception systems face growing demands. Tasks such as tracking and manipulation require maps with both high spatial fidelity and detailed object-level organization, which must be delivered faster to support timely decision-making and control. Concurrently, advances in vision foundation models allow us to build powerful prediction [...]
Lowering Barriers in Human-Robot Communication
Abstract: For robots to collaborate naturally in homes, they must interpret diverse forms of human expression - visual gestures, natural language instructions, environmental context - and translate them into actions. Existing robot policies typically rely on structured language goals and static visual observations, which restricts both the sensory context and the ways users can specify [...]