PhD Thesis Proposal
Toward Aligned Vision Models
Abstract: Modern vision and vision–language models (VLMs) achieve remarkable perceptual performance, yet their internal representations are often misaligned with human-understandable concepts, clinical reasoning, or the causal structure of data. Such misalignment limits trust, generalization, and safety, particularly in high-stakes domains such as medical imaging. This thesis proposes a comprehensive framework for model alignment, developing methods [...]
Learning Dynamic and Competitive Human Skills and Strategies for Animation and Robotics
Abstract: Humanoid control in animation and robotics requires physically realistic motion as well as the ability to adapt, coordinate actions over time, and make decisions in response to changing environments and other agents. Human motion data provides a powerful source of prior knowledge for learning natural and stable movement, but many existing approaches rely on [...]