RI PhD Thesis Proposal - Xinyu (Rachel) Li - Robotics Institute Carnegie Mellon University
Loading Events

PhD Thesis Proposal

April

13
Mon
Xinyu Li PhD Student Robotics Institute,
Carnegie Mellon University
Monday, April 13
3:15 pm to 5:00 pm
GHC 6121
RI PhD Thesis Proposal – Xinyu (Rachel) Li
Date: April 13, 2026
Time: 03:15 PM (ET)
Location: GHC 6121
Zoom Link

Type: Ph.D. Thesis Proposal

Who: Xinyu (Rachel) Li
Title: Towards Accessible AI Agents
 
Abstract:
Empowered by large language models (LLMs), AI agents have shown strong potential across tasks such as general-purpose assistance, software coding, and scientific research. However, their practical utility in applications involving consequential decisions such as healthcare, remains constrained by three major challenges.

Evaluation. Existing agent evaluations often focus on well-structured tasks and final outcomes, failing to fully capture the complexity of real-world workflows. We propose evaluation frameworks grounded in realistic machine learning engineering workflows, providing skill-based, multi-artifact, and holistic assessments that systematically evaluate the practical utility of AI agents.

Learning. Improving LLMs for agentic use typically relies on reinforcement learning with large amounts of high-quality labeled data, which are costly and difficult to obtain in expert domains including healthcare. To address this limitation, we aim to develop learning frameworks that require minimal external supervision, improving the scalability and efficiency of agent learning.

Specialization. AI agents typically follow a one-size-fits-all paradigm at the time of deployment, lacking mechanisms to account for task-specific or user-specific requirements. We propose methods that enable agent specialization for downstream tasks and users, expanding their applicability across heterogeneous deployment settings.

This thesis aims to make AI agents more broadly accessible and impactful in important real-world applications by enhancing their practical utility, making them more measurable, more capable, and better tailored to the needs of their users and applications.

Thesis committee members:
Artur Dubrawski (Chair)
Andrea Bajcsy
Barnabás Póczos
Daniel McDuff (Google)