I am a PhD candidate in Computer Engineering at UIUC advised by Ravishankar K. Iyer in the DEPEND research group. My research interests are at the intersection of computer systems and machine learning, with a focus on improving the dependability and efficiency of large-scale systems.
During my PhD, I was fortunate to intern and collaborate with amazing mentors at IBM Research, Google Network Infrastructure, and Meta AI Infrastructure.
I am honored to be selected as a 2025 Machine Learning and Systems Rising Star.
Selected Publications
QLM: Queue Management for SLO-oriented Large Language Model Serving.
SoCC 2024.
- Integrated into vLLM v0.6.2+ (release notes)
- Used in production at ByteDance (blog), IBM, Snowflake (presentation), and others