Personal Information

I am a PhD candidate in the Department of Computer Science at Purdue University, advised by Ruqi Zhang. I completed my BE in computer science and technology at Tianjin University, advised by Changqing Zhang.

Research Interests

I am interested in statistical frameworks for stable and efficient ML algorithms. Recently, I have focused on reinforcement learning in LLM post-training, especially the exploration boundary of complex reasoning tasks. I am also broadly interested in preference alignment, (multimodal) LLM safety, and Bayesian deep learning.

News

[04/30/2026] 1 paper accepted by ICML 2026

[04/06/2026] 1 paper accepted by ACL 2026

[03/23/2026] Started an internship at Apple MLR

[01/26/2026] 1 paper accepted by ICLR 2026

[08/20/2025] 1 short paper accepted by EMNLP 2025

[07/08/2025] 2 papers accepted by COLM 2025