4th-year PhD (Machine Learning), Georgia Tech. Advised by Yongxin Chen.
I'm broadly interested in Generative AI across vision and robotics, with a current focus on physical understanding and physical alignment of Vision–Language Models and Video Diffusion Models, through post-training. Earlier in my PhD I worked on adversarial attacks & defenses for GenAI and developed algorithms across diffusion models and diffusion-based policy: Diff-PGD, SDS-Attack, PDM-Pure, and DP-Attacker.
Previously, I completed research internships at
Adobe Firefly (2025),
NVIDIA DIR (2024), and
Microsoft Research Asia (2021).
I was also a visiting student at
MIT CSAIL (2021).
I earned my B.E. in Computer Science (Honors) from
Shanghai Jiao Tong University (Zhiyuan College) in 2022.
Adobe Firefly (Summer+Fall) to work on post-training for video diffusion.
NVIDIA DIR Group.See Google Scholar for the full, latest list.