Haotian Xue

/haʊtiˈæn ; ʃweɪ/

4th-year PhD (Machine Learning), Georgia Tech. Advised by Yongxin Chen.

Email: htxue.ai [at] gatech [dot] edu

Address: CODA E1011, 756 W Peachtree St NW, Atlanta, GA 30332

Links: Google Scholar · GitHub · X (Twitter) · Curriculum Vitae (Nov 2025)

About & Research Interests

I do research in machine learning and computer vision. Currently, I am interested in physics AI regarding multi-modal language models, video diffusion model and world models. I am also committed to safety problem in generative AI, focusing on adversarial protection of diffusion model.

I did research at Adobe Firefly (2025), NVIDIA DIR (2024), and Microsoft Research Asia (2021). I was also a visiting student at MIT CSAIL (2021). I earned my B.E. in Computer Science (Honors) from Shanghai Jiao Tong University (Zhiyuan Honor Program) in 2022.

News

2025-10 We propose MoGAN, a novel post-training to improve motion quality for few-step video diffusion models.
2025-09 We propose PIO-Bench, a visual-grounding-centric benchmark for embodied reasoning of VLMs.
2025-04 Joining Adobe Firefly (Summer+Fall) to work on post-training for video diffusion.
2024-10 NeurIPS 2024 Scholar Award — see you in Vancouver!
2024-09 Three papers accepted to NeurIPS 2024: DP-Attacker, RefDrop, QueST.
2024-05 Started summer research intern at NVIDIA DIR Group.
2024-04 Released PDM-Pure, a universal purifier against diffusion models.
2024-03 ICLR 2024 Travel Award.
2024-01 SDS-Attack accepted to ICLR 2024.
2023-10 NeurIPS 2023 Scholar Award; invited reviewer for TPAMI.
2023-09 Diff-PGD and 3D-IntPhys accepted to NeurIPS 2023.
2023-08 Invited reviewer for ICLR 2024.
2023-05 Proposed Diff-PGD, a diffusion-based adversarial sample framework.
2022-12 Selected as a Top Reviewer of NeurIPS 2022.
2022-10 Distance-Transformer accepted to EMNLP 2022 Findings.
2022-08 Started PhD at ML@GT.

Selected Publications

MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training

Haotian Xue, Qi Chen, Zhonghao Wang, Xun Huang, Eli Shechtman, Jinrong Xie, Yongxin Chen

Arxiv 2025

arXiv · code · project
Point-It-Out: Benchmarking Embodied Reasoning for Vision Language Models in Multi-Stage Visual Grounding

Haotian Xue, Yunhao Ge, Yu Zeng, Max Li, Ming-Yu Liu, Yongxin Chen, Jiaojiao Fan

Arxiv 2025

arXiv · code · project
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance

Jiaojiao Fan, Haotian Xue, Qinsheng Zhang, Yongxin Chen

NeurIPS 2024

arXiv · project
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control

Arthava Mete, Haotian Xue, Albert Wilcox, Yongxin Chen, Animesh Garg

NeurIPS 2024

arXiv · project · code
Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies

Yipu Chen*, Haotian Xue*, Yongxin Chen

NeurIPS 2024

arXiv · project
Towards More Effective Protection Against Diffusion-Based Mimicry with Score Distillation

Haotian Xue, Chumeng Liang*, Xiaoyu Wu*, Yongxin Chen

ICLR 2024

arXiv · code · poster
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think

Haotian Xue, Yongxin Chen

NeurIPSW: SafeGenAI 2024 · arXiv 2024

arXiv · code
Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability

Haotian Xue, Alexandre Araujo, Bin Hu, Yongxin Chen

NeurIPS 2023

arXiv · code · poster
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics

Haotian Xue, Antonio Torralba, Joshua B. Tenenbaum, Daniel L. K. Yamins, Yunzhu Li, Hsiao-Yu Tung

NeurIPS 2023

arXiv · poster

More & older publications

See Google Scholar for the full, latest list.

Reviewer Experience

Conferences

NeurIPS ’22–’25
ICLR ’24–’25
ICML ’22–’25
AISTATS ’25
CVPRW ’25

Journals

TPAMI
TCSVT