Yosemite National Park, CA
2nd June 2024
Credit to Rebecca Sun

Haotian Xue 薛昊天 /haʊtiˈæn ; ʃweɪ/

Email: htxue.ai [at] gatech [dot] edu

Office: CODA E1011, 756 W Peachtree St NW, Atlanta, GA 30332

I am currently a 3rd-year ML Ph.D. student at Microsoft Logo ML@GaTech, advised by Dr. Yongxin Chen. Previously, I obtained my B.E. of Computer Science from Microsoft Logo Shanghai Jiao Tong University with honor in Zhiyuan College in 2022.

I did my research intern at Nvidia Logo Nvidia (24 Summer) and Microsoft Logo Microsoft Research Asia (21 Winter). I was a Visiting Student (2021) at MIT Logo MIT CSAIL.

📢 Feel free to contact me if you want to discuss or collaborate! If you want to do research under my mentorship, please refer to [Supervision Guidance].

News 📢
  • [2024.09] 3 Papers (DP-Attacker, RefDrop, QueST) are accepted to NeurIPS 2024 !
  • [2024.05] I started as a summer research intern at Nvidia DIR Group !
  • [2024.05] We propose DP-Attacker, a framework to attack Diffusion-based policy generator.
  • [2024.04] We release PDM-Pure, a universal purifier for protection against diffusion models
  • [2024.03] I received the ICLR'2024 travel award, thanks!
  • [2024.01] Our Diff-Protect is accepted by ICLR'2024!
  • [2023.10] I was awarded the NeurIPS'2023 scholar award, thanks!
  • [2023.10] I was invited as a reviewer for TPAMI.
  • [2023.10] We propose Diff-Protect, a more effective protection framework against AI mimicry.
  • [2023.09] Diff-PGD and 3D-IntPhys are accepted by NeurIPS'2023!
  • [2023.08] I was invited as a reviewer for ICLR'2024.
  • [2023.05] We propose Diff-PGD, a diffusion-based adv-sample generation framework.
  • [2022.12] I am selected as the Top Reviewer of NeurIPS 2022.
  • [2022.10] Our Distance-Transformer is accepted to EMNLP2022 Findings.
  • [2022.08] I will start as a PhD student at (ML@GT) starting from 2022Fall.
Research Interest

My research interest lies in broad aspects of Machine Learning topics such as Computer Vision and Natural Language Processing. Currently, I focus on the following topics:

  • Generative Models (e.g. diffusion models, auto-regressive generative models)
  • Generalized Embodied AI(e.g. diffusion policy, vision-language models)
  • Safety and Robustness of GenAI(e.g. adversarial attacks, protection of diffusion models)
Reviewer Experience

I have reviewed 40+ papers for ML conferences like:

  • NeurIPS@22/23/24/25
  • ICLR@24/25
  • AISTATS@25
  • ICML@22/23/24

Publications ( show selected / show all by date / show all by topic )

Topics: Vision / NLP/ Robot Learning/ GenAI/ Explainable AI / (*/†: indicates equal contribution.)

RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
Jiaojiao Fan, Haotian Xue, Qinsheng, Zhang, Yongxin Chen

[NeurIPS 2024] [Project Website]

QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
Arthava Mete, Haotian Xue, Albert Wilcox, Yongxin Chen, Animesh Garg

[NeurIPS 2024] [Webpage] [GitHub]

Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies
Yipu Chen* , Haotian Xue*, Yongxin Chen

[NeurIPS 2024] [Project Website]

Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think
Haotian Xue, Yongxin Chen

[Arxiv] [GitHub]

[Abridged in NeurIPS 2024:SafeGenAI]

Towards More Effective Protection Against Diffusion-Based Mimicry with Score Distillation
Haotian Xue, Chumeng Liang*, Xiaoyu Wu*, Yongxin Chen

[ICLR 2024] [GitHub] [Poster]

Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability
Haotian Xue, Alexandre Araujo, Bin Hu, Yongxin Chen

[NeurIPS 2023] [GitHub] [Poster]

3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes
Haotian Xue, Antonio Torralba, Joshua B. Tenenbaum, Daniel LK Yamins, Yunzhu Li, Hsiao-Yu Tung

[NeurIPS 2023] [AIhub] [Poster]

[Abridged in CVPR 2023 3DVR & Precognition]

Syntax-guided Localized Self-attention by Constituency Syntactic Distance
Shengyuan Hou*, Jushi Kai*, Haotian Xue*, Bingyu Zhu, Bo Yuan, Longtao Huang, Xinbin Wang, Zhouhan Lin

[EMNLP2022, Findings] [GitHub]

Learning to Adaptively Incorporate External Syntax through Gated Self-Attention
[TBA]

[In Submission]

A Hypothesis For The Cognitive Difficulty of Images
Xu Chen, Xin Wang, Haotian Xue, Zhengyang Liang, Xin Jin, Quanshi Zhang

[Arxiv]

Evaluation of Attribution Explanations without Ground Truth
Hao Zhang, Haotian Xue, Jiayi Chen, Yiting Chen, Wen Shen, Quanshi Zhang

[OpenReview]

Active Adversarial Learning
Haotian Xue,

Advisor: Nanyang Ye

[Bachelor Thesis]